Desired sample proportion/minimum sample size in each group if we want to compare them?

05 May 2019 22 8K Report

I need to conduct a study in which the treatment group has two levels, say A and B. The sizes of these groups are really different, size of group A is 1 million while the size of B is 20,000. I will use the Welch's test in comparing means and multiple linear regression or weighted least squares in predicting say, cost, hospital stay, etc. Now, my problem is the big gap of sample sizes. My questions are:

1.) I'm planning of getting a subset from the larger group (group A) for the analysis, so that the sample sizes between two groups will not be that large. Is this right?

2.) Is there a minimum required proportion of sample in both group? If so, by how much? What are the disadvantages of this?

Thank you.

Badges
Science topic

More Jane Pads's questions See All

What is the meaning of fractional difference parameter d in ARFIMA model?

I know that the series is stationary and has long memory property if the fractional parameter d is between 0 and 0.5. But is it for stationarity or for testing that the series has a long memory?...

04 May 2016 8,148 5 View

Why does long memory property common in financial and economic fields?

I've read articles, journals and books regarding long memory property of a time series data. And most of it found that, it is common in financial and economic perspective. I know , for example, in...

03 April 2016 4,824 7 View

Where can i download S-PLUS trial version?

i've been searching for the trial version of s-plus in internet but i found none. Anyone please tell me or provide me a link where I can download it?

03 April 2016 5,897 1 View

Arfima forecast codes in R?

Hi, i just want to ask what is wrong with my codes that the forecast plot looks like in the attach file. The codes are these, > x ...

10 November 2015 9,877 0 View

How do I create power series expansion of long memory properties of time series?

Hi. I look at the journal of Hosking in proving the properties of ARFIMA (0,d,0). It says there, when d < 1/2, the power series expansion of (1-z)^(-d) converges for absolute value of z less...

10 November 2015 1,386 3 View

Topics for master's thesis?

Hi! i'm still confused of what problem in statistics should i consider to be included in my study. I want to find a topic for my thesis but i find it hard looking for it. can anyone suggests?

06 July 2015 6,302 5 View

Can I ask for a Randomization topic?

Can anyone give me a topic in statistics that might involve using randomization? just like the one sample test, anova, regression.. But this particular topics were already absorb in my class. So i...

02 March 2015 9,138 8 View

ARCH in R Software?

I used the package "FinTS" in R to test if the residuals have an arch effect by using the function ArchTest(). Now the test is significant, so i want to see if arch(1) is ok. How is this done? Is...

02 March 2015 9,304 1 View

Does anyone know about R codes for Spearman's correlation with randomization?

I am trying to do a randomization test in Spearman's correlation in R. And i find it difficult to do the randomization part.

02 March 2015 192 3 View

Can anyone explain to me the concept of exponential smoothing in time series?

I have read that it can be used in forecasting. so what is the difference of this method and the usual method in forecasting? And also, I don't understand the concept that this method assigns...

09 October 2014 2,181 5 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Is there an alternative to a multinomial regression which allows the DV to be non mutually exclusive?

I am trying to analyse data from a survey examining what variables affect teachers perceived barriers to incorporating technology into their classroom. I have 5 predictor variables however my DV...

06 August 2024 1,752 3 View