Stratified sampling vs cluster samping

More Srikanth Potharla's questions See All

How to calculate the c lattice parameter of MXene?

I need to find the c lattice parameter of HF etched Ti3C2 MXene and delaminated Ti3C2 MXene. Papers are reporting an increase in c lattice parameter after delamination. Any formulas, calculations...

17 May 2024 8,458 0 View

How to solve this error?

I have designed an Extended source vertical double gate TFET in Sentaurus TCAD. I have implemented the non-local mesh into it but it shows that it doesn't have non-local lines. Can anyone tell me...

12 April 2024 7,816 0 View

Why is Galvanostatic Charge and discharge curve is not perfectly triangular?

I am facing a reproducibility issue. I am measuring the specific capacitance for my sample (carbide based). I am getting pseudocapacitive behavior in CV curves. The first time while measuring GCD...

13 March 2024 5,666 2 View

Can I evaporate some water content from a slight colloidal solution using a rotary evaporator?

Iam having a slight colloidal solution with some dark suspensions. The solution is aqueous based (It has water). Can i evaporate some of this water content using a rotary evaporator? Can i extract...

12 January 2024 9,183 2 View

How to extract effective mobility in Symmetrical doubel gate MoS2 Field Effect Transistor?

Effective Mobility of charge carries in the channel

19 April 2023 6,809 0 View

Can contact stresses become independent of the rolling velocity in a viscoelastic rolling-contact?

In nonlinear viscoelastic rolling contact problems (assuming a viscoelastic wheel rolling/slipping with zero slip angle on a rigid body with friction), is it possible that the contact stresses...

11 April 2023 5,153 0 View

Did anyone tried bonding PET sheet to glass slide for microfluidic applications? If so, could you please help?

I want to bond 2 glass slides with a PET substrate in between them.

06 March 2023 9,164 2 View

What free / unlimited software is required for .XRDML file analysis?

I have not found any free unlimited versions of software for .XRDML file analysis. Please suggest.

24 February 2023 9,425 0 View

How to reduce impurities in synthesis of Manganese ferrite?

I have synthesized Manganese Ferrite nanoparticles by co-precipitation method. In the XRD analysis i have found two extra peaks around 31 and 45 degrees however the other peaks are matching with...

22 January 2023 7,453 4 View

Any suitable Ball mills for mixing elemental powders of 325 mesh?

I need some suggestions and suitable ball mill for mixing the elemental powders of 325 mesh size like titanium, tungsten,carbon,aluminium powders etc.. I need the model, name and brand of the ball...

22 January 2023 6,391 4 View

RNA later for the preservation of RNA in fecal samples at room temperature for one day (37°C)?

I am planning to collect human fecal samples for metatranscriptomic analysis using MGI. These samples are from indigenous people living in a region with high temperatures. I will have access to a...

06 August 2024 1,367 3 View

If we are using snowball sampling technique, how do we justify the true representativeness of the sample statistically? is there any statistical test?

Are there any statistical methods to justify your sampling technique using SPSS or AMOS?

05 August 2024 9,153 4 View

What are possible strategies can be used to analyze data under sequential explanatory mixed method approach?

Better ways to analyze the qualitative and quantitative data in a sequential explanatory mixed method approaches

04 August 2024 2,703 6 View

What is the best sampling strategy?

I am conducting a qualitative study that uses interviews to investigate the perceptions of teachers about a particular leadership practice and I am focusing on 3 schools which have a total number...

01 August 2024 8,457 10 View

Why 3 replicates for most biological assays? Is it enough to examine the data fits normal distribution?

Just bounced on me. Before statistically analysing significant difference, shouldn't we see if data fits normal distribution first? Is 3 replicates enough to testify the hypothesis of normal...

31 July 2024 8,141 13 View

Request for Advice: Starch Metabolism Research Project?

I am currently considering a research project focusing on a comparative analysis of starch metabolism in orchids and roses. I am particularly interested in identifying the types and quantities of...

30 July 2024 4,267 2 View

Can the limit of quantification (LOQ) of an analytical method fall outside its linear dynamic range, or must it always be within it?

Can an analytical method's limit of quantification (LOQ) be outside its linear dynamic range, or is it always required to be within it? Please provide a thorough explanation supported by verified...

29 July 2024 7,198 9 View

Pragmatic inquiry research design?

Employing a pragmatic inquiry research design, looking for published research using this method, employing qualitative research data collection methods of semi-structured interview and focus...

28 July 2024 540 2 View

Can i apply a public questionnaire but analyse it with a different data analysis method?

First, thanks for taking the time to read my puzzle 🥹! I am currently working on the methodology part of my dissertation, and I have chosen a questionnaire as the instrument for my quantitative...

27 July 2024 3,457 3 View

How to determine method detection limit in an analytical method?

I know the difference between instrumental LOD and method LOD but my query is - in case of any sample whose concentration is zero or not detected by the instrumental LOD, is it possible to get...

24 July 2024 6,592 5 View

James R Knaub

Hello Srikanth -

With stratified sampling (you probably mean stratified random sampling) you break the population into subpopulations which generally have less variance within them, and at best have fairly large differences from one stratum to another, so as to reduce overall variance, as compared to simple random sampling. Even if you really just have different categories, this usually decreases overall variance, but if you also want to publish the categories, that is an additional problem generally requiring more data.

In cluster sampling, you start by breaking your population into groups (clusters), but now these clusters become the sampling unit. You draw some of them, and either census each cluster selected, or do a second stage to the sampling. Multistage sampling is thus a bit more complicated.

Unlike stratified random sampling, cluster sampling is actually less efficient than simple random sampling. However, the larger overall sample size needed is often offset by data collection considerations. If in-person data collection is needed, then it reduces the number of locations to visit. However, because many locations may not be visited at all, the random sampling of clusters had better be good.

Note: Above refers to random sampling for design-based sampling and estimation purposes. Survey statistics, as with other statistics, can make good use of models, but regression modeling is beyond the scope of this discussion.

Cheers - Jim

PS - So for stratified random sampling, you divide your population into parts, and sample from each part. For cluster (random) sampling, you divide your population into parts, and treat those parts as the units from which you draw a random sample.

It may not be very helpful to you, so you may just want to think about my first answer, but interestingly to me, it just occurred to me that a stratified random sample is like a two-stage cluster sample, where the first stage was a census. - Further, a one-stage cluster sample, is a two-stage cluster sample, where the second stage is a census of each selected cluster.

Srikanth Potharla

thank you prof. James R Knaub for elaborating the difference between stratefied and cluster sampling

Weisheng Zeng

Dear Srikanth,

I almost agree with James's explanation about the difference between stratified sampling and cluster sampling. But magbe it will make people a little confusion about cluster sampling and two-stage sampling. Theoretically speaking, stratified sampling and cluster sampling are two special cases of two-stage sampling:

a.Stratified sampling: when all 1st-stage samples are selected (n=N);

b.Cluster sampling: when all 2nd-stage samples in the selected 1st-stage samples are selected (m=M).

For clear understanding, you need read the calculation equations of two-stage sampling. In some references I read, two-stage sampling and cluster sampling are the same or confused.

Interesting, Wei-Sheng. It looks like you repeated my second answer, which actually I expect is not the way you would normally see it explained, but as I said, that works also.

Here is a link to a bibliography I put together for another purpose, but it contains a number of survey statistics textbooks, copies of which I own, which I expect make good references here also. I know that Cochran (1977) does a really good job on stratified random and cluster sampling.

https://www.researchgate.net/publication/317914104_Handout_Bibliography_for_Comparison_of_Model-Based_to_Design-Based_Ratio_Estimators_Poster

.....

- To be clear, you don't actually have to mention stages of sampling to explain stratification, and you don't have to mention it for one-stage cluster sampling either. You can just say, as I did the first time, that "...for stratified random sampling, you divide your population into parts, and sample from each part. For [one-stage] cluster (random) sampling, you divide your population into parts, and treat those parts as the [primary sampling] units from which you draw a random sample."

Data Handout Bibliography for "Comparison of Model-Based to Desig...

conceptually, your discussion sheds light on the difference between stratified sampling cluster sampling. But, in real life application, i am getting many doubts on classifying a given sampling framework as stratified or random sampling.

For example, I want to study the perceptions of the students in the city on the present educaiton policy of the government. there are 10 colleges in the city and total 500 students are there in the city.

how to develop sampling framework, if i apply stratified sampling or cluster sampling.

thanks in advance

If you are trying to compare colleges, then you need data on each, which would be a kind of stratification, but really just a series of simple random samples.

If you just want the optimal overall sample, then that would mean stratified random sampling, unless you had some size measure for unequal probability sampling or modeling. Anyway, stratified random sampling is more efficient than cluster sampling. Cluster sampling could be easier, if you have to travel to administer this survey, but requires a larger overall sample size.

So it sounds like you want stratified random sampling, unless you have logistics problems, generally regarding travel, and would like to reduce the number of trips you make.

It sounds like, in your case, the clusters would probably be the ten colleges. If you only have a universe of N=10 from which to draw for your (first stage) selection, that sounds too coarse to work well. If however they were 10 strata instead, and you drew from each and every one of them, I think results would more likely be much better.

So, hopefully you can use stratified random sampling.

Nikolaos G Farmakis

Dear Srikanth, Good morning from Thessaloniki, Greece.

The main diference between the Stratified Random Sampling (SRS) and the Cluster One is that:

In SRS you have to decide about the strata (under variance within temselfes criteria) and for Custer Sampling the Clusters are redy (already defined) waiting for you take a random sample from their set. They are the sample units.

Dr Nikolaos FARMAKIS Assoc Professor On Statistics

Aristotle University of Thessaloniki.

Dear Prof. Nikolaos G Farmakis

i have two doubts in your explanation.

1. how do we define clusters(can we use cluster analysis, provided we have the data on required variables or is there any other alternative approach)

2. does each cluster represent similar profile of the sample units in the cluster?

3. For example, we have 4 clusters, do we have to select sample from all the four clusters?