I can calculate the significance for most of my data, but when it comes to determining whether the degradation rate for one treatment is significantly different from that for another treatment, I am unsure of what data to use.
You could fix one treatment as a control and then estimate the degradation value of each treatment relative to that control. The data thus obtained may be analysed in the same way as you analysed the rest. Please see whether that works for your data.
If the degradation rates are constant over time, the log-transformed values should decrease linearly over time. In this case you could do a multiple linear regression analysis and test the differences between the slopes.
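As a rough sketch of that slope comparison in Python (the data frame, the column names `time`, `treatment`, and `log_conc`, and the values are all made up for illustration; the interaction term is what tests whether the slopes differ):

```python
import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical long-format data: one row per measurement
df = pd.DataFrame({
    "time":      [0, 7, 14, 21, 0, 7, 14, 21],
    "treatment": ["A", "A", "A", "A", "B", "B", "B", "B"],
    "log_conc":  [4.6, 4.1, 3.7, 3.2, 4.6, 4.3, 4.0, 3.8],
})

# The time:treatment interaction captures the difference in degradation slopes
model = smf.ols("log_conc ~ time * treatment", data=df).fit()
print(model.summary())  # inspect the p-value of the time:treatment[T.B] term
```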
Otherwise you might consider a survival analysis (e.g. Cox proportional hazards regression or some other model).
I'm unsure of the details of your experiment, but you might consider Kaplan-Meier analysis. This would plot the degradation over time and produce "survival" curves for each treatment (i.e., degradation curves) and you can determine significance by log-rank tests.
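For a rough idea of what that could look like, here is a sketch using the Python lifelines package (my assumption, since the thread mentions other software; the times until a degradation threshold is reached and the event indicators are invented):

```python
import numpy as np
from lifelines import KaplanMeierFitter
from lifelines.statistics import logrank_test

# Hypothetical time to reach a degradation threshold (event = 1 reached, 0 censored)
time_a  = np.array([5, 8, 12, 15, 20, 20])
event_a = np.array([1, 1, 1, 1, 1, 0])
time_b  = np.array([9, 14, 18, 22, 25, 25])
event_b = np.array([1, 1, 1, 1, 0, 0])

# "Survival" curves here play the role of degradation curves
kmf = KaplanMeierFitter()
kmf.fit(time_a, event_observed=event_a, label="Treatment A")
ax = kmf.plot_survival_function()
kmf.fit(time_b, event_observed=event_b, label="Treatment B")
kmf.plot_survival_function(ax=ax)

# Log-rank test for a difference between the two curves
result = logrank_test(time_a, time_b, event_observed_A=event_a, event_observed_B=event_b)
print(result.p_value)
```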
The null hypothesis is that there is no difference. If there is no difference in degradation rates, then it does not matter which treatment any given observation originated from. Take your data and randomize it: reassign the observations to two groups of the same sizes as the original groups and calculate the absolute value of the difference in degradation rate between them. Do this 20,000 to 50,000 times. Sort the differences from small to large and find where the original difference fits in the sorted data. Is it beyond the point where 95% of the randomized differences are smaller than the observed difference? This is a randomization test.

It will not work if you only have 3 or 4 replicates in each treatment. I am comfortable with about 20, but some would strongly recommend 100 or more. There are no assumptions about the distribution of the data; however, there is an assumption that the distribution of the data is the same in all treatments. As with many statistical tests, minor violations of the assumption are usually not a problem. I know this test can be done in R and in SAS. I am not familiar enough with other software to answer the question.
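As a rough sketch (here in Python rather than R or SAS, and with made-up degradation rates), the procedure might look like this:

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical degradation rates per replicate for two treatments
rates_a = np.array([0.21, 0.25, 0.19, 0.23, 0.27, 0.22, 0.24, 0.20, 0.26, 0.23])
rates_b = np.array([0.30, 0.28, 0.33, 0.31, 0.27, 0.29, 0.32, 0.30, 0.34, 0.28])

observed = abs(rates_a.mean() - rates_b.mean())
pooled = np.concatenate([rates_a, rates_b])
n_a = len(rates_a)

n_iter = 50_000
count_ge = 0
for _ in range(n_iter):
    shuffled = rng.permutation(pooled)          # reshuffle labels, no replacement
    diff = abs(shuffled[:n_a].mean() - shuffled[n_a:].mean())
    if diff >= observed:
        count_ge += 1

p_value = count_ge / n_iter                     # fraction of randomized differences >= observed
print(f"observed difference = {observed:.4f}, p = {p_value:.4f}")
```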
Timothy, the randomization test you describe is about the (expected) difference. This test works "OK" for about 20 reps and "well" for about 100 reps. Now, there is an elegant test available for testing the expected difference (the t-test) that does not need big calculations, at the cost of the additional assumption of a particular shape of the likelihood function. But this assumption is almost always reasonable for 20 or more reps. So I don't see any advantage in using a randomization test in this case (I would see one if you wished to test some statistic for which no designed test is available). Am I missing something?
1) Some people would prefer not to have to assume a specific distribution.
2) The numbers sound big, but in R on my laptop I can run 1,000,000 randomizations in 1 to 5 minutes, and I expect the test described here to take about that long. However, if a non-linear regression is being done, it might take a bit longer. I suppose that with thousands of data points and a complex equation it might take a few hours, yet that is not the typical case. Thus, I don't see this as "big calculations."
3) A sample size of 20 is insufficient to identify the underlying distribution with any accuracy. Even if you assume a specific distribution, a sample size of 20 is still subject to considerable error (though much less than a sample size of 5).
Well, a t-test takes a fraction of a microsecond, so 1-5 minutes is considerably more computer intensive, and that is a comparatively "big calculation" (think of all the power plants we would need to run just to let some thousands of scientists do their beloved tests all using randomization strategies!).
I doubt that the differences are relevant. If the sample sizes are small, all tests and estimates are "difficult", and if they exceed 20 or 50 or so, knowing the "exact" distribution does not matter. Here the bigger problem is that scientists tend to use entirely unsuited models for the problem at hand.

To give an example from my area: often, the expected difference in gene expression is "tested", which simply does not make much sense, because expression changes are proportional to the gene activity. A randomization test would allow one to technically correctly assess the significance of the data under an expected difference of zero, but the biologically sensible question is about the relative change, which is assessed by the difference on the log scale. The failure to see these things is the main problem. If this is recognized, the assumption of a (roughly normal) likelihood function for the difference of a sample of 20 or more log expression values is very well justified, and in this case the direct test and the randomization test would give very, very similar answers again.
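To make the log-scale point concrete, a small sketch in Python (the expression values are simulated, not real data):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)

# Hypothetical expression values for two conditions (multiplicative noise)
control   = rng.lognormal(mean=np.log(100), sigma=0.3, size=20)
treatment = rng.lognormal(mean=np.log(150), sigma=0.3, size=20)

# Testing the raw difference ignores that changes scale with gene activity;
# the biologically sensible comparison is the relative change, i.e. the
# difference on the log scale.
t_log, p_log = stats.ttest_ind(np.log2(treatment), np.log2(control))
log2_fc = np.log2(treatment).mean() - np.log2(control).mean()
print(f"log2 fold change = {log2_fc:.2f}, p = {p_log:.4f}")
```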
I hope you got the point.
Another note: I think you actually meant "bootstrap tests" rather than "randomization tests". A randomization test is a test on actually all permutations. When the sample size gets larger, the number of permutations quickly becomes extremely huge, so the "permutation space" is sampled using a bootstrap algorithm. The permutation test is called an "exact" test, whereas the bootstrap test is an "approximate" test.
I guess it all depends on how you use the computer. If I go get a drink of water do I power down the computer? If it is just sitting idle (consuming power), it might be better to get it to do something. Of course if you turn the computer off, then you have to add in the time (and power) lost in having the computer boot up. The computer still uses power when it is "asleep." It might be more productive to have the computer doing something.
One small issue: I did mean randomization. However, you make a good point.
A permutation test looks at all possible arrangements of the data. This is good if there are a small number of permutations (two or three million). It is unproductive when there are a large number of permutations.
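As a sketch of that exhaustive version (Python, with invented values for two small groups), an exact permutation test simply enumerates every arrangement:

```python
from itertools import combinations
import numpy as np

# Hypothetical degradation rates, small enough to enumerate all arrangements
rates_a = np.array([0.21, 0.25, 0.19, 0.23])
rates_b = np.array([0.30, 0.28, 0.33, 0.31])
pooled = np.concatenate([rates_a, rates_b])
n_a = len(rates_a)
observed = abs(rates_a.mean() - rates_b.mean())

# Every way to assign n_a of the pooled values to group A (C(8, 4) = 70 here)
count_ge = 0
total = 0
for idx in combinations(range(pooled.size), n_a):
    mask = np.zeros(pooled.size, dtype=bool)
    mask[list(idx)] = True
    diff = abs(pooled[mask].mean() - pooled[~mask].mean())
    count_ge += int(diff >= observed)
    total += 1

print(f"exact p-value = {count_ge / total:.4f} ({total} arrangements)")
```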
A randomization test randomly samples all possible arrangements of the data, but without replacement. All observations are used once and only once for each randomization. The number of randomizations should be a small fraction of the number of permutations. Thus 50,000 randomizations out of 10⁹ permutations is 0.005%. A small fraction by most reckoning.
A bootstrap is similar but resamples with replacement. I am not a big fan of this approach because it allows for the possibility that a treatment in one randomization will consist entirely of the results from one observation. It also (fairly or unfairly) reduces the perceived impact of an unusual observation because the unusual observation gets swamped by the bulk of data about the mean value. Of course, this might be a good thing if the unusual observation was caused by some undiscovered error.
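To illustrate the distinction (Python, with a made-up data vector): a randomization-style resample is just a reshuffle, while a bootstrap resample can repeat or omit observations.

```python
import numpy as np

rng = np.random.default_rng(0)
data = np.array([3.1, 2.8, 3.5, 2.9, 3.3, 3.0])

# Randomization-style resample: every observation used exactly once
without_replacement = rng.permutation(data)

# Bootstrap-style resample: drawn with replacement, so values can repeat
with_replacement = rng.choice(data, size=data.size, replace=True)

print(without_replacement)
print(with_replacement)
```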
These definitions were set forth in Bryan Manly's book "Randomization, Bootstrap and Monte Carlo Methods in Biology." This is the book that I use, so those are my definitions.