Is the use of GLM correct for Binary response?

More Manuel Alonso Ma's questions See All

How can we improve storage conditions of museum objects?

90% of museum objects worldwide are in storage.

21 July 2024 8,038 6 View

How to removal of dead cells and debris from organoid cultures?

I'm currently working with human pancreatic organoids and I am wondering to remove the dead cells and debris from health organoids, any suggestions would be appreciated!

17 July 2024 8,330 1 View

Are the coordinates the center of the cell or a corner in GADM / worldclim maps in R?

I am currently using a map downloaded from https://gadm.org/maps/DEU/bayern.html and I would like to assign the value of a variable to each cell with R. I was also trying to get this information...

15 July 2024 8,844 1 View

Does it work as antibiotic resistance if plamid has no RBS??

I am on gene work to express gene of interest in some Bacillus species. Thus far, i need to try some RBS in Vector which is optimized for bacillus subtilis. Before i design vector to express in...

11 July 2024 7,847 1 View

Warning: convergence tolerance of 1.000000e-06 not reached?

Hola a todos, Me gustaría realizar una consulta con relación al mensaje de advertencia que se muestra en la imagen anexada. Es un mensaje que aparece al inicializar la solución de una simulación...

05 July 2024 974 4 View

Can a liquid metal-ligand sample be directly analyze in flame atomic absorption spectrosocopy?

Can I directly analyze a liquid metal-ligand sample without acid digestion in flame atomic absorption spectroscopy?

25 June 2024 1,866 2 View

Is this a mixed methods approach?

For my PhD dissertation: If Chapter 1 is a systematic literature review (QUANTS/QUALS) while Chapters 2, 3, and 4 are empirical chapters (QUALS), is this considered a mixed methods research...

16 June 2024 3,220 5 View

What can I do if primer solution gives band in electrophoresis before PCR reaction?

Trouble 1: I ran a PCR reaction and found a low molecular weight band (100 bp) when doing electrophoresis. 1ng plasmid (OD260/280=1.84) was used as template. Final concentration of primers was...

13 June 2024 5,540 6 View

Does anyone have the manual for the PBCON-24T-700-4 lighting controller?

Good afternoon, everyone. I am trying to locate the manual of a lighting controller, whose code is PBCON-24T-700-4 from the company DesignInnova or Photo Biosim. Does anyone know how to get it or...

28 May 2024 8,240 0 View

How to find the original questionnaire of the article-Dimensional analysis of schoolchildren’s food label comprehension: a pilot study?

27 May 2024 2,551 0 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Why does my protein refolded to beta sheet during thermal denaturation analysis?

Hi! So i attempted to understand a novel protein behavior towards heat application by analyzing its secondary structure change. I subjected the protein to a thermal denaturation analysis using...

06 August 2024 1,989 3 View

Alexander Pabst

GLM means generalized linear models, which you can use for a variaty of outcomes, not only continuous. Given your data, you can thus either use logistic regression or - as you did - GLM with option family=binomial. Both should give you the same, correct results.

Jochen Wilhelm

What you are doing seems correct. Maybe just a notational thing: "X" in your model is a predcitor, not a "response" (as you wrote). I presume you wanted to say that both variables, X and NIH, are binomial. In your model, the binomial variable NIH is the response and the binomial variable X is the predictor.

253266 degrees of freedom indicates that you have a huge data set. If this is so, then looking at p-values makes little sense. You should interpret the estimate instead. The estimate here is about 1.43 (you can get a confidence interval using confint(fit.1way) - this will presumably be quite narrow, given that sample size). The estimate is a log odds ratio; the odds ratio is exp(1.43) = 4.12, saying that the odds of getting an NIH grant are roughly 4 times higher when X is TRUE as when X is FALSE. Depending on what X is, this may or may not be relevant. However, keep in mind that getting a grant is one thing, and the hight of the grant is another. It could be that X is TRUE for smaller grants, which are more likely and more often to be given, because there is not that much money involved. Further, think carefully if there is a confounder that could also explain the higher odds of getting a grant (e.g. X is correlated with the NIH butgets for different resesarch topics). And lastly, be careful not to confuse correlation with causation when interpreting your results.

Patrice Showers Corneli

Formally called a log-linear model just in case you'd like to read about the family. Clearly you have a tremendous amount of data so are very likely to get a statistically significant result.

Takashi Suzuki

Logit or probit model.

Martin Schmettow

Looks correct. But, do not get too excited about the p-value when you have a large sample size. Most likely, it is the effect size you are really interested in and this can be inferred from the estimates.

Patrice Showers Corneli: this is not a log-linear model, as logistic regression uses the logit link function.

A log-linear model is simply a logistic regression with categorical explanatory variables and can use a log or logit link. A logistic model has categorical response and also accommodates categorical and also continuous explanatory and may use a logit or probit.

To the main point, the question is whether the statistically different between the two groups is in a practical sense a meaningful difference.

Martin is certainly right. A very small p-value will be obtained from very large studies. But a small p-value is only important if the difference between the groups is large enough to provide insight. A very small difference can always be detected with lots of observations. But whether the difference is meaningful can only be judged by the researcher who understands the size of the effect that is informative about the process being investigated.

A drug tested against a control in a very large study with thousands of participants is expected to be statistically significant. But if it reduces the probability of dying over the control group by, say 3%, it is not really biologically important.

We use statistics to inform our scientific practices. So knowing the magnitude of effect that we would consider important requires scientific knowledge of the study material.

Patrice Showers Corneli : That simply is not correct. A logistic regression is a logit-linear model. The logit function serves to map a response variable that has a lower and upper boundary to "linear space" (which has no limits). In other words: counts with a maximum number. The very name "logistic" comes from the fact that the logistic function is the inverse of the logit.

A log-linear model is a Generalized Linear Model with a logarithmic link function, typically this is Poisson or Negative binomial regression. It applies when there is no upper limit to counts. The type of predictors is generally irrelevant for all linear models. See chapter 7.2.1 of my online book: https://www.researchgate.net/project/Book-New-statistics-for-the-design-researcher

That being said:

log-linear models can approximate logit-linear models fairly well, when the upper bound is very large. (but, why would you want an approximation, when doing the real thing is the same effort)
Patrice Showers Corneli explanations on p-values versus effect sizes are spot on
when the purpose of modelling is prediction (rather than hypothesis testing), the AIC is the appropriate model score (rather than the p-value).