How would you explain the difference between hierarchical and stepwise regression to undergraduate students?

More Witold Orlik's questions See All

Nursing Boards in Europe - is there any document online with contacts, e-mails etc?

Hello all Research Gate users. I wonder if Nursing Boards in Europe or Nursing Accreditation Bodies for nursing and/or midwifery contacts (e.g. e-mail addresses) are somewhere available online or...

06 May 2021 520 3 View

Is it a problem if I would like to implement 10 mediators in one structural equasion model (SEM)?

Hello scientific world, I just simply wonder if 8 to 10 mediators that I found as important for my model is just not too much? One idea that comes to my mind is that I could reduce this number...

05 March 2018 657 2 View

Any articles on children/adolescent social development and language?

Hello researchers, I am looking for articles on language (speech, vocabulary, and so on) and social development of children and adolescents. Especially I would be interested in materials that...

04 March 2018 5,956 3 View

Are confidence intervals used by frequentists in reality bayesian credibility intervals?

Hello, Is this fair to report 95% confidence intervals in frequentist approach when we have doubts about representativeness of the sample, etc? Using other words... having just one analysis how...

06 February 2018 5,643 10 View

Why do significance values differ when I compare standardised and unstandardised results?

I was running mediation models and no surprise that coefficoients dfiffer... and majority of significance values are the same.... However, when I looked at indirect effects significance value...

15 January 2018 3,653 3 View

Regression, is my analysis NOT producing WEIRD results?

When both predictors are applied in the model with one outcome variable, they are both non-significant. However, when I apply them on one to one basis, they both are significant. Is it not...

05 December 2017 1,701 8 View

Should bootstrapping be always used while running mediation in SEM context?

Example from my analysis: A and B (main predictors) predicting C (mediator) and C predicting D (outcome variable). Example 1 : Analysis without bootstrapping: Direct effects: A - D =...

04 December 2017 5,531 8 View

Why such high standardised coefficients in mediation model in Mplus?

In the STDYX Standardization section of Mplus output, some coefficients are above 1. I thought as they are standardised they should be betwwen -1 to +1. (e.g. I have some of above 3 or -3). Why is...

26 November 2017 8,558 2 View

How to transfer back data from Mplus to STATA?

Hello Research gate users, Please help :).I converted data set from Stata to Mplus, then ran some latent class analysis using Mplus. Now I would like to transfer back 3 class solution from Mplus...

25 September 2017 7,179 1 View

How to implement covariates in LCA/LPA?

Hello, I ran latent profile analysis on mplus. Now I would like to add two covariates - gender and socioeconomic status. I have couple of questions: - Can I just regress latent variable (c) on...

21 September 2017 2,153 5 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Why does my protein refolded to beta sheet during thermal denaturation analysis?

Hi! So i attempted to understand a novel protein behavior towards heat application by analyzing its secondary structure change. I subjected the protein to a thermal denaturation analysis using...

06 August 2024 1,989 3 View

Julia B. Smith Popular answer

I learned regression from Tony Bryk, and he explained step-wise regression as "a-theoretical and lazy." He then explained how economists approach data analysis, but I don't want to get into that fight :-)

Seriously, why WOULD you want undergraduates (or, anyone) to get into the habit of analyzing data without having a reason why he/she thought X might predict Y?

If I were putting the lesson together, I would probably give my students an empirical demonstration: first have them pick out logical predictors for an outcome from a codebook, then demonstrate how the results look if you put the predictors into the model according to their theory, then demonstrate how the results look if you let the program do it automatically. Then talk about what the differences are between the two sets of results - why they came out differently, what the analysis used to decide on a second and third step vs. what the students thought should go in second and third. You will get a lot farther if you have them figure out what is different than trying to explain to them that it is.

Daniel Wright

Witold, do you mean hierarchical in the Bryk and Raudenbush sense, or some other way? As far as the variants for model selection, comparing for example forward stepwise with the lasso, it depends what level students. The new lasso book (https://www.crcpress.com/Statistical-Learning-with-Sparsity-The-Lasso-and-Generalizations/Hastie-Tibshirani-Wainwright/9781498712163) is good, but would require the students have some stats background. However, I think their plots showing diamonds might be able to help explain these without any equations.

Witold Orlik

Dear Daniel,

I think this link answers the question. However, some slightly more approachable way would be highly appreciated.

http://imaging.mrc-cbu.cam.ac.uk/statswiki/FAQ/hier

Okay, that is different from the way Raudenbush and others use the phrase. This hierarchical means variables are entered according to some model and presumably there are ANCOVA-like research hypotheses tested at each stage. The forward and backwards approaches aren't liked by many methodologists and it is difficult to interpret any of the p values, so I am not sure when they would be recommended.

Julia B. Smith

Another good and freely available source is Efron et al.'s LARS paper (http://statweb.stanford.edu/~imj/WEBLIST/2004/LarsAnnStat04.pdf) which describes some of the problems with the forward approach.

I think a big difficulty is what to do with the results from a forward/backward/stepwise regression, and I think that is tricky to teach because what if a student asks what an individual coefficient and its p value mean. Given the way it is selected to be in the model, this is difficult to answer. Keeping with atheoretical approaches (and I agree with Julia that on most psychology datasets theory should be important for analysis), I think one of the more surprising (to me) results was that p values can be calculated for the lasso. This is described in section 6.3 in Hastie, Tibshirani, & Wainwright's (2015) book, and various papers cited within (e.g., http://www.stat.cmu.edu/~ryantibs/papers/lassosignif-aos.pdf, but there are other approaches). Of course, p values are tough to explain anyway.