Am I right in this interpretation of how LASSO deals with the issues of stepwise model building?

More David M Sidhu's questions See All

Calculating power for a multivariate regression?

How do I determine the number of participants needed to achieve X power, with a small effect size (assuming f2 of .02), for a multivariate regression? As far as I can tell G*Power can only do...

11 December 2017 8,218 4 View

What is the best way to estimate a linear mixed effects model using MCMC sampling in R?

I am interested in running a model predicting a continuous variable, using several fixed effects as well as random subject and item effects. I would like to estimate the model using MCMC sampling...

09 October 2017 3,978 6 View

Is there anything wrong with a perfectly correlated random slope and intercept, assuming there is variation in both?

I know that when there is close to zero variance in either random effect, a perfect correlation can mean that the model isn't able to estimate both effects. However, I have a good deal of variance...

07 August 2017 4,951 8 View

Is there a database of object concepts that have been normed on semantic differential scales?

I'm wondering if there is a database of objects (either word or picture targets) that have been rated by a good number of people (>=30) on Osgood et al.'s semantic differential scales (e.g.,...

03 April 2017 6,464 1 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Using OBD technique i am trying to measure laser induced shockwaves velocity i found that at start velocity increases and then decay?

i am unable to interpret why its increases in start as shown in figure

11 August 2024 2,179 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

James Renwick Beattie

Many regression methods limit the magnitude or norm of the coefficients to constrain the solution space. I understand the main benefit of LASSO to be penalising very small coefficients (setting them to zero if small enough) based on the assumption that these are more likely to be low signal to noise and therefore add more noise than signal to the prediction when applying to independent data.

Brian Schwartz

To my knowledge, LASSO not only limits the sum absolute value of coefficients in a regression model by shrinking the extremely high coefficients – which are likely to be overfitted and won't be replicated in an independent sample – but also by setting very small coefficients to zero, as James mentioned, and by setting coefficients of variables to zero, which are highly correlated with other variables in the predictor set (to deal with the problem of multicollinearity).

Summed up, after shrinkage, the high coefficients should remain the highest in the set of predictors and therefore 'especially influential' (except for the highly correlated ones, because all but one of them is set to zero). Results of a recent analysis I performed, comparing the estimates of a bootstrap ranking LASSO and a logistic regression using glm, point in this direction.

Deepika Joshi

Hi David,

Please share your work upon Lasso. It may be useful to me. Thanks