What analysis to choose with a large nested dataset and clearly skewed or kurtotic distributions?

03 March 2021

Hello!

I have a dataset of n=3000 nested within 8 countries with approximately 200 or 400 responses in each country. I originally planned to perform multilevel modelling with 4 dependent variables (DV) as fixed effects in SPSS.

The DVs are responses on a scale of 1-100, and this kind of variable is treated as metric in psychology.

However, all my DVs and the error terms are clearly skewed or clearly kurtotic. My questions are:

1. I have read that in some cases the size of the dataset or the number of nesting groups allows the use of the general linear model. Does it make sense to do so, however, if the dataset clearly shows extreme tendencies? They look to me like clearly different distributions, but I am not sure how to define them. Should I regard them as continuous distributions?

2. Am I right to think that data transformation is not a good option if there is a different form of distribution?

3. What would be the advantages and disadvantages of bootstrapping or simulation?

4. What would be good reasons for using a generalized linear or a mixed model?

5. Would it be appropriate to perform a factor analysis of the four DVs? If not, are there alternatives?

I would appreciate it if someone could answer any of these questions or suggest some not very technical references!

Abdulrazzag Falah

Olga Kostoula If I understood your question correctly, you are doing a multivariate analysis since you have four DVs, right? If that is the case, you do not need to worry about univariate normality. Instead, you should check for multivariate normality, for example using the Mahalanobis distance. You could also visualize it with a Q-Q plot. The literature in educational psychology on this matter suggests that if multivariate normality is met, it is safe to assume univariate normality as well.

Also, have you treated the outlier cases? Some multivariate outliers may be the cause of the extreme skewness and/or kurtosis. I usually use R's package QuantPsyc to do that, using the "maria" function.
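The Mahalanobis-distance check and outlier screening mentioned above could be sketched roughly as follows. This is only an illustration in Python with NumPy and SciPy, not the QuantPsyc procedure: the data are simulated, the helper names `squared_mahalanobis` and `chi2_qq_pairs` are made up for this example, and the p < .001 cutoff is an assumed convention rather than something stated in the question.

```python
import numpy as np
from scipy.stats import chi2


def squared_mahalanobis(X):
    """Squared Mahalanobis distance of each row from the sample centroid."""
    X = np.asarray(X, dtype=float)
    centered = X - X.mean(axis=0)
    inv_cov = np.linalg.inv(np.cov(X, rowvar=False))
    # Row-wise quadratic form: centered_i' * inv_cov * centered_i
    return np.einsum("ij,jk,ik->i", centered, inv_cov, centered)


def chi2_qq_pairs(d2, n_vars):
    """Pair sorted distances with the chi-square quantiles expected
    under multivariate normality (df = number of variables)."""
    n = len(d2)
    probs = (np.arange(1, n + 1) - 0.5) / n
    return chi2.ppf(probs, df=n_vars), np.sort(d2)


# Simulated stand-in for four DVs (assumption: not the real dataset).
rng = np.random.default_rng(0)
scores = rng.multivariate_normal(np.zeros(4), np.eye(4), size=500)

d2 = squared_mahalanobis(scores)
expected, observed = chi2_qq_pairs(d2, n_vars=4)

# Assumed convention: flag cases beyond the chi-square critical value at p < .001.
cutoff = chi2.ppf(0.999, df=4)
outliers = np.flatnonzero(d2 > cutoff)
```

Plotting `observed` against `expected` gives the Q-Q plot: points close to the diagonal are consistent with multivariate normality, while a curved pattern or a few points far above the line suggest skew, heavy tails, or multivariate outliers (the indices in `outliers`).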

If you conclude that your data are not multivariate normal, identify which variable is skewed and follow the common data transformation steps: if the skew is not severe, use a square-root transformation; if it is more severe, try a log transformation. Inverse transformations are only for very strange-looking data. Those are heuristics and won't affect your results in any way (Tabachnick & Fidell, 2019).
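To see how the square-root, log, and inverse transformations compare on a given variable, a small sketch like the one below may help. It assumes a positively skewed score on a 1-100 scale; the data are simulated, and the `skewness` helper is hypothetical and written out by hand so that only NumPy is needed.

```python
import numpy as np


def skewness(x):
    """Sample skewness: mean of the cubed z-scores."""
    x = np.asarray(x, dtype=float)
    z = (x - x.mean()) / x.std()
    return float(np.mean(z ** 3))


# Simulated, positively skewed scores clipped to a 1-100 scale
# (assumption: a stand-in for one of the DVs, not real data).
rng = np.random.default_rng(1)
raw = np.clip(rng.lognormal(mean=2.5, sigma=0.7, size=1000), 1, 100)

candidates = {
    "raw": raw,
    "sqrt": np.sqrt(raw),   # mild positive skew
    "log": np.log(raw),     # stronger positive skew
    "inverse": 1.0 / raw,   # most drastic option; reverses the order of scores
}
skew_by_transform = {name: skewness(v) for name, v in candidates.items()}

for name, value in skew_by_transform.items():
    print(f"{name:8s} skewness = {value:.2f}")
```

For a negatively skewed variable, the scores would first need to be reflected (for example, 101 minus the score on a 1-100 scale) before applying these transformations. Keep in mind that a model fitted to a transformed DV gives estimates on the transformed scale.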
