How do I perform a gender and age correction in a voxel data to then implement a machine learning classification task?

Walter Hugo Lopez Pinaya @Walter_Pinaya2

10 October 2017 1 5K Report

Hi. I am developing a machine learning model to predict the diagnosis of a subject using his MRI data preprocessed using Voxel-based Morphometry. So, I have two groups of subjects (patients and control), and each person has a 3D volume with 128x128x128 voxels in MNI152 space.

The problem is that the groups have significantly different proportions of gender (verified using Chi-square proportion test). In another type of data, I would "regress out" (using a linear detrending algorithm) the effect of the non-imaging variable. However, I am not sure if this would make sense in this kind of data.

Is it make sense "regress out" the gender (or age and head size) from the intensity value of the voxel? What would be the best way to address the confounding effect of these non-imaging variables?

PS: I am using a deep neural network as the classifier.

Evgeny D Petrovskiy

If the underlying algorithm was linear, 'regress-out' approach would be somewhat applicable, yet, since this is ML and deep-learning, linear decomposition doesn't really make sence - because, if the relationship between, say, age, and Morphometry metrics is not linear, your regression would basically corrupt the data, when without the regression the non-linar algorithm would probably identify the relationship quine nicely.

So why not just incorporate age into the model? Add another input into your model, and pass age into it. Same with gender and any other characteristic of the subjects. With deep learning it's reasonable to assume the model would make appropriate use of additional inputs, if given enough samples. But, well, the same goes for any ML algorithm

Since this incorporation is a bit different than, say, adding several more voxels, and carries a more 'modulation-like' sence, I would assume that the NN would require more layers/neurons to 'build' the underlying relationship. However I do not know the starting point so the structure used (or planned for use) may easily be enough already.

Significant between-group difference isn't perfect, but is not a deal breaker, in my opinion

Badges
Science topic

any study on protective factors of care givers of dementia and resilience ?

need to review articles on care giving in dementia and challenges faced

12 August 2024 6,424 1 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

How can I prepare virus for a TEM or SEM imaging?

I have virus (viral hemorrhagic septicemia virus) in suspension and the experiment will not involve cells. What level of TCID50 is preferred?

11 August 2024 3,115 1 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

Please explain how the plastic input value should be considered from the true stress-strain curve for the bilinear elastoplastic material model ?

I am working on Abaqus/Explicit(Quasistatic ) for the deformation of the auxetic structure model. Please explain how the plastic input value should be considered from the true stress-strain curve...

05 August 2024 454 3 View

How can i do multivariate Time Series forecast using MLP, ANFIS and LSTM?

I need the python code to forecast what crop production will be in the next decade considering climate and crop production variables as seen in the attached.csv file.

05 August 2024 2,977 3 View

The Curse of Evolution and Complexity?

Brain and body mass together are positively correlated with lifespan (Hofman 1993). The duration of neural development is one of the best predictors of brain size, and conception is the best...

05 August 2024 6,247 3 View

Need help with my research project on open source SIEM and machine learning?

Hello everyone, I am currently working on a research project that aims to integrate machine learning techniques into an open source SIEM tool to automate the creation of security use cases from...

04 August 2024 3,196 2 View

The Optimal Experimental Approach for Gene Function Analysis?

What is the most suitable experimental setting to understand the consequences of a particular gene: (1) targeted degradation of the specific transcription factor (TF) of the gene, followed by...

04 August 2024 6,265 1 View