Should data be preprocessed before or during cross-validation?

More Irakli Skhirtladze's questions See All

How to perform generalized forecast error variance decomposition ?

Hello everyone. I am using the VECM model and I want to use variance decomposition, but as you know variance decomposition is very sensitive to the ordering of the variable. I read in some papers...

28 May 2021 3,502 5 View

I have problem with structural VAR model. can anyone help?

I was using the VAR model and to see the impulse response function I used Cholesky decomposition. data is quarterly from 2003q1 to 2019q2. The first variable is oil price as a supply shock, second...

06 April 2021 4,390 6 View

I am using VAR model and it shows me wrongly that exchange rate has negative effect on CPI and what can be problem?

I am using VAR model and to see the impulse response function I used Cholesky decomposition. The first variable is oil price as a supply shock, second output gap by applying Hodrick-Prescott...

20 March 2021 7,054 9 View

Weights in mixed model ANOVA, should I use weights prior to any analysis?

I have a within subjects survey data. One variable is gender where I check if there is a gender difference in body perception. For this I understand that I have to use weights but the rest of the...

13 September 2020 7,079 1 View

A manual of geotechnical investigation?

Can someone provide a manual or a guide of geotechnical investigation procedures for different types of constructions in English? I mean a guide of methodology and standard procedures for...

17 February 2020 6,879 3 View

What method can be used to determine the P-T conditions of low temperature/pressure metamorphism?

If all known classical geotermobarometers does not work... :)

27 November 2019 2,231 8 View

I am looking for person, who can help me in my PhD. I study metamorphism and I need consultant and collaborator. Who is open for collaboration?

I study contact and regional metamorphism of Dizi series, Great Caucasus, Georgia. I have a problem only with analytical part of my work. And It would be great to share knowledge and experience...

13 February 2019 6,123 14 View

Could anybody advise me the best textbook for X-ray fluorescence?

Basics, methods, calibrations etc.

28 January 2018 9,605 8 View

Are there improved methods for feedback control systems better than Bode diagrams?

What methods are established after Bode's diagrams to find optimal regulators for linear (time-invariant) dynamical systems of the type: dx/dt=Ax+Bu, y=Cx, where A, B and C are real matrices (with...

08 November 2014 2,269 3 View

Can someone cite some references for connections between graph theory and general topology?

I especially have interests to graphs on finite sets and finite topologies.

05 December 2013 4,423 25 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

Is Galaxy.org good to use for research for analyzing data and for publication?

Hello all, I wanted to know, can I use galaxy (USA, Europe or Australia) platform for analyzing the shotgun data, and can it be used for publication purpose as well? Thanks :)

06 August 2024 6,610 4 View

Do experts have journals in the field of artificial intelligence and big data that are not indexed by SCI or EI?

05 August 2024 8,836 2 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

How can i do multivariate Time Series forecast using MLP, ANFIS and LSTM?

I need the python code to forecast what crop production will be in the next decade considering climate and crop production variables as seen in the attached.csv file.

05 August 2024 2,977 3 View

The Curse of Evolution and Complexity?

Brain and body mass together are positively correlated with lifespan (Hofman 1993). The duration of neural development is one of the best predictors of brain size, and conception is the best...

05 August 2024 6,247 3 View

Piotr Picheta

All operations like preprocessing, augmentation, hyperparameters tuning, and so on, should be performed on every split independently to avoid data leakage and adaptative overfitting.

Irakli Skhirtladze In sklearn the easiest would be to use Pipeline and cross_validate like described here -> https://inria.github.io/scikit-learn-mooc/python_scripts/02_numerical_pipeline_cross_validation.html

Irakli Skhirtladze

Thank you! Appreciated.