What is the rationale for the choice of number of segments in cross-validation?

More Adeyemi Adegbenjo's questions See All

What determines the output spectra you get from an equipment? Is it the imaging light source mode or the sensor signal default of the equipment?

I'm just curious to know what determines the output spectra you get from an equipment. Is it the imaging light source mode or the sensor signal default of the equipment? Im thinking its the latter...

05 June 2016 6,960 3 View

During a SIMCA classification modelling, what do I do if my global PCA does not give clearly distinguishable classes?

I am doing a SIMCA classification modelling using the Unscrambler software. The usual approach is to run a global PCA first to ascertain existing classes are clearly distinguishable before...

09 October 2015 2,897 9 View

Difference between Type I and Type III SS decision tables in statistical analyses?

I am running a GLM proc SAS analysis and my type I and Type III SS tables arrived at different conclusions entirely for my main effects. How do I know which of the two tables to use in making my...

31 December 2014 4,477 17 View

What is the acceptable number of PLS components in PLSR analysis?

I am running a PLS analysis on an imbalanced data and seems to be getting good results with number of PLS components 'n' = 30 and 40. I have up to 335 sample sets. Are these number of PLS...

05 June 2014 1,083 2 View

Where can I get a sample source code for a fuzzy one support vector machine?

I wish to try fuzzy one SVM for my classification problem. Does anyone know how I can get a sample source code to start with?

04 May 2014 1,256 1 View

Does the choice of gamma value has any serious effect on model performance in SVM classification?

When trying to fine tune the SVM classification model using the grid parameter optimization, i found many values of Cs and gamma with different numbers of support vectors having 100% cross...

04 May 2014 537 10 View

How to get a fuzzy support vector machine algorithm for my data analysis?

I need a supervised learning algorithm to solve my imbalanced data classification problem. Can anybody tell me how i can get the fuzzy support vector machine algorithm which, as I have read in...

03 April 2014 9,465 0 View

I want to work on fuzzy support vector machine algorithm for classification problem. How do I start? Its a new area to me.

Im working on an imbalanced data classification problem that K-mean clustering seems not to be solving well. I heard Fuzzy support vector machine can help. How do I get this algorithm to work?

03 April 2014 3,479 5 View

How do I classify image samples with two different features using their image parameters as the basis of isolation?

I have two kinds of features (say "intact" and "cracked" samples) in my image data. How do I classify new set of samples into "intact" or "cracked" samples based on my already known samples for...

03 April 2014 9,223 5 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

Is there an alternative to a multinomial regression which allows the DV to be non mutually exclusive?

I am trying to analyse data from a survey examining what variables affect teachers perceived barriers to incorporating technology into their classroom. I have 5 predictor variables however my DV...

06 August 2024 1,752 3 View

In order to run Multinomial Logistic Regression, is it required that the data be in the long format?

I am using unit level data (IHDS round 2) & Stata 17

06 August 2024 5,725 2 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

How can i do multivariate Time Series forecast using MLP, ANFIS and LSTM?

I need the python code to forecast what crop production will be in the next decade considering climate and crop production variables as seen in the attached.csv file.

05 August 2024 2,977 3 View

The Curse of Evolution and Complexity?

Brain and body mass together are positively correlated with lifespan (Hofman 1993). The duration of neural development is one of the best predictors of brain size, and conception is the best...

05 August 2024 6,247 3 View

Need help with my research project on open source SIEM and machine learning?

Hello everyone, I am currently working on a research project that aims to integrate machine learning techniques into an open source SIEM tool to automate the creation of security use cases from...

04 August 2024 3,196 2 View

Swimming/space travel depends on the proprioceptive muscle spindles?

When the entire neocortex is ablated in rodents, although they are still able to swim, all the limbs move continuously and asynchronously (Vanderwolf 2006; Vanderwolf et al. 1978). Normal animals...

03 August 2024 835 3 View

What are the limitations and challenges of using machine learning for predicting concrete compressive strength in practical applications?

Machine learning (ML) has shown great potential in predicting the compressive strength of concrete, an important property for structural engineering. However, its practical application comes with...

03 August 2024 2,546 2 View

Dr. Indrajit Mandal

10 is a good number to start with.

In most research publications , it is followed across the globe.

Thank you.

Best,

INDRAJIT MANDAL

Mahmoud Omid

Hi, It depends on the data set. If you have large number of instances (pattern) then 10-fold is good to start with. But if you have few instances the you should apply LOO (leave one out) cross validation.

Pekka Jounela

Hi, the choice of folds is compared by Ron Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection, Proceedings of the 14th international joint conference on Artificial intelligence, p.1137-1143, August 20-25, 1995, Montreal, Quebec, Canada.

Benammar Riyadh

Dear professor,

I've done an OCR project within a team of 4 persons based on KNN method. And we have validated our parameters using cross validation. But we haven't any idea about the size of the folds.

So we have done an experience to see relationship between percentage of folds and our parameters and we have found that with from 15% to 20% we get the best ones.

Best regards.