To develop a prediction model using different machine learning techniques, we need to divide our samples in calibration and validation samples. On what grounds we can divide the those samples.
see https://www.r-bloggers.com/2020/03/simulating-your-model-training-for-robust-ml-models/
Thank you for the help Jin Li
I would recommend stratified sampling to avoid unbalanced dataset and bias issues.
Please see the following links. Scikit-Learn guide provides detailed explanation and analysis related to sampling and dataset splitting.
1. https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.StratifiedKFold.html
2. https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.StratifiedShuffleSplit.html
3. https://scikit-learn.org/stable/modules/cross_validation.html#stratified-shuffle-split
I hope that I provided informative and beneficial information.
Good Luck!
I am working in synthesis of material.
14 December 2020 8,092 5 View
I isolated a plasmid (image attached along with), however the 260/230 ratio was less than desired i.e. 1.29 as compared to the ideal 2.0-2.2. Would it influence the digestion reaction?
09 November 2020 7,666 2 View
I am looking for datasets of ice cover of Arctic Antarctic. I want only India related dataset. If you have any source of dataset please share with me. It will be very helpful for my...
13 October 2020 7,137 1 View
I have SEM image of cross section of multilayer thin film sample and need to represent the different color for different layer without losing its morphological information on the image. Is there...
08 October 2020 9,764 6 View
We need to model deformable metal powder. It will be further used in simulation process.
30 September 2020 9,329 7 View
i want to buy Human Squamous Cell Carcinoma Cell Lines for my PhD research .? One place is NCCS Pune...Can some give more suggestions..? Humble Regards
20 September 2020 3,940 3 View
The homoeopathic medicine consist minerals in it and the same nutrients are essential for the growth and development of the plant. So I asked myself whether the homoeopathic medicine useful for...
19 September 2020 8,339 10 View
While incorporating the DER to the conventional grid, then what will be the effect of system stability and frequency
07 September 2020 2,386 7 View
Monosilicic acid is the available form in which plant can take silicon and this is not a stable form. That's why I want to know how to prepare that and how to estimate that. Kindly give your inputs.
01 July 2020 6,226 3 View
Hi! I am doing an experiment where I need to spray a chemical on plant. I will be using autoclaved milliQ water as a base to dissolve small quantities of that chemical. I was wondering how can we...
09 April 2020 5,808 2 View
What Characteristics makes CNN work better?
03 March 2021 1,458 4 View
i would to know some of the research gaps in the artificial intelligence field in most african countries.
03 March 2021 6,145 3 View
I have selected brain tumor images ...but now found that already lots of research done n this topic.
03 March 2021 5,774 3 View
What's the best way to measure growth rates in House sparrow chicks from day 2 to day 10? Since, the growth curve from day 2 to 10 won't be like the "Logistic curve" it might not follow logistic...
03 March 2021 1,401 3 View
Hi, I am after the reference below, my library says it cannot obtain a copy either locally or internationally, any help appreciated! Chris Wang ZM, Heshka S, Wielopolski L, Pi-Sunyer FX, Pierson...
03 March 2021 6,193 1 View
dear community, my model is based feature extraction from non stationary signals using discrete Wavelet Transform and then using statistical features then machine learning classifiers in order to...
03 March 2021 6,994 5 View
The term miscibility refers to the single-phase state in thermodynamics. I do not mean the compatibility of different components. To determine the miscibility I know several techniques such as...
03 March 2021 4,107 4 View
Hi, I am trying to construct a multi-layer fibril structure from a single layer in PyMol by translating the layer along the fibril axis. For now, I am able to use the Translate command in PyMol...
02 March 2021 4,569 4 View
I feel that the practice in teacher education in my country is below the expected performance level due to very poor management system. Hope I will learn something from your experiences.
02 March 2021 1,516 4 View
NFL theorem is valid for algorithms training in fixed training set. However, the general characteristic of algorithms in expanded or open dataset has not been proved yet. Could you show your...
01 March 2021 1,189 3 View