Actually, I am trying to solve a highly non-linear problem regarding energy systems. There are 8–14 variables involved in predicting the energy of the system, and 3–4 of them are uncertain. All the variables are floating-point numbers.
The question is not entirely clear. In any case, there is no limit on the amount of input data, but the inputs to an SVM should be finite, non-complex numbers. Prediction with floating-point data is therefore possible.
From what I can understand of the background of your problem, you may have few training samples, and some features may be unreliable. Regarding the size of the dataset, there is no particular rule of thumb. Your model needs as many training examples as possible so that it can learn the underlying pattern and generalize to the test set. I would recommend the following steps, if they help:
1) Augment your dataset using data augmentation techniques; this will increase your training size. However, make sure not to augment the dataset before splitting it into training and test sets; augment only the training set (see the first sketch after this list).
2) Use feature selection techniques, supervised or unsupervised, to keep only useful and reliable features (see the second sketch after this list).
3) Finally, consider other models such as a decision tree regressor, or boosting algorithms such as AdaBoost or XGBoost (see the third sketch after this list).
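To illustrate step 1, here is a minimal sketch of augmenting only the training split. The Gaussian-noise jitter, the 0.01 noise scale, and the placeholder X and y arrays are my own illustrative assumptions, not something prescribed above:

```python
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 14))   # placeholder features (14 variables)
y = rng.normal(size=200)         # placeholder energy target

# Split first, so the test set is never touched by augmentation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Simple jitter augmentation for continuous tabular data: append noisy
# copies of the training samples, with noise scaled per feature.
noise = rng.normal(scale=0.01 * X_train.std(axis=0), size=X_train.shape)
X_train_aug = np.vstack([X_train, X_train + noise])
y_train_aug = np.concatenate([y_train, y_train])
```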
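For step 2, a sketch of supervised feature selection with scikit-learn, continuing from the arrays in the previous sketch; SelectKBest with mutual_info_regression and k=8 are illustrative choices only:

```python
from sklearn.feature_selection import SelectKBest, mutual_info_regression

# Keep the 8 features with the highest estimated mutual information
# with the target (k=8 is an arbitrary choice for illustration).
selector = SelectKBest(score_func=mutual_info_regression, k=8)
X_train_sel = selector.fit_transform(X_train_aug, y_train_aug)
X_test_sel = selector.transform(X_test)
print("kept feature indices:", selector.get_support(indices=True))
```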
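And for step 3, a sketch of the tree-based alternatives, again continuing from the sketches above; the hyperparameters are illustrative, and XGBRegressor from the separate xgboost package could be substituted if you prefer XGBoost:

```python
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import AdaBoostRegressor

# Fit a single regression tree and a boosted ensemble on the selected features.
tree = DecisionTreeRegressor(max_depth=5, random_state=0)
tree.fit(X_train_sel, y_train_aug)
boosted = AdaBoostRegressor(n_estimators=200, random_state=0)
boosted.fit(X_train_sel, y_train_aug)

print("decision tree R^2:", tree.score(X_test_sel, y_test))
print("AdaBoost R^2:", boosted.score(X_test_sel, y_test))
```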
Kinza Qadeer, the basic rule is to have at least 10, and preferably 50, samples per variable. In your case, with 14 independent variables, you would need at least 140 and preferably 700 samples. I would suggest maintaining this ratio when you split the sample into training and test sets. Check the performance metrics on both training and test data to see whether the model is accurate and robust; cross-validation (CV) will help mitigate many problems, so try that as well (a sketch follows). Good luck!
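A minimal sketch of that check, assuming scikit-learn: the SVR settings and the synthetic 700 x 14 data (roughly 50 samples per variable) are placeholders for your own dataset:

```python
import numpy as np
from sklearn.svm import SVR
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import cross_validate

rng = np.random.default_rng(0)
X = rng.normal(size=(700, 14))   # ~50 samples per variable
y = rng.normal(size=700)

# 5-fold CV, reporting both training and validation scores so that
# overfitting (high train score, low test score) is easy to spot.
model = make_pipeline(StandardScaler(), SVR(kernel="rbf", C=1.0))
scores = cross_validate(model, X, y, cv=5, scoring="r2", return_train_score=True)
print("mean train R^2:", scores["train_score"].mean())
print("mean test R^2:", scores["test_score"].mean())
```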
There is no rule of thumb as such; you need to look at the metrics to evaluate your model's performance. In fact, you may need to experiment with different algorithms to see which one fits your data best (a small comparison sketch follows).
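For example, a sketch of comparing a few candidate regressors on the same cross-validated metric; the candidates, their default settings, and the synthetic data are illustrative assumptions:

```python
import numpy as np
from sklearn.svm import SVR
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 14))   # placeholder data
y = rng.normal(size=500)

candidates = {
    "SVR": SVR(),
    "DecisionTree": DecisionTreeRegressor(random_state=0),
    "GradientBoosting": GradientBoostingRegressor(random_state=0),
}
for name, estimator in candidates.items():
    r2 = cross_val_score(estimator, X, y, cv=5, scoring="r2").mean()
    print(f"{name}: mean CV R^2 = {r2:.3f}")
```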
Specifically, the algorithm imposes no upper limit on the amount of data. However, since this is a data-driven algorithm and you have about 14 variables, it will very likely perform poorly with too few samples (say, fewer than 2000). That said, if you don't have enough samples, I would recommend PCA, SVD, or some other dimension-reduction technique to extract the most significant components (a sketch follows).
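A minimal sketch of that idea with scikit-learn, assuming an SVR downstream; n_components=5 and the synthetic data are illustrative assumptions:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 14))   # few samples, 14 variables
y = rng.normal(size=300)

# Standardize, project onto the 5 leading principal components,
# then fit the SVR on the reduced representation.
model = make_pipeline(StandardScaler(), PCA(n_components=5), SVR(kernel="rbf"))
print("mean CV R^2:", cross_val_score(model, X, y, cv=5, scoring="r2").mean())
```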
I agree with the expert replies above by Radha MOHAN Pattanayak, Rajdeep Kumar Nath, Mohammed Ashikur Rahman, Sushant K. Singh, Shahir Asfahan and Shibaji Chakrabarty. I would like to add some further reading: https://www.sciencedirect.com/topics/nursing-and-health-professions/support-vector-machine and the article "Using Support Vector Machines for Survey Research".
Do you have a reference or a paper that mentions this rule?
I have heard this rule before, but I don't know in which paper or article it is written. I really need to cite this information in my thesis because my dataset is small.