What's the optimal observation count per category for Machine Learning?

Shafagat Mahmudova

Dear Guillermo Palchik,

Generally speaking, the rule of thumb regarding machine learning is that you need at least ten times as many rows (data points) as there are features (columns) in your dataset. This means that if your dataset has 10 columns (i.e., features), you should have at least 100 rows for optimal results.

https://graphite-note.com/how-much-data-is-needed-for-machine-learning#:~:text=Generally%20speaking%2C%20the%20rule%20of,100%20rows%20for%20optimal%20results.

Regards,

Shafagat
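As a minimal sketch of the arithmetic behind the rule of thumb quoted in this reply (ten rows per feature), the check could be written as below. The function name `min_rows_rule_of_thumb` and its parameters are hypothetical, chosen only for illustration, and the heuristic is a rough guide rather than a guarantee of good results.

```python
def min_rows_rule_of_thumb(n_features: int, rows_per_feature: int = 10) -> int:
    """Return the minimum row count suggested by the 'ten rows per feature' heuristic.

    Both the name and the default multiplier are illustrative; the multiplier
    of 10 is the one quoted in the reply above.
    """
    if n_features < 0 or rows_per_feature < 0:
        raise ValueError("n_features and rows_per_feature must be non-negative")
    return n_features * rows_per_feature


# Example from the reply: a dataset with 10 feature columns.
print(min_rows_rule_of_thumb(10))  # 100
```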

Guillermo Palchik

Thank you, Shafagat Mahmudova.

My question was about something a bit different, though. If I'm dealing with a column representing a categorical variable (for instance, "color", with three options: red, green, and blue), how many entries or rows would be necessary for each of these categories?
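Whatever per-category minimum turns out to be appropriate, a first step is to count how many rows each category actually has. The sketch below does that for the "color" example. The function name `rows_per_category` is hypothetical, and the threshold passed in is an arbitrary placeholder: nothing in this thread establishes a specific per-category minimum.

```python
from collections import Counter


def rows_per_category(values, min_per_category):
    """Count rows per category and flag whether each meets a chosen minimum.

    `min_per_category` is supplied by the caller; this thread does not
    establish a recommended value, so any number used here is an assumption.
    """
    counts = Counter(values)
    return {cat: (n, n >= min_per_category) for cat, n in counts.items()}


# Made-up data for the "color" column in the question.
colors = ["red"] * 40 + ["green"] * 35 + ["blue"] * 5

# 10 is a placeholder threshold for illustration, not a recommendation.
for category, (n, enough) in rows_per_category(colors, min_per_category=10).items():
    status = "ok" if enough else "below chosen minimum"
    print(f"{category}: {n} rows ({status})")
```

Counting per category, rather than only checking the total row count, shows when one level (here, blue) is much rarer than the others even though the dataset as a whole looks large enough.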