I am working on a problem for which there is no historical data on which to build an association rule model (and eventually a decision tree model), so we are trying to generate some. Our original feature set contains 21 feature variables, each of which can take 3 to 6 categorical values (most have 5). We built a tool that randomly assigns one of the associated categorical values to each feature and then has the users make a selection based on those values. This yields roughly 5x10^13 possible combinations, which is far too many for the handful of users we plan on using.
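For concreteness, here is a minimal sketch of the kind of random-assignment tool I mean; the feature names and category sizes below are placeholder assumptions, not our actual schema:

```python
import math
import random

# Placeholder schema: 21 features, 5 categorical values each. Our real
# features have 3-6 values (most have 5), so this is only illustrative.
CATEGORY_SIZES = [5] * 21

features = {
    f"feature_{i}": [f"value_{j}" for j in range(size)]
    for i, size in enumerate(CATEGORY_SIZES)
}

def random_scenario():
    """Assign one randomly chosen categorical value to each feature."""
    return {name: random.choice(values) for name, values in features.items()}

# The combination space is the product of the per-feature category counts.
# With this placeholder schema that is 5**21 (about 4.8e14); our real mix
# of 3-6 value features comes out around 5e13.
print(math.prod(CATEGORY_SIZES))
print(random_scenario())  # one randomly generated scenario for a user to label
```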
This leads to my question: am I overthinking this? I know a number of factors affect an ML model, but for a proof of concept we are trying to keep things simple. Can we still get good data from a data set of 150 samples when there are so many possible combinations? Alternatively, since we are working with a toy problem, we could greatly reduce the number of features and make every choice binary (for example, seven binary features give only 2^7 = 128 combinations, which 150 samples could cover almost exhaustively), but that approach loses some of its luster since it leaves the user with little real decision making.
I would appreciate any thoughts.