Dear @Jesie, you are talking about a supervised model, known as the C4.5 algorithm developed by Quinlan. This kind of model involves training, pruning, and test stages. Training grows the tree: the data are split iteratively on the best feature at each node, and growing continues until a) there are no more data to split, b) a node holds too few examples to be split further, or c) all the data in a node belong to the same target class. A rough code sketch of this growing stage follows below.
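As an illustrative sketch only (not C4.5 itself: scikit-learn's DecisionTreeClassifier implements CART, a close relative, and the dataset and parameter values here are just placeholders), the stopping criteria above correspond to parameters such as min_samples_split and the purity check:

```python
# Rough sketch: CART with an entropy criterion, growing until the stopping rules fire.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

tree = DecisionTreeClassifier(
    criterion="entropy",    # choose splits by information gain, as C4.5 does
    min_samples_split=5,    # stop when a node has too few samples (criterion b)
    random_state=0,
)
tree.fit(X_train, y_train)  # growing also stops when a node is pure (criterion c)
print("test accuracy:", tree.score(X_test, y_test))
```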
After training, a pruning stage is performed to avoid overfitting. Cross-validation (k = 10 folds) should be used for more reliable results. You could use the Weka software (https://www.cs.waikato.ac.nz/ml/weka/), where C4.5 is implemented as J48, to experiment with this algorithm. I hope it is useful for you.
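Again only as an illustrative sketch (scikit-learn prunes via cost-complexity pruning, whereas C4.5/J48 uses error-based pruning, and the ccp_alpha value here is arbitrary), 10-fold cross-validation of a pruned tree could look like this:

```python
# Sketch: 10-fold cross-validation of a tree with cost-complexity pruning.
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

pruned_tree = DecisionTreeClassifier(criterion="entropy", ccp_alpha=0.01, random_state=0)
scores = cross_val_score(pruned_tree, X, y, cv=10)   # k = 10 folds
print("mean accuracy over 10 folds:", scores.mean())
```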
It becomes easier to understand if we go through the concepts of Information Gain and Entropy.
Information Gain
If you have acquired information over time that helps you accurately predict whether something is going to happen, then the news that the predicted event actually occurred carries little new information. But if things go south and an unexpected outcome occurs, that outcome counts as useful, new information.
The concept of information gain works the same way.
The more you know about a topic, the less new information you are apt to get about it. To be more concise: if you know an event is very probable, it is no surprise when it happens; that is, its occurrence gives you little information.
From the above we can formalise this: the information carried by an event decreases as its probability increases (its self-information is -log2 p). Entropy is the expected self-information, i.e. a measure of the uncertainty or impurity of a node, and the information gain of a split is the parent node's entropy minus the weighted entropy of its children. The more a split reduces the entropy remaining in the child nodes, the larger its information gain, and that is exactly the quantity C4.5 maximises when choosing the best feature. A small numeric example is given below.
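To make this concrete, here is a small numeric illustration with assumed toy counts: the self-information of a probable versus a rare event, the entropy of a node, and the information gain of a candidate split.

```python
# Toy numbers only: self-information, entropy, and information gain of a split.
from math import log2

def entropy(probs):
    """Entropy H = -sum p * log2(p), the expected self-information."""
    return -sum(p * log2(p) for p in probs if p > 0)

# A rare event carries more information than a likely one:
print(-log2(0.9))   # ~0.15 bits: very probable, little information
print(-log2(0.1))   # ~3.32 bits: surprising, much information

# Parent node: 10 positives, 10 negatives
parent = entropy([0.5, 0.5])                       # 1.0 bit

# Candidate split: left child (8+, 2-), right child (2+, 8-)
left, right = entropy([0.8, 0.2]), entropy([0.2, 0.8])
gain = parent - (10/20) * left - (10/20) * right   # weighted child entropies
print("information gain:", gain)                   # ~0.28 bits
```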