Can anyone help for KDD dataset in Weka ?

More Ahmad Bilal's questions See All

Using OBD technique i am trying to measure laser induced shockwaves velocity i found that at start velocity increases and then decay?

i am unable to interpret why its increases in start as shown in figure

11 August 2024 2,179 1 View

Gromacs first step of minimization ?

I have face this problem anyone help me how to solve this issue ?which is below Fatal error: There are inconsistent shifts over periodic boundaries in a molecule type consisting of 78 atoms. The...

07 August 2024 2,598 1 View

Why does everyone use vs code?

Visual Studio Code (VS Code) has become a popular choice among developers for several reasons: 1. **Free and Open Source**: VS Code is free to use and open source, making it accessible to...

07 August 2024 7,013 4 View

"A Markov-like Model for Patient Progression"?

A Markov-like Model for Patient Progression" Markov Chain Monte Carlo (MCMC) Markov Chain Monte Carlo (MCMC) is a powerful computational technique used to draw samples from a probability...

05 August 2024 10,079 0 View

I'm optimizing a tetra-ARMS PCR protocol with amplicon sizes: 165bp, 123bp and 93bp. Why, in the gel, only 165bp is visible while I'm expecting all 3?

It's an end-point PCR protocol. I'm using 1.5% agarose gel with SyBR Safe dye and TBE as a running buffer, visualization on BioRad XR+ system. I was primarily thinking of primer efficiency,...

01 August 2024 4,673 4 View

Question about combustion Kinetics ?

I am currently working on modeling FCC catalyst regeneration and have come across a point of uncertainty that I hope you can assist me with. In my previous models, I have utilized lumped kinetics...

30 July 2024 6,137 0 View

I have no added any resarch paper yet but showing three paper? how to delete it?

I have not addede any paper yet but it has selected 3 papers which are not mine in my account. i want to delete that information. please help me

30 July 2024 3,743 0 View

How to we see transcription activity of the bhlh79 family is at the N-terminal or C-terminal?

I'm a PhD Student. From Northwest Agriculture and Forestry University China. My Problem is this I start my work on bhlh79 transcription factor gene. I do my Y2H 5 time but the colonies appear on...

30 July 2024 1,412 8 View

How apply Kao Residual Cointegration Test in eviews?

Kao's panel cointegration tests , is there anyone willing to explain me the eviews-9 output for the Kao's panel cointegration tests?

23 July 2024 5,051 4 View

Please inform me about the International Conference on Plant Biology?

Please inform me about the International Conference on Plant Molecular Biology, or Plant Biology, scheduled for December 2024, January/February, or March 2025.

14 July 2024 3,120 7 View

Feedback defines the constitution of an organism?

“Here is a thought experiment. Let's place Rodolpho Llinas's jarred-brain on top of a body (Fig. 1). I bet Llinas would argue that his jarred-brain retains its own consciousness, and the android...

11 August 2024 2,483 1 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Can we mark 'EFL Learners shifting from general digital to AI technologies' as technological transition?

After COVID-19 it has seen that EFL learners technological affiliation has raised. In addition, in the post-COVID period learners started to engage AI technologies like ChatGPT while learning...

08 August 2024 8,964 4 View

What are examples of AI for good projects a teacher can assign to students?

So I am organizing an AI seminar. What are possible AI projects in the AI for good spirit? something the students can do and have an impact?

08 August 2024 9,437 4 View

Self-Organizing Superorganisms—as envisaged by Nenad Sestan (2018)?

The rate of glucose consumption by the neocortex is reduced by over 80% during anesthesia (Sibson et al. 1998), which disables the synapses (Richards 2002) that are inundated by glial tissue (Engl...

08 August 2024 3,118 0 View

How to design human-centered classroom in the age of A.I.?

08 August 2024 347 5 View

Do experts have journals in the field of artificial intelligence and big data that are not indexed by SCI or EI?

05 August 2024 8,836 2 View

Measuring the Intelligence of a Species?

Larger brains, which typically contain more neurons, store and transfer more information (Tehovnik and Chen 2015), but the precise relationship between number of neurons and information has yet to...

05 August 2024 1,238 2 View

What's the role of IT & AI in Telecommunication Industry?

05 August 2024 8,264 3 View

Can usage of AI tools like chat GPT in research work is recommendable ?

AI tools like ChatGPT can enhance research work significantly when used responsibly and in conjunction with thorough human oversight.

05 August 2024 1,842 3 View

Muhammad Yousefnezhad

Dear Ahmad,

I think that you can find useful information from following links. Besides, you can download these links by using http://keepvid.com website from China.

https://youtu.be/M50pQfj9ZOI

https://youtu.be/uiDFa7iY9yo

https://youtu.be/gd5HwYYOz2U

https://youtu.be/w14ha2Fmg6U

Best Regards,

Tony.

Yacine Yaddaden

Hi,

I have worked on a project a few months ago using this dataset (KDD Cup). I have used WEKA and Python. For WEKA, you have two choices: The first one is the simplest, it consists on downloading the dataset in WEKA format (.arff file) from http://tunedit.org/repo/KDD_Cup or downloading the original one from http://www.kdd.org/kdd-cup/view/kdd-cup-2009/Data and try to convert it into WEKA format.

I just want to tell you that the dataset is huge, when using WEKA, it can crash because it's too big that WEKA can't handle it. The solution is to create a subset (10%) and then try the data mining methods available on WEKA.

Don't hesitate if you have more questions.

Regards,

Yacine.

Cristian Popa

Hello!

In Weka is very easy to test different classifiers' performance one against eachother by using the Experimenter interface. Load the dataset, filter it if necessary using filters, load the classifiers one by one or all, run the experiment, see the results. Or you have the possibility to save the output as csv or whatever and have it analyzed separately.

However, the Experimenter is not a good choice for very large datasets, and (randomly) splitting the data (recommendable on class feature) is a good choice if it's conveninent for your purposes.

If you want to process very large datasets in Weka use the command-line interface (CLI), pick only fast converging algorithms and avoid cross-validation for evaluation.

The best choice are the updateable algorithms (incremental classification models, learning by using only one one instance at a time) - Weka provides a good range of such.

Good luck!