This can come from the dimensionality of your data. This phenomenon is widely described as the "curse of dimensionality", and it has always been a problem: high dimensionality often prevents error-minimization algorithms from converging properly.
This phenomenon is also known as the Hughes phenomenon!
I agree with Ludovic that the high number of dimensions (input and output variables) is one reason. Besides that, the percentage of the data held out for validation, which is used to terminate training, must be set carefully (0-100). The validation threshold, or the number of epochs, may also be set very large; a sketch of these settings follows below.
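For illustration only, here is how those termination knobs look in scikit-learn's MLPClassifier. This is an assumption, not the asker's actual Weka setup; Weka's MultilayerPerceptron exposes equivalent options.

```python
# Hedged sketch of the training-termination settings discussed above,
# using scikit-learn's MLPClassifier as a stand-in for the Weka model.
from sklearn.neural_network import MLPClassifier

clf = MLPClassifier(
    hidden_layer_sizes=(50,),  # one hidden layer of 50 units; an assumption
    max_iter=500,              # cap on epochs -- a very large value means long runs
    early_stopping=True,       # hold out part of the training data for validation
    validation_fraction=0.1,   # the "percentage size of validation" (10% here)
    n_iter_no_change=10,       # stop if the validation score stalls for 10 epochs
    tol=1e-4,                  # the improvement threshold
)
```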
Long training times, even of several days, are not unusual for Technical Neural Networks. But, as the others have mentioned before, that depends on a lot of circumstances.
So without further information about your set-up it is hard to determine if there is a real problem, or just the "normal" training time.
Can you provide more information?
Type of network, number of layers, number of neurons, size of the data set, input dimension, regression task or classification, ...
What do you mean by "...but yet to settle down?"
Are the weight changes oscillating?
Is the learning curve going up and down?
And, besides that, have you checked the TNN implementation?
The links you shared were really helpful... yes, my dataset has some 100 dimensions.
Jabar H. Yousif: Thank you, Sir, for your time. I am actually using 10-fold cross-validation.
Nils Goerke: Thank you, Sir, for commenting. Actually, I am using the Weka tool and have not tuned any parameters. My dataset has some 400,000 (4 lakh) records, each of about 100 dimensions. I am attaching a screenshot here so that you can better see whether there is a problem with the parameter settings...
Are you sure these 100 features are relevant? Are they preprocessed? Have you shuffled the entries so that sibling rows contain different classes? Have you scaled the data ((0,1) in the case of a sigmoid activation function, (-1,+1) in the case of tanh)? Check the Weka docs for which activation functions they need; a small sketch of the shuffling and scaling steps follows below.
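As a minimal sketch of those two preprocessing steps, assuming scikit-learn as a stand-in (Weka has equivalent filters such as Randomize and Normalize):

```python
# Hedged sketch: shuffle rows and rescale features before training.
import numpy as np
from sklearn.preprocessing import MinMaxScaler
from sklearn.utils import shuffle

X = np.random.rand(1000, 100)      # placeholder for the ~100-dimensional data
y = np.random.randint(0, 3, 1000)  # placeholder class labels

# Shuffle so that sibling rows mix classes instead of arriving sorted by class.
X, y = shuffle(X, y, random_state=42)

# Scale to (0, 1) for sigmoid units, or (-1, +1) for tanh units.
X_sigm = MinMaxScaler(feature_range=(0, 1)).fit_transform(X)
X_tanh = MinMaxScaler(feature_range=(-1, 1)).fit_transform(X)
```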
Again, what are your outputs? They can be coded in many ways, but this is the usual approach:
Class -> Expected ANN Output
1 -> 1 0 0 (or +1 -1 -1 if tanh activation is used)
2 -> 0 1 0 (or -1 +1 -1)
3 -> 0 0 1 (or -1 -1 +1, respectively)
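For illustration, a tiny numpy sketch of this one-hot coding (the three class labels are assumed):

```python
import numpy as np

labels = np.array([1, 2, 3, 1])   # hypothetical class labels
one_hot = np.eye(3)[labels - 1]   # rows follow the table above: 1 0 0, 0 1 0, ...
tanh_coded = 2 * one_hot - 1      # maps 0/1 to -1/+1 for tanh output units
```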
As was already mentioned, check how your training and validation curves are behaving. You should get graphs like those at http://stats.stackexchange.com/questions/131233/neural-network-over-fitting and apply early stopping (or at least set an appropriate maximum epoch count to get a result similar to the first image); a plotting sketch follows below.
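One hedged way to inspect those curves, again assuming scikit-learn rather than the asker's Weka setup:

```python
# Sketch: fit a small network and plot its training/validation curves.
import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier

# Synthetic placeholder data roughly matching the asker's 100 dimensions.
X, y = make_classification(n_samples=2000, n_features=100, n_classes=3,
                           n_informative=10, random_state=0)

clf = MLPClassifier(early_stopping=True, validation_fraction=0.1,
                    max_iter=300, random_state=0).fit(X, y)

plt.plot(clf.loss_curve_, label="training loss")
plt.plot(clf.validation_scores_, label="validation accuracy")
plt.xlabel("epoch")
plt.legend()
plt.show()
```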
One way to resolve this kind of problem is to apply a dimensionality reduction method in order to reduce the data dimension while preserving the essential information.
In this sense, different approaches already exist (linear vs. nonlinear); a sketch follows below.
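As a minimal sketch of the linear case, here is PCA via scikit-learn (assumed as a stand-in; Weka offers a PrincipalComponents filter):

```python
# Hedged sketch: reduce 100 dimensions while preserving most of the variance.
import numpy as np
from sklearn.decomposition import PCA

X = np.random.rand(1000, 100)   # placeholder 100-dimensional data
pca = PCA(n_components=0.95)    # keep enough components for 95% of the variance
X_reduced = pca.fit_transform(X)
print(X_reduced.shape)          # usually far fewer than 100 columns remain
```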
Did you try rescaling the data to the (-1, 1) range (see the scaling sketch above)? This can sometimes help. I am working on exactly this problem: why some computational intelligence algorithms, when working with multidimensional data, perform better when it is rescaled to this range instead of (0, 1).
Of course, the dimensionality, as mentioned above, is probably the main problem...
Did you get any error, or even the classifier model, in the output? If not, then there is probably a problem either with the algorithm or with the heap size.
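If it is the heap, the JVM memory limit can be raised with the standard -Xmx flag when launching Weka (the 4g value below is just an example, not a recommendation for your machine):

```
java -Xmx4g -jar weka.jar
```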