In speech based emotion recognition, for individual artists almost get 90 percentage of accuracy using neural network, but accuracy is decreased up to 70 percentage for entire speech corpus.
How can I increase accuracy?
My speech database contain 10 artists.