11 November 2016

I have 10 earthquakes, each with two input variables, time and acceleration (PGA), for a total of 50,000 instances. How do I decide which instances should be used for training and which for testing?

For example, Earthquake 1 has T vs. PGA from 0 sec to 99 sec, a total of 5000 instances; Earthquake 2 has T vs. PGA from 0 to 65 sec, a total of 3800 instances; and so on, for a total of 50,000 instances. So how should I choose the data instances for training? Should I use sub-clustering? Please help.

Thank you
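
To make the data layout above concrete, here is a minimal sketch of one possible option: holding out whole earthquakes for testing, so that no record contributes samples to both sets. This is only an illustration under assumptions, not a recommended or definitive method. The function name `split_by_earthquake`, the `quake_ids` list (one earthquake label per instance), the 20% test fraction, and the tiny made-up record sizes are all hypothetical.

```python
import random


def split_by_earthquake(quake_ids, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx) so that no earthquake appears in both sets.

    quake_ids: one earthquake label per instance (hypothetical layout).
    """
    unique_ids = sorted(set(quake_ids))
    rng = random.Random(seed)
    rng.shuffle(unique_ids)

    # Hold out at least one whole earthquake for testing.
    n_test = max(1, round(len(unique_ids) * test_fraction))
    test_ids = set(unique_ids[:n_test])

    train_idx = [i for i, q in enumerate(quake_ids) if q not in test_ids]
    test_idx = [i for i, q in enumerate(quake_ids) if q in test_ids]
    return train_idx, test_idx


# Made-up, tiny stand-in for the real records: 5 earthquakes of unequal length.
quake_ids = [1] * 5 + [2] * 4 + [3] * 6 + [4] * 3 + [5] * 2
train_idx, test_idx = split_by_earthquake(quake_ids, test_fraction=0.2)

print("train instances:", len(train_idx))
print("test instances:", len(test_idx))
print("held-out earthquakes:", sorted({quake_ids[i] for i in test_idx}))
```

Splitting by earthquake rather than by individual instance is assumed here because samples within one record are consecutive in time and strongly correlated; whether that suits the actual model is an open question.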
