hi guys, i am working on dataset with 1M user and 943,347 item , but i want to consider less than, for example 10,000. my platform is weka. my reason is that ram of laptop can not process on 1M. i need a strong reason that why i consider less of data?
My work is not on big data. i want to know this number (1M) is big data, or need parallel systems or distributed systems? thanks