If you are new to this field, I also recommend the very nice introductory book "Mining of Massive Datasets". The latest version is also available for free from the authors, see the attached link.
If you are new to this field, I also recommend the very nice introductory book "Mining of Massive Datasets". The latest version is also available for free from the authors, see the attached link.
For near real time you will have knowledge of unsupervised or supervised as well semi-supervised learning in machine learning. I would like to know first what type of knowedlge do you have. But generally my suggestion is below,
What is Data-structure.
Database concepts
Data classifications
Data Mining
Machine learning
Big Data
if you will have any further question then you are welcome!
BigData Mining is still a complex problem to solve. BigData means data which is huge (could be either structured or unstructured) and cannot be put on one machine and processed. Data Mining or using Machine Learning Algorithms or Statistical Algorithms are algos used to understand data and fit them in some model or learn something new out of it.
First you need to learn the data engineering so that you know how to handle BigData there are couple of open framework available like Hadoop, Spark. Also learn NONSQL Database for unstructured data.
Unless you are expert in mining algos, learn them to understand data. This is never easy all those regression models. Feature selection, classification etc
Once you know both then you can solve BigData Mining Problem. Due to large amount of data there are problems modeling. In regression for example any model will give you best results (R square value) and it may be possible that every model will fit (overfit) the data. So it is a challenge.
Just to give you can idea here we are blogs http://www.bigstem-analytics.com/blog.html ( from a startup I founded in this area).
For information on tools used for big data application: http://www.snia.org/sites/default/education/tutorials/2013/fall/BigData/SergeBazhievsky_Introduction_to_Hadoop_MapReduce_v2.pdf
To know about the importance of big data, specially in business environment: https://www.pwc.se/sv_SE/se/teknologi/assets/an-introduction-to-big-data.pdf
For network and infrastructure related to big data: http://www.juniper.net/us/en/local/pdf/whitepapers/2000488-en.pdf
For general introduction on big data: http://www.planet-data.eu/sites/default/files/presentations/Big_Data_Tutorial_part4.pdf
If your data are georeferenced, herewith some unique perspectives:
Jiang B. (2015a), Head/tail breaks for visualization of city structure and dynamics, Cities, 43, 69-77.
Jiang B., Yin J. and Liu Q. (2014, accepted), Zipf’s Law for all the natural cities around the world, International Journal of Geographical Information Science, xx(x), xx-xx, Preprint: http://arxiv.org/abs/1402.2965
Jiang B. and Miao Y. (2014, accepted), The evolution of natural cities from the perspective of location-based social media, The Professional Geographer, xx(xx), xx-xx, DOI: 10.1080/00330124.2014.968886, Preprint: http://arxiv.org/abs/1401.6756
Jiang B. and Liu X. (2012), Scaling of geographic space from the perspective of city and field blocks and using volunteered geographic information, International Journal of Geographical Information Science, 26(2), 215-229. Reprinted in Akerkar R. (2013, editor), Big Data Computing, Taylor & Francis: London.
The issues of the use of information contained in Big Data database systems for the purposes of conducting Business Intelligence analyzes are described in the publications: