Hi guys,
I'm still newbie in big data analysis.
I'm currently looking to do incomplete data analysis for the big data in R rattle package.
I refer this book for my reference to do an analysis but it is not specifically focus on the big data.
http://mineriaddatos.wikispaces.com/file/view/Data+Mining+With+Rattle+and+R_+The+Art+of+Excavating+Data+for+Knowledge+Discovery+-+Graham+Williams.pdf
I wish to get your knowldege sharing or opinion to do analysis the big data:
1) Any recommended data set that pretty enough with the big data size?
2) What is best size of data set that we should considered as a big dataset?
3) Any the best recommended tutorial to practice for the incomplete big data analysis ?
Thank you in advance