Pig and Hive are useful when you want to analyze files stored in HDFS. If you have a CSV file, first explore it in R or Weka, which give you better tools for exploration. You can also use RapidMiner. Pig and Hive do not provide algorithm-based data exploration!
Big data are very likely to be heavy-tailed, so head/tail breaks is a useful tool; see examples in the following papers:
Jiang B. (2015), Head/tail breaks for visualization of city structure and dynamics, Cities, 43, 69-77, Preprint: http://arxiv.org/ftp/arxiv/papers/1501/1501.03046.pdf
Jiang B. and Miao Y. (2014, accepted), The evolution of natural cities from the perspective of location-based social media, The Professional Geographer, xx(xx), xx-xx, DOI: 10.1080/00330124.2014.968886, Preprint: http://arxiv.org/abs/1401.6756
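To give a feel for what head/tail breaks does with heavy-tailed data, here is a minimal Python sketch. It is not taken from the papers above: the function name `head_tail_breaks`, the `head_limit` parameter, and the 40% cutoff are assumptions for illustration, and published variants differ in their exact stopping rule. The idea is to split the values at their mean, keep the part above the mean (the head), and repeat as long as the head remains a small minority.

```python
def head_tail_breaks(values, head_limit=0.4):
    """Return class breaks (successive means) for heavy-tailed data.

    head_limit is an assumed cutoff: splitting continues only while
    the head (values above the mean) is at most this share of the data.
    """
    data = list(values)
    breaks = []
    while len(data) > 1:
        mean = sum(data) / len(data)
        breaks.append(mean)
        # The head is everything strictly above the current mean.
        head = [v for v in data if v > mean]
        # Stop when the head is too small to split again or is no
        # longer a clear minority of the current data.
        if len(head) <= 1 or len(head) / len(data) > head_limit:
            break
        data = head
    return breaks


# Hypothetical sample: many small values and a few large ones.
sample = [1] * 8 + [2] * 4 + [10, 20, 30, 100]
print(head_tail_breaks(sample))  # [11.0, 50.0]
```

On roughly uniform data the head is about half of the values, so the loop stops after a single break. That is the expected behavior, since the method is meant for heavy-tailed distributions.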