For the intended objective(s), the information content in a dataset, however BIG, is NOT necessarily proportional to the size of the data. Your thoughts?
Brijesh - I used the adjective BIG (before DATA) as this is what has become a buzz word today, which, as I understand, essentially implies LARGE (amount of) data. Do you have something else in mind?
Big data is not only used for large amount of data but it has characteristics of 3Vs, Volume(large amount of data), Variety(structured, unstructured, or semi structured data), and Velocity(speed of data generation).
we know about large amount of data but big data may not be in a conventional form of tables all the times, instead it might be in the form of text or multimedia, same way it might be generated very quickly, for eg. data generated from social networking sites.
so when we discuss about big data we need to consider all these 3Vs. people nowadays adding few more Vs like veracity(correctness of data).
Following paper may be useful for further investigation...
Hi Subash -- there was an international conference recently on BD, you may want to check some of the background papers and documents for more info : http://unstats.un.org/unsd/trade/events/2014/Beijing/
The meeting of UN Stat Commission last week in NY also included discussions and new policy resolutions on BD; a lot of progress is being made but seems more in a few domains (mobile data, geospatial,..) in specific countries. Time to develop more methods and applications!
Thanks Rachid. Have seen some of these. Will have a look at others. The point I was however trying to make is that BIG (or large) data doesn't necessarily mean that the data has large information content in it. This is because of the potential of strong temporal and/or spatial correlation in the data. I do not mean to say this makes the data useless - what is important is that to have confidence in and to get realistic results from large data, any approach to data analysis needs to account for this correlation.
The BIG Data is informative if it is really BIG( Basic Information Generation)!. It is possible to get much information content from small dataset and little information from the so called BIG data set.
Currently, the amount of information collected in Big Data database systems is growing rapidly. Many companies that have their own Big Data database systems containing huge amounts of information downloaded from the Internet carry out analyzes of these data, the results of which are used in business management. Many companies use these resources to manage their business. When processing information in the cloud, ie information collected in Big Data systems, the sentiment analysis uses the results of the analytical work carried out in this way for the strategic management of the company.
I invite you to the discussion
The issues of the use of information contained in Big Data database systems for the purposes of conducting Business Intelligence analyzes are described in the publications: