I'm very new in the field of big data analysis and I strongly believe there is potential know-how that could be beneficial also in the field of corpus linguistics. Has anybody ever tried to merge corpus linguistics and big data methodologies?
Hi, I think the answer to your question depends on exactly what you mean by "big data tools and methodologies". There's certainly a bit of data-driven, bottom-up, corpus-linguistic research about, if that was the aspect you had in mind. The definition of "big data" itself also matters. I guess a type of digital humanities type of research that fits the bill would be "culturomics" studies such as this: http://www.sciencemag.org/content/331/6014/176.abstract.
Hi. I quite agree with Gard Jenset that your question depends on exactly what you mean by big data tools and methodologies. It involves linguistics, corpus and statistics, which i could help if you more detail your question.
BigData: The very question is whether there exists a schema (type) or not. If there is a schema we are very close to databases, except for the huge extension of "big". If not then information retrieval comes into play. with its indexing techniques.