How to extract relevant words from a large corpus?

06 June 2018 1 7K Report

hi,

In large corpus which contains too many noisy and irrelevant data. How we can detect relevant and important words from that corpus?

For instance, in a movie reviews corpus there will be loads of irrelevant words or aspects which will be irrelevant to the movie domain. Therefore, what are the ways to filter out these irrelevant aspect/words.

By searching, I have found that word2vec builds a vocabulary for a specific domain and discovers similarity between words. Another method is Semantic Web for extracting relevant information. Please guide me the convenient way of filter word irrelevant words

Ajit kumar Roy

I think boolean search is useful.

Badges
Science topic

Similar topics
Phytochemistry
Extracts

More Jibran Mir's questions See All

What is the word length allowed in Data & Knowledge Engineering journal or it can accept any word length?

I have a long research article for submission in DKE journal, please, guide me is there any word length or is it limits the number of pages in the article?

08 September 2018 8,178 2 View

How to remove Noise from longer Text?

In a long text review, some of the sentences poses opinion and many sentences are none opinion. Moreover, sometime even many paragraphs in a long review article has no opinion. Therefore, to...

04 May 2018 5,971 4 View

can unsupervised method be evaluated in terms of Precision, recall and f1 measure?

I Know that the supervised method is evaluated in terms of precision, recall and f1 measure. Therefore, what evaluation criteria is used for the evaluation of unsupervised method? Can an...

01 February 2018 1,455 3 View

In aspect based sentiment analysis for the validation of proposed method, the only measures are precision, recall n f1 or r there other as well?

I need to know how aspect based sentiment analysis methods have been evaluated. Precision recall and f1 measures are the only measuring terms. However, I have seen Rand Index and I dont know why...

31 December 2017 9,422 4 View

Implicit Sentiment Identification from Reviews of critics

hi, I have to identify implicit aspects or sentiments in the review text written by critics. However, these critical reviews are much different and difficult then product or user reviews. for...

01 January 1970 6,988 6 View

Need suggestions about Suitable Journal

I have written a survey paper on aspect based sentiment analysis techniques. I am search a suitable journal for my paper, however, I have already submitted my paper in few journals. Unfortunately,...

01 January 1970 6,992 5 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

The aqueous fraction of the hydroethanolic extract is showing the presence of palmitic acid. What is the mechanism responsible ?

Palmitic acid presence in aqueous fraction

05 August 2024 8,624 4 View

Which solvent is better to dissolve with secondary metabolites extracted from fungi?

I work on MCF7 cell cell for anticaner purpose and I wa to do drug preperation the drug ( secondary metabolites extracted from Aspergillus) My question which solvent is better with these secodary...

03 August 2024 4,725 2 View

For systematic review data extraction, should I use ITT or PP analysis, and count all randomized participants or only those who completed the study?

I am conducting a systematic review and meta analysis; as I am extracting data I realized there is a consort diagram that shows the number of patients from randomization till end of the study. So...

01 August 2024 9,993 3 View

For the moringa oleifera extract using ethanol as the solvent,what is the alternative method to concentrate the extract other than rotary evaporation?

99% pure ethanol was used for maceration also alternate methods along with the temperature and time to concentrate the extract can be specified

31 July 2024 5,113 4 View

What are the factors that causes the sample to have higher efficacy at lower concentration?

We conducted an antibacterial study of a plant extract. Varying concentrations of crude extract were subjected to microbroth dilution assay. The result showed that only the lowest concentration of...

31 July 2024 8,666 3 View

What Are the Best Alternatives to the MTT Assay for Assessing Cell Toxicity of Colored Herbal Extracts?

I am currently investigating the cytotoxicity of a series of herbal extracts, and like many studies, I have been using the MTT assay to evaluate cell viability. However, I am encountering a...

31 July 2024 193 4 View

How can I extract my bibliography from researchgate ?

how can I extract my bibliography from researchgate ?

28 July 2024 6,737 1 View

How can biogenic synthesis techniques be applied to develop plants capable of extracting and processing heavy metals from contaminated soil, and wha?

Understand how to utilize biosynthesis to develop natural and sustainable solutions to address heavy metal pollution and improve environmental and agricultural conditions.

26 July 2024 1,469 1 View