What is the most appropriate information retrieval model for short texts?

Fadoua Ataa Allah

For information retrieval, you will find in this thesis many experiments comparing the effects of several weighting schemes and natural language processing techniques on short and long queries.

Hope it will help.

Good luck

Thesis: Information Retrieval: Applications to English and Arabic Documents

Tulu Tilahun Hailu

Dear Mosab,

For information retrieval: bigram and statistical co-occurrence models. For sentiment analysis: lexicon-based opinion classification and summarization. I think the following link might help you find related papers:

https://www.researchgate.net/profile/Tulu_Tilahun

Regards

Tulu Tilahun

Lecturer at Arba Minch University

Ethiopia

Alain Lesaffre

Hi Mosab,

For short texts, you can first clean your text: remove the stop words, apply stemming, and then build the term-document matrix. This process can be implemented in a few lines of code (R or another language). From the term-document matrix you can build your metrics, such as TF-IDF or others, depending on your context.
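The steps above can be sketched in Python as follows. This is only a minimal illustration under my own assumptions: the stop-word list is a tiny hypothetical one, `crude_stem` is a rough stand-in for a real stemmer (Porter's, for example), and `log(N / df)` is just one common TF-IDF variant.

```python
import math
import re
from collections import Counter

# Hypothetical, tiny stop-word list; a real project would use a fuller one.
STOP_WORDS = {"a", "an", "and", "is", "of", "the", "to", "in", "for"}

def crude_stem(word):
    """Very rough suffix stripping; a stand-in for a real stemmer."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def preprocess(text):
    """Lowercase, tokenize, drop stop words, then stem."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [crude_stem(t) for t in tokens if t not in STOP_WORDS]

def term_document_matrix(docs):
    """Return (vocab, counts); counts[i][j] is the count of term i in doc j."""
    tokenized = [Counter(preprocess(d)) for d in docs]
    vocab = sorted(set().union(*tokenized))
    counts = [[doc[term] for doc in tokenized] for term in vocab]
    return vocab, counts

def tf_idf(counts):
    """Weight raw counts by log(N / df), one common TF-IDF variant."""
    n_docs = len(counts[0])
    weighted = []
    for row in counts:
        df = sum(1 for c in row if c > 0)  # number of docs containing the term
        idf = math.log(n_docs / df)
        weighted.append([c * idf for c in row])
    return weighted

docs = [
    "Searching short texts",
    "Ranking of short queries",
    "Stemming the queries",
]
vocab, counts = term_document_matrix(docs)
weights = tf_idf(counts)
```

With short texts most counts are 0 or 1, so the IDF part of the weight does most of the work of separating informative terms from common ones.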

Then, as Tulu mentioned, you can build the n-grams and later the chains (HM) if you need to do some prediction.
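As a rough illustration of the n-gram step, here is a small Python sketch. Note the assumption: for prediction it uses a plain first-order Markov chain over bigram counts, which is simpler than a hidden Markov model, and the function names and sample sentences are made up for the example.

```python
from collections import Counter, defaultdict

def ngrams(tokens, n=2):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i : i + n]) for i in range(len(tokens) - n + 1)]

def bigram_transitions(sentences):
    """Count, for each word, which words follow it across all sentences."""
    transitions = defaultdict(Counter)
    for sentence in sentences:
        for first, second in ngrams(sentence.lower().split(), 2):
            transitions[first][second] += 1
    return transitions

def predict_next(transitions, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = transitions.get(word)
    if not followers:
        return None
    return followers.most_common(1)[0][0]

sentences = [
    "short text retrieval",
    "short text classification",
    "short query expansion",
]
transitions = bigram_transitions(sentences)
print(ngrams("short text retrieval".split()))
print(predict_next(transitions, "short"))
```

Here "text" follows "short" twice and "query" once, so the prediction for "short" is "text".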

One point to keep in mind is the number of documents: if you have an enormous amount, you will need a distributed back end (a NoSQL database, or a framework such as Hadoop) to do the preprocessing.

Hope this helps a little.

Alain

Mohamed Mohsen Gammoudi

Dear Fadoua,

I think that the best model for short texts is the vector space model.

Read: http://www.csee.umbc.edu/~ian/irF02/lectures/07Models-VSM.pdf
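To make the vector space model concrete, here is a minimal Python sketch that ranks documents against a short query by cosine similarity. It is an illustration under simple assumptions (raw term frequencies, whitespace tokenization, made-up sample documents), not the exact formulation from the linked lecture.

```python
import math
from collections import Counter

def to_vector(text):
    """Represent a text as a sparse term-frequency vector."""
    return Counter(text.lower().split())

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def rank(query, docs):
    """Return (score, doc) pairs sorted from most to least similar."""
    q = to_vector(query)
    scored = [(cosine(q, to_vector(d)), d) for d in docs]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)

docs = [
    "vector space model for retrieval",
    "latent semantic analysis",
    "retrieval of short text",
]
ranking = rank("short text retrieval", docs)
for score, doc in ranking:
    print(f"{score:.3f}  {doc}")
```

Because cosine similarity normalizes by vector length, a short document that shares most of its terms with the query can outrank a longer one that shares only a single term.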

Best regards

Dear Mohamed,

Please, can you specify, with reference to the attached file, why you consider VSM to be the best?

Don't forget that LSA and VSM share some common advantages.

Regards,