How can I test a new short similarity measure?

More Mohammed Bekkali's questions See All

Is skin yellowness an numerical or ordinal variable?

I have a response variable called skin yellowness, which I will measure via a scored color chart, whereby 1 is pale yellow and 15 is orange. I'm not sure if this counts as an ordinal variable,...

11 August 2024 4,793 1 View

• What the possible Persistent Organic Pollutants and Heavy metals present in fluorspar, sediments, and water bodies around its mining area?

Approximate concentrations are require in compared with the WHO permissible limts

11 August 2024 2,723 1 View

Do you think can be any Uranium bearing rocks in Eastern part of Iran and western part of Afghanistan?

I want to know more about Uranium ore deposits in world.

11 August 2024 6,720 0 View

Do you think can be any diamond bearing rocks in Eastern part of Iran and western part of Afghanistan?

I want to know more about diamond ore deposits in world.

11 August 2024 2,167 1 View

What is the difference between mathematical R^4 space and physical 4D unit space?

We assume that the difference is huge and that it is not possible to compare the two spaces. The R^4 mathematical space considers time as an external controller and the space itself is immobile in...

10 August 2024 6,678 14 View

If Banks do not provide credit facility, what are the options available for FPOs and impact on producer’s income?

10 August 2024 8,198 5 View

Controlling for pupil light reflex when analyzing pupil size time course?

I used eye tracking to examine how participants from two different populations (A and B) react to an image. Participants in population A exhibit larger pupil sizes over time, but they also have...

10 August 2024 3,229 0 View

What are a “Farmers Producer Organization” (FPO) and its essential features?

10 August 2024 477 5 View

Strugglling with m6A dot blot any suugesstion ?

I have been doing the m6A dot blot for a while with no improvement, I am extracting the RNA, and I can see the dots although the three biological replicas give a different reading on the memberan...

10 August 2024 8,539 5 View

Do interactions between biosphere, carbon cycle, & water cycle impact global warming & interaction between atmosphere & hydrosphere?

How do interactions between the biosphere, the carbon cycle, and the water cycle impact global warming and interaction between the atmosphere and the hydrosphere?

09 August 2024 3,291 2 View

The Bigger You Are, the Harder You Fall (some lessons from Dinosaurs)?

Evolutionary fitness is based on an organism’s ability to adapt rapidly to changing environmental circumstances. Large-bodied mammals have been equipped with large brains (and hence a high...

06 August 2024 4,849 2 View

Are there any instruments for studying time similar to the way it is in space?

There are a huge number of methods for studying objects in space, according to the senses (and not only). Mechanical, thermal, optical, acoustic, electrical, magnetic, based on particle beams,...

06 August 2024 7,102 0 View

Are air moisture harvesting technologies effective in combating desertification?

Air moisture harvesting Air water collection devices

06 August 2024 5,473 2 View

How to report results of Generalised Linear Mixed Models in a journal article?

Hi everyone, If you have written or come across any papers where Generalised Linear Mixed Models are used to examine intervention (e.g., in mental health) efficacy, could you please share the...

04 August 2024 4,130 4 View

Repeated measures ANOVA, ANCOVA or Regression?

Would anyone be able to advise me... I have an RCT with a control and experimental group. Participants were recruited from one school (n=59). Participants were assessed using repeated measures on...

04 August 2024 4,040 6 View

What is the best sampling strategy?

I am conducting a qualitative study that uses interviews to investigate the perceptions of teachers about a particular leadership practice and I am focusing on 3 schools which have a total number...

01 August 2024 8,457 10 View

State of art in natural disasters?

Are increasing the costs of disasters in the affected countries.

01 August 2024 1,794 2 View

Broca’s area must be intact for the learning of new movement sequences?

When the eyes of a person are damaged this causes complete blindness. Likewise, when Wernicke’s and Broca’s areas of neocortex are damaged this causes complete aphasia, losing the ability to...

01 August 2024 6,744 2 View

Should I remove an item from a scale to raise Cronbach's alpha and McDonald's omega or is it better to leave it if they are both over .7 already?

Hello! I have this scale which had 10 items initially. I had to remove items 8 and 10 because they correlated negatively with the scale, and then I removed item 9 because Cronbach's alpha and...

01 August 2024 4,606 7 View

How can I get my Granzyme B flow cytometry stain to be consistent?

I have used PE and PE-Dazzle 594 fluorochromes and have managed to get NK cells to properly show GranzymeB expression after 4 hr PMA/ionomycin stimulaton, but for some reason my CD8 cells in the...

01 August 2024 7,677 2 View

Jolanta Mizera-Pietraszko

You need to use a couple of Machine Translation tools to produce so called Candidate Translation equivalents of your short text. Your short text should be treated as a Reference Translation. Then you can test your measure or compare it to BLUE, METEOR or NIST.

Mohammed Bekkali

sorry, but i don't understand way should i use a machine translation tools; because what i try to do is calculating the similarity between a pair of short text writtren with the same language. for example what the degree of similarity between "united state president" and "Barak obama" even if they don't share any terms but the degree of similarity is too high

What similarity measure are you going to use?

What factors are measured - what each of your varieties represent?

What research goal do you plan to achieve? E.g. compare with some other measurements to prove that for Arabic it works better or worse and to what degree?

there is some measure like the cosine, jaccard and overlap and they are based on the common words between texst for computing the similarity...and since the short text do not provide enough contextual information we have developped a new method for computing the similarity between a pair of short text..and i ask whats the best factor we have to use in order to test the effectiveness of our method (for example for text categorization we use f1-measure which combine precision and recall to test the effectiveness of a text categorization system)

The example of the US President is rather more like Question Answering when your system works on a standalone database from which the right information snippet is extracted. It does not have much to do with similarity because the name of the President depends upon the tenure. For F1-score you need to provide a number of the system responses that should have been returned, I wonder how you do that. Although I know nothing about the concept of your measure, I would find some other measures applied to short texts only for comaparison which is always the best way to show how better it is from those well-known measures in science, of course not cosine similarity, Jaccard coefficient, mapping etc.