What would be the best way to find and show the overlapping in binary data sets in a big sample (Phi coefficient or something like Jaccard coef)?

30 July 2019 1 1K Report

I have very little background in statistics, so I need some advice. I know that Jaccard is used to compare the similarity (elements in data sets). But can I use the Jaccard coefficient (or something similar – any suggestions?) also to compare the overlapping of for example two symptoms (binary data, asymmetric) in a large sample? Should I then treat the patients as elements in the datasets of symptoms (so I count the yes/yes, yes/no, no/yes, while no/no is considered irrelevant)? For example I have 801 patients, and I want to know the overlapping of two (or more) symptoms (for example those who have nausea and also have vomiting). How would I calculate it? I took the phi coefficient (would it be better in this case to stick to it?) but I doubted – the symptoms are not necessarily correlated, but may simply co-exist. So I though that I just want to show the co-existence of symptoms. Should I simply show the percentages? Gratefully,

Liidia

Sal Mangiafico

The phi statistic would include the no/no in the calculation. I imagine this is not desirable here. What you are describing is used in a different situation than the Jaccard index is usually used, but the idea is the same. Really, you are just describing the proportion of cases where the two symptoms occur together out of all cases where at least one symptom occurs. It might be fine to describe it this way. Below is some R code. You can run it at http://rdrr.io/snippets . You might want to play with the numbers a bit just to get a handle on what the output means. In the example given, Nausea and Vomiting occur together in 50% of cases where at least one symptom occurs.

Input =("

Nausea VomitYes VomitNo

Yes 100 50

No 50 NA

Matrix = as.matrix(read.table(textConnection(Input), header=TRUE, row.names=1))

Proportion = Matrix[1,1] / ( Matrix[1,1] + Matrix[1,2] + Matrix[2,1] )

Proportion

Badges
Science topic

More Liidia Meel's questions See All

I am having problems quantifying TVC results due to Biofilm?

Hi. Im looking for some help. Im working on a Bacillus strain that forms Bio-film, it is growing extremely well reaching a high OD in a 5L Fermenter, however I am struggling to quantify it. Im...

09 October 2019 8,309 5 View

What is the reason for coating sagging/curtains on concrete?

It is moister compatible corrosion resistant moisture cure 2k epoxy system, VS 70%, specific gravity 1.25. There is no any fault in mixing ratio, apply by airless spray and tried by brush also too...

14 September 2015 6,842 1 View

What will be the suitable mulch (plastic mulch/straw mulch) for the rainy season in cucurbitacious crop in an arid region?

Please give the answer with suitable references.

18 January 2013 8,581 5 View

Rainfed vegetable production

Harvested water from field and vegetable production

07 September 2011 4,237 1 View

Interpretation of simple mediation analysis - individual paths or overall indirect effect that matters?

Hello, I would really appreciate some help interpreting the output from my simple mediation analysis using PROCESS macro in SPSS please. For context, the X predictor is severity of nausea and...

13 June 2024 9,597 3 View

I am seeking assistance in identifying the nature of the blood clot?

Today, I discovered a blood clot in the vicinity of my cat's vomit in the garden. I hypothesize that the vomit may have caused this condition, but I am uncertain. Could you please provide me with...

22 April 2024 1,638 2 View

Explain the complications and management strategies associated with paediatric anaesthesia, including emergence delirium, postoperative vomiting?

Complications associated with paediatric anaesthesia can arise due to various factors, including the child's physiological differences, underlying medical conditions, surgical procedures, and...

09 April 2024 5,900 1 View

How does Zofran (ondansetron) work to decrease nausea in relation to serotonin?

23 March 2023 9,678 1 View

Advice managing gastroparesis after renal transplant?

Hi everyone. We generally don't see many diabetic patients with gastroparesis make it to transplant. We have one such patient transplanted five years ago who spent most of the two years after...

30 March 2022 3,660 2 View

How can I measure the severity of disease or symptoms (categorical data) ?

we studying the nausea and vomiting because of chemo therapy for patients with cancer. how to measure the severity of nausea and vomiting.

16 October 2021 7,033 2 View

A 10-month infant after a congenital diaphragmatic hernia surgery not able to eat - what might be the cause?

A 10 month premature baby is not able to eat on his own. It had a CDH surgery right after birth and a re-herniation at 6 months. The smallest amount of food causes choking and vomiting, visible...

24 August 2021 1,900 2 View

Does trace of blood in acute pyelonephritis most likely contribute to the reagent strip protein result of30 mg/dL?

A 30-year-old woman is seen by her physician. She has a temperature of 101°F and reports nausea and headache, with flank (below ribs and above iliac crest) tenderness and pain. When asked, she...

01 March 2021 2,155 2 View

What is the main cause of posterior fossa tumor formation in child ?

Posterior fossa tumors are predominantly seen in children with a peak incidence in the first decade. The most common presenting symptoms are raised intracranial pressure with headache and...

31 January 2021 895 4 View

How commonly do you encounter Rummination Syndrome in your GI practice?

Do diaphragmatic breathing and speech therapy actually help? How often did you require multidisciplinary team approach?

17 January 2021 6,370 0 View