Controlling for multiple dichotomous variables and calculating effect size

More Michael Tsikerdekis's questions See All

Does my affordance identification framework make sense?

I am experimenting with finding a way to trace behaviors back to design features for a game and I figured affordances may be the way to go. The goal is through participant observation and...

09 October 2015 6,407 3 View

When does using binary variables converted from numeric in linear regression make sense?

I have a numeric output variable and a numeric predictor in a small sample size. For example, my output variable is percentage of domestic abuse per each state and my predictor is the percentage...

01 February 2015 838 12 View

Combining probabilities from two models (Bayesian approach?)

I have two models with the same binary dependent variable but different independent variables. I haven't used all IV in one model because: a) for the second set of IVs there are missing data for...

11 December 2013 6,299 13 View

Linear regression with compositional data: need example on dealing with this

I have a dataset consisting of proportion variables as independent variables. I need to run a linear regression however there is the issue of multicollinearity. I've read that using a centered log...

02 March 2013 1,596 8 View

How can I compare the number of sentences and words between two languages while controlling for natural language variation?

I have a set of paired texts in English and Spanish. I used the punkt tokenizer with pre-trained packages that can be found in the NLTK package for python. It works effectively however I want to...

02 March 2013 1,563 18 View

How to deal with percentage metric and "missing" data.

I have a metric that produces a percentage of the total number of registered users over the total number of all users. Question is that for a third of cases in my dataset, the total users are zero...

01 February 2013 1,176 25 View

Any good books about designing social media?

I am looking for books for software engineers and hci researchers focused on social media that will have enough scientific depth to be included as lectures for a course on social media. Any ideas?

02 March 2012 2,381 9 View

Two sample Bayesian hypothesis testing for nonparametrics

Hi everyone, I want to test for differences between two independent samples using bayesian hypothesis testing.(the equivalent test here would be a Mann-Whitney U test). Since i am no statistician...

02 March 2012 4,687 12 View

Equivalence & noninferiority testing in social science?

I would like to hear your thoughts on the topic. You see most of the text books tend to argue that the null hypothesis can never be proven because of falsifiability etc. So i have several...

01 February 2012 9,900 5 View

Which comparative analysis to use for rank ordered data?

I have two separate groups which are asked under different conditions to sort a list of items in terms of preference(building a hierarchy). The total number of items is 5 and the data is...

01 February 2012 10,062 43 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Can I base on reverse DNA sequences to perform alignment, convert to amino acids and GenBank submission?

I have reverse sequences (AB1 format), can I base on reverse DNA sequences to perform nucleotide alignment, convert nucleotides to amino acids and deposit the sequence in GenBank database?

11 August 2024 5,138 1 View

Baseline drift in HPLC? What causes this?

Hello, Why do i see this baseline drift when i compare my blank (black) to the sample (blue)? Any suggestions as to why this happened? Thank you!

11 August 2024 3,770 4 View

Text-Communication from the M1 Hand Area using BCI—and then there is Elon Musk?

Willett, Shenoy et al. (2021) have developed a brain computer interface (BCI) that used neural signal collected from the hand area of the motor cortex (area M1) of a paralyzed patient. The...

10 August 2024 7,180 0 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

How can I use the cif data obtained from rietveld refinement extracted via gsas2, for microstructural analysis using ETEX software?

09 August 2024 7,718 0 View

How are iso-frequency contours plotted?

Let's say we have a standard, regular hexagonal honeycomb with a 3-arm primitive unit cell (something like the figure attached; the figure is only representative and not drawn to scale). The...

07 August 2024 1,937 1 View

How to prepare the nanoparticle treated fungal sample for Environmental SEM analysis?

A fungal strain was treated with nanoparticles. We want to do an environmental SEM analysis. So could anyone share your views on preparing the sample? Thank you.

07 August 2024 5,307 1 View

How to normalize and take the significance of the MTT OD values with 3 replicates for the same cell-line?

Hi, I have a question about normalizing the MTT OD values for doing the statistical analysis. So, if we have 3 different plates and we call them 3 different replicates, so, first we would...

07 August 2024 8,106 4 View

Hernando Castaneda Marin

I think in two AF(finite automaton) deterministic o non deterministc for soluction the problem

Michael Tsikerdekis

How would a finite automaton help me calculate the effect size for a correlation?

the computational power of Fahlman's Recurrent

Cascade Correlation (RCC) architecture to that of finite state automata

I agree Irfan, and that is why I wanted to use r for having the effect size. The issue is that in terms of a partial correlation, it doesn't work when you have two dichotomous variables and one interval. For example

r y z

1 100 55

2 200 58

1 300 55

2 400 58

1 500 55

2 600 58

pearson correlation will give

ry = a number

zy = a number (exactly the same to ry)

rz = 1

So mathematically a partial correlation cannot work.

Is there a way I can implement z in y and then run a correlation?

To give more information. y is the number of words between different texts and z is the number of words between english and spanish that I know should be identical. So z could be perceived as the natural variation between languages and the number of words needed to construct identical sentences.

In order to get the true difference for y I need to adjust it by z(the natural variance between the two languages).

Can I create a metric y.new = y / z and then run a correlation between r and y.new? Or is there another metric meant to adjust a number based on another?

Alexandrowicz Rainer

Use codings 0 and 1 for the dichotomous variable, then you may use pearson as well.