I have the following situation: A number of texts have been rated for readability. The research question is: Is the used readability score valid? We did a study where participants first read the text and then answer a multple choice question. Because the choices were exclusive (exactly one correct choice), there is a 25% chance of guessing.

We ran a logistic regression with correct/incorrect as response and readability score as predictor. We find the desired relationship, but we also want to predict performance at texts of lower readability than in our sample. How can we formulate the model or adjust the estimates such that predicted probability of correct answer would never fall below 25% ?

Note that the data analysis is done by a bachelor student. Pragmatic solutions, using standard software, are preferred over profound. Also note that we have to use either a mixed-effects model or GEE to account for repeated measures.

Similar questions and discussions