Hello friends, I'm looking for criteria for minimum number of words in keyword and collocate testing.  I have seen work addressing potential problems with very large corpora (for both log likelihood and chi-squared tests), but I can't seem to find anything on corpus size minimums.  Specifically I'm interested in log-likelihood and MI-score.  Thanks for your help.

More William M. Marcellino's questions See All
Similar questions and discussions