Distribution of word lengths? There are more 2 letter words than 1 letter words. There are more 3 letter words than 2 letter words. Not sure how far this goes, but there I would guess that there are fewer 14 letter words than there are 13 letter words (unless one is working with German, in which case all bets are off).
Maybe the probability that the letter u is a part of a word based on word length?
How about the distribution of the e sound as in the word "bet" over all languages. Certain language families will have a greater proportion of this sound, and the overall distribution within each language family might be Gaussian. If this one isn't Gaussian, maybe some other sounds are?