Besides common methodological limitations (e.g. limited historical data, chronological gaps, lack of annotated data), we still need to justify our choice, ex. 500/1000+ tokens per text, or century, or genre etc. Thank you in advance for you valuable input.