I'm looking for a dataset for blind text steganalysis that is already labeled with stego/non-stego classes. I'm just testing a few ML/DL algorithms in this domain. Any pointers?
Yes, I've been there; those are image datasets. The best text dataset I have found is from:
Yang, Z., Huang, Y., & Zhang, Y. J. (2020). TS-CSW: text steganalysis and hidden capacity estimation based on convolutional sliding windows. Multimedia Tools and Applications, 1-24.
It is available at https://github.com/YangzlTHU/TS-CNN, but it's for Chinese text.