I don't have exact ideas, but, however, you may be able to find multimodal fake news datasets in English language by searching reputable academic sources, research publications, or open data repositories such as Kaggle, UCI Machine Learning Repository, or the Stanford Large Network Dataset Collection. Additionally, some organizations focused on misinformation research like the Fake News Challenge might provide access to relevant datasets or resources. Conducting a thorough search using specific keywords like "multimodal fake news dataset English" could lead you to the datasets you're looking for.
Unity Mmapula Nkateng I wish to know the method used to build a database of social media sites and how I can identify fake news to collect it in the data. I appreciate your contribution to me
Welcome, I think that one you can contact people like Sheena Gardener as they are more experienced in creating or using that. I do not know but find experts in corpus linguistics to assist it would be an interesting research to do.
Some multimodal fake news detection English datasets that I have worked with-
FakeNewsNet- It's a comprehensive dataset for fake news detection. Includes- news content, social context, & spatiotemporal information & also has data from both textual & visual modalities. Link: https://github.com/KaiDMML/FakeNewsNet
LIAR-: It's a benchmark dataset for fake news detection with 12.8K manually labeled short statements from various contexts (news articles, speeches, etc.), and it also includes metadata such as speaker, context, & subject. Link: https://paperswithcode.com/dataset/liar
Fakeddit- It includes text, images, and metadata from Reddit posts. It provides labels for different degrees of fake news (ex- completely true, partially true, false). Link: https://github.com/entitize/Fakeddit
Weibo-COVID19- it has multimodal data from Weibo posts related to COVID-19, including text, images, & videos that focuses on misinformation and fake news related to the pandemic. Link: https://paperswithcode.com/dataset/weibo-cov
Politifact Dataset- It includes fact-checked articles from the Politifact website, and has both text and image data for each news article. Link: https://www.kaggle.com/datasets/rmisra/politifact-fact-check-dataset
Hope this helps :) Iman Q. Abduljaleel Unity Mmapula Nkateng Manoj Kumar Yadav