Dear all,
Do you know any available data set for text summarization-with text summaries?
Dear Keramatfar,
Luis Adrián Cabrera-Diego is right. Please go through this.
Tanks, but i need english text.
Document understanding conference: http://duc.nist.gov
Look at R packages ie tm and others.
it depends on whether you are interested single or multi document summarization. Check TAC datasets
https://tac.nist.gov//
As Vesile Evrim and Nasreen Jadhim said, the DUC and TAC datasets, are the most important corpora in English about summarization.
For other summarization corpora, but in French, there is:
Puces: http://dev.termwatch.es/~fresa/CORPUS/PUCES/
RPM2: http://rpm2.org/outils_et_ressources-en.html (Multidocument)
you can check data repositories at kdnuggets:
http://www.kdnuggets.com/datasets/index.html
Check out the data set published on kaggle.
https://www.kaggle.com/sunnysai12345/news-summary
Try the Australian Legal Cases dataset:
https://archive.ics.uci.edu/ml/datasets/Legal+Case+Reports
I have already worked on it for extractive summarization:
https://github.com/aneesh-joshi/Auto-Text-Summarizer
Hi
This dataset is suitable:
https://rtltds.github.io
Raw Data RTLTDS Dataset
You could easily create this dataset, just upload the text files on dataturks and write summaries for them there and download, check out more here.
http://dataturks.com/
Datasets for text summarization:
1. https://github.com/mathsyouth/awesome-text-summarization#corpus
2. http://nlpprogress.com/english/summarization.html Benchmarks & papers are also given for each dataset mentioned on this page.
I have a binary feature that i want to use it with textual features i.e. unigrams. I use logistic regression and TF/IDF for representing text. So i simply add a unique feature, say ss or oo, to...
05 June 2017 2,389 5 View
.
01 February 2016 9,344 3 View
Hiiiii everyone! I have an enquiry on statistical analysis. I was looking for many forum and it's still cannot solve my problem. I want to compare means of two groups of data but only with two...
03 March 2021 8,796 3 View
I am on the lookout for the Enhanced Yellow Fluorescent Protein (Aequorea victoria) DNA sequence. Does anyone know where I can find it? Thank you in advance
03 March 2021 3,568 1 View
Hi, I want to start testing pitfall trap to obtain ants samples, but I need to conduct molecular analysis on those insects. So, what kind of fluid can I use? Ethanol expires too early and I need...
03 March 2021 5,978 5 View
What's the best way to measure growth rates in House sparrow chicks from day 2 to day 10? Since, the growth curve from day 2 to 10 won't be like the "Logistic curve" it might not follow logistic...
03 March 2021 1,401 3 View
I have conducted and published a systematic review and meta-analysis research with the topic related to public health and health pomotion (protocol was registed in PROSPERO). Now we want to...
03 March 2021 8,920 3 View
dear community, my model is based feature extraction from non stationary signals using discrete Wavelet Transform and then using statistical features then machine learning classifiers in order to...
03 March 2021 6,994 5 View
1. What is the impact of having different scales in a survey? and how can we solve this problem before and after data collection (Literature-based reflection)? Thank you for your time and for...
03 March 2021 2,870 3 View
I'm dealing with a mediation model and am using the PROCESS module in SPSS. Due to SPSS and PROCESS being limited in the imputation methods - being unable to handle multiple imputation - the other...
02 March 2021 4,362 1 View
I just wanted to check if I need to run a linear regression separately if I am using PROCESS MACRO to run mediation analysis. Thank you.
02 March 2021 4,359 3 View
If the detection range is in ng/ml but the reference range is in ug/ml for a molecule or protein in serum or plasma .how to dilute and what is the initial volume to be taken for quantitative analysis
02 March 2021 7,670 3 View