To evaluate an IR system, is there a test data set / corpora where each documents consists of multiple paragraphs?

02 September 2020 0 5K Report

I developed an information retrieval system for my master thesis, and I need to evaluate my term weighting function. My function takes into consideration paragraphs of the document when performing calculations. So, to evaluate it, I need a publicly available dataset with relevance judgments (query-document pairs) where each document consists of multiple paragraphs, and not just one single paragraph as in "Reuters" dataset for example, and there should be a ground truth for the document retrieval task, so I can compare my system's results to it.

Can you suggest to me such a dataset?

In the image is a sample where document structure meets my needs.

Badges
Science topic

Similar topics
Mathematical Sciences
Graphs

More Nina Saabiyeh's questions See All

I want to ask why the poly aniline film does not stick to the ITO type substrate knowing that I used the electrodeposition method?

In order to make a polyaniline film, I used the electrodeposition method and set all the conditions, but I noticed that the polymerization process is happening, but it doesn't stick to the...

09 July 2024 2,708 2 View

What statistical analysis should I use?

My study has 3 treatments. the values measured are only on day 0 and the last day of the experiment.

27 May 2024 5,972 6 View

What journals accept systematized reviews?

I have been working on a review that has been structured as a systematized review. I was told that we may need to conduct a systematic review to get published. I have seen systematized reviews...

17 April 2024 7,267 3 View

When is a cell line considered radiosensitive?

Dear readers, I am currently performing radiosensitivity assays on tumor cell lines, both using clonogenic survival and cell viability (cell titer glo). I was wondering whether there are some...

02 April 2024 7,273 2 View

How do I get SnO2 thin films clear without any stains in them, knowing that I use Ethanol as a solvent ?

I want to get good films out of them, knowing that the melting process was complete, but the films weren't clear.

07 March 2024 7,953 1 View

Are there specific terms for validity and reliability in qualitative research?

Our research adviser told us to use specified terms for validity and reliability in qualitative research, since he referred to these terms as specific for quantitative research. To my...

27 November 2023 7,380 4 View

Except any acid, what will be the best solvent for this oxides powders (SnO2,ZnO,CuO and NiO) to dissolve homogeneously?

I need just the name of the solvent

04 November 2023 511 4 View

I am looking for solvents for these oxides. I tried many solutions but failed. Please help ,CuO,SnO2,TiO2 and ZnO ?

My work requires dissolving these oxides. I tried with some solvents, but it did not work. I only want the names of the solvents

03 November 2023 8,757 2 View

How do I distill aniline ?

How do I distill aniline before use it because it is insoluble in water I need some details

31 October 2023 7,845 2 View

I am searching for oxide solvents as copper oxide and vanadium oxide I tried to dissolve them, but I was unsuccessful. Who is the solvent I used?

I want the names of the solvents that help me dissolve the mentioned oxides.

12 October 2023 8,504 5 View

How to convert a privately loaded document into a public document?

I attempted to make a privately uploaded text public but a window appeared that said an error occurred. There was no explanation provided as to why there was an error or what might be done to...

05 August 2024 8,025 7 View

What exactly is RAG-LLM doing? Isn’t it data engineering?

What exactly is Retrieval Augmented Generation for Large Language Model doing? Isn’t it data engineering?

30 July 2024 7,376 3 View

May I know the exact Quartile of the journal- Advanced Engineering Materials (Wiley) for material science category?

In some data sources it has been grouped in Q1 and some shows it is Q2.

29 July 2024 4,227 2 View

Are the apoptotic cells is positive for γH2AX ?

It has been documented that apoptotic cells themselves can induce phosphorylation of serine 139 on H2AX (γH2AX) due to DNA fragmentation during apoptosis (doi: 10.1074/jbc.275.13.9390). As γH2AX...

28 July 2024 7,983 2 View

I need the housing or real estate prices data since 1950 till date in India. Can anyone guide how can I get this data ?

I didn't find any data source for continuous time series since 1950. NHB Residex and RBI give data from 2013 and 2010 onwards. Please guide. Thanks !!!

27 July 2024 6,271 5 View

Do I need an antigen retrieval for a multiplex RNAscope experiment for adherent cells?

Hello! I planned to do an multiplex RNAscope fluorescent assay for cultured cells in a 96 well plate format according to the protocol provided by ACD:...

22 July 2024 3,334 0 View

How does the digestate compare to other organic amendments regarding quality and performance in different countries?

Experts are welcome to share their experience country-wise so that we can make a nice document listing all the country-wise experiences. ? How does the digestate compare to other organic...

18 July 2024 5,225 3 View

Information is Stored by Context?

While living in Boston, which has a fine metro system, there were times that I would exit from underground at the wrong location, thereby expecting a particular visual scene only to be momentarily...

17 July 2024 3,237 10 View

My question is about errors in Ansys mechanical APDL? Data file file.db does not exist for RESUME?

I did a welding simulation using ANSYS MECHANICAL APDL. The solution was done, but I encountered this error in the output file (Data file file.db does not exist for RESUME). On the other hand, I...

11 July 2024 4,480 0 View

How can I avoid uneven IHC staining on free floating tissue from frozen human CNS sections (cryostat 50um thickness)?

Hello, I've been doing free floating IHC using a VAChT antibody with a 1:1k dilution (139 103 - Synaptic Systems). However, there has been a struggle to keep an even stain with my samples of...

06 July 2024 4,521 2 View