
Dataset Information


Event-Dataset: Temporal information retrieval and text classification dataset.


ABSTRACT: Temporal Information Retrieval (TIR) has recently attracted considerable attention in the information retrieval community. TIR exploits temporal dynamics in the retrieval process and combines textual relevance with temporal relevance to meet a user's temporal information needs (Ur Rehman Khan et al., 2018). The focus time of a document is an important temporal aspect, defined as the time to which the content of the document refers (Jatowt et al., 2015; Jatowt et al., 2013; Morbidoni et al., 2018; Khan et al., 2018). To the best of our knowledge, no publicly available standard benchmark dataset exists that can comprehensively evaluate the performance of focus time assessment strategies. We have therefore produced the Event-dataset, which comprises 35 queries and a set of news articles for each query. Formally, C = {Qs, Ds}, where C represents the dataset and Qs = {q1, q2, q3, …, q35} is the query set; for each query qi there is a set of news articles qi = {dr, dnr}, where dr and dnr are the sets of relevant and non-relevant documents, respectively. Each query in the dataset represents a popular event. To label the articles as relevant or non-relevant, we used a user-study-based evaluation method in which a group of postgraduate students manually annotated the articles. We believe this dataset gives information retrieval researchers a benchmark for evaluating focus time assessment methods in particular and information retrieval methods in general.
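The structure described in the abstract (queries, each with a relevant and a non-relevant article set) can be sketched in Python as follows. This is only an illustrative sketch: the class name `Query`, the helper `precision`, the article identifiers, and the toy values are all assumptions, and nothing here reflects the actual file layout of the published dataset.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Query:
    """Hypothetical container for one query q_i and its annotated articles."""

    text: str
    relevant: set[str] = field(default_factory=set)      # d_r: relevant article ids
    non_relevant: set[str] = field(default_factory=set)  # d_nr: non-relevant article ids

    def articles(self) -> set[str]:
        """All articles attached to this query (d_r together with d_nr)."""
        return self.relevant | self.non_relevant


def precision(query: Query, retrieved: list[str]) -> float:
    """Fraction of retrieved article ids that are annotated as relevant."""
    if not retrieved:
        return 0.0
    hits = sum(1 for doc_id in retrieved if doc_id in query.relevant)
    return hits / len(retrieved)


# Toy values, invented purely for illustration (not taken from the dataset).
toy_query = Query(
    text="example event",
    relevant={"a1", "a2"},
    non_relevant={"a3"},
)
toy_precision = precision(toy_query, ["a1", "a3"])
```

With the relevance labels stored per query, a retrieval or focus time method can be scored query by query and the scores averaged over all 35 queries.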

SUBMITTER: Khan SUR 

PROVIDER: S-EPMC6554222 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature


Publications

Event-Dataset: Temporal information retrieval and text classification dataset.

Khan, Shafiq Ur Rehman; Islam, Muhammad Arshad

Data in Brief, 2019-05-23



Similar Datasets

S-EPMC8627225 | biostudies-literature
S-EPMC4762689 | biostudies-literature
S-EPMC4028845 | biostudies-literature
S-EPMC5381872 | biostudies-literature
S-EPMC8655353 | biostudies-literature
S-EPMC7406362 | biostudies-literature
S-EPMC5098425 | biostudies-literature
S-EPMC6925844 | biostudies-literature