Data & code for the ACL 2020 paper Examining the State-of-the-Art in News Timeline Summarization (paper, slides).
Available
- all datasets
 - methods & evaluation code
 - preprocessing instructions for new datasets
 
Planned
- instructions to train date ranking models
 - more user-friendly fast TLS version to run on unpreprocessed data
 
All datasets used in our experiments are available here, including:
- T17
 - Crisis
 - Entities
 
The news-tls library contains tools for loading TLS datasets and running TLS methods.
To install, run:
pip install -r requirements.txt
pip install -e .
Tilse also needs to be installed for evaluation and some TLS-specific data classes.
Check out news_tls/explore_dataset.py to see how to load the provided datasets.
Check out experiments here.
If you have a new dataset yourself and want to use preprocess it as the datasets above, check out the preprocessing steps here.
@inproceedings{gholipour-ghalandari-ifrim-2020-examining,
    title = "Examining the State-of-the-Art in News Timeline Summarization",
    author = "Gholipour Ghalandari, Demian  and
      Ifrim, Georgiana",
    booktitle = "Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics",
    month = jul,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.acl-main.122",
    pages = "1322--1334",
}