Sample image showing the conversation between a user and an agent

Welcome to the website of our EACL 2023 paper:

The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents

Data access

You can request access to the data via two ways: McGill Dataverse and Huggingface Dataset. In both cases, in order to use our dataset, you must agree to the terms of use and restrictions before requesting access (links at the top of the page). We will manually review each request and grant access or reach out to you for further information. To facilitate the process, make sure that:

  1. Your Dataverse/Huggingface account is linked to your professional/research website, which we may review to ensure the dataset will be used for the intended purpose
  2. Your request is made with an academic (e.g. .edu) or professional email (e.g. @servicenow.com). To do this, your have to set your primary email to your academic/professional email, or create a new Huggingface/Dataverse account.

If your academic institution does not end with .edu, or you are part of a professional group that does not have an email address, please contact us (see email in paper).

Python Library

We have published a Python library to help you work with the dataset. To get started, please refer to the documentation for a user guide and API references. For other code-related details, please check out our GitHub repository.

Citation

If you use our dataset, please cite as follows:

@inproceedings{lu-etal-2023-statcan,
    title = "The {S}tat{C}an Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents",
    author = "Lu, Xing Han  and
      Reddy, Siva  and
      de Vries, Harm",
    booktitle = "Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics",
    month = may,
    year = "2023",
    address = "Dubrovnik, Croatia",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/2304.01412",
    pages = "2799--2829",
}

Video

You can watch the video presentation of our paper at EACL 2023 below: