This repository was archived by the owner on Oct 31, 2023. It is now read-only.

Can you release the code for pre-processing the dataset? #1

@kimdev95

Hi. Could you release the code you used to pre-process the dataset? I found the dataset to be a little noisy, and I would like to evaluate our coreference resolution model in the same setting as reported in your paper.

Some issues in the dataset are:

  • Incorrect mention annotations. For example, in the utterance "I'll call them later .", both "I" and "I'll" are sometimes annotated as mentions. A similar case occurs in the utterance "I've sent the PDF to both of them ."
  • Some links (A, B) refer to a mention A or B that never appears in the dialog.
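
To make the two issues above concrete, here is a minimal sketch of how they could be detected. It is not the authors' pre-processing code, and the data layout is an assumption: mentions are taken to be hypothetical `(start, end)` character offsets into an utterance, links are pairs of mention strings, and utterances are whitespace-tokenized as in the examples above. The function names `overlapping_mentions` and `dangling_links` are made up for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: (start, end) character offsets, end exclusive.
Span = Tuple[int, int]


def overlapping_mentions(spans: List[Span]) -> List[Tuple[Span, Span]]:
    """Return pairs of mention spans that overlap, e.g. "I" inside "I'll"."""
    ordered = sorted(spans)
    pairs = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            # Spans are sorted by start, so no later span can overlap `a`.
            if b[0] >= a[1]:
                break
            pairs.append((a, b))
    return pairs


def appears_in_dialog(mention: str, dialog: List[str]) -> bool:
    """Check that `mention` occurs on token boundaries in some utterance.

    Assumes utterances are already whitespace-tokenized.
    """
    return any(f" {mention} " in f" {utt} " for utt in dialog)


def dangling_links(
    dialog: List[str], links: List[Tuple[str, str]]
) -> List[Tuple[str, str]]:
    """Return links (A, B) where A or B never appears in the dialog."""
    return [
        (a, b)
        for a, b in links
        if not appears_in_dialog(a, dialog) or not appears_in_dialog(b, dialog)
    ]


utterance = "I'll call them later ."
# "I" = (0, 1), "I'll" = (0, 4), "them" = (10, 14)
flagged_spans = overlapping_mentions([(0, 1), (0, 4), (10, 14)])
# "the clients" is an invented mention that does not occur in the utterance.
flagged_links = dangling_links(
    [utterance], [("I'll", "them"), ("them", "the clients")]
)
```

With the example utterance, the first check flags the `"I"` / `"I'll"` pair and the second flags the link whose second mention is absent from the dialog. A real check would need to follow whatever span and link encoding the released dataset actually uses.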
