Some factQA questions have identical correct_answer and wrong_answer #3

@dniku

Inspecting the dataset with:

wget https://storage.googleapis.com/ai2-mosaic-public/projects/fantom/fantom.tar.gz
tar xvf fantom.tar.gz
jq '.[117, 124, 141, 248, 295, 333, 339, 391, 458, 660, 683, 686, 763] | .factQA | {correct_answer, wrong_answer}' fantom_v1.json

I get:

{
  "correct_answer": "Italian cuisine.",
  "wrong_answer": "Italian cuisine."
}
{
  "correct_answer": "The conversation topic shifted to first date ideas and tips when Piper joined the group.",
  "wrong_answer": "The conversation topic shifted to first date ideas and tips when Piper joined the group."
}
{
  "correct_answer": "Victor",
  "wrong_answer": "Victor"
}
{
  "correct_answer": "The conversation shifted to the topic of cooking and their favourite dishes to prepare when Amari joined the discussion.",
  "wrong_answer": "The conversation shifted to the topic of cooking and their favourite dishes to prepare when Amari joined the discussion."
}
{
  "correct_answer": "Desmond owned the pet named Mittens and Remington owned the pet named Feathers.",
  "wrong_answer": "Desmond owned the pet named Mittens and Remington owned the pet named Feathers."
}
{
  "correct_answer": "No, the topic of influential figures in their understanding of feminism was not revisited in the conversation with Julius.",
  "wrong_answer": "No, the topic of influential figures in their understanding of feminism was not revisited in the conversation with Julius."
}
{
  "correct_answer": "The conversation shifted to running and cardio workouts after Aidan joined.",
  "wrong_answer": "The conversation shifted to running and cardio workouts after Aidan joined."
}
{
  "correct_answer": "They started discussing the concept of intersectionality after Juan's arrival.",
  "wrong_answer": "They started discussing the concept of intersectionality after Juan's arrival."
}
{
  "correct_answer": "Miguel",
  "wrong_answer": "Miguel"
}
{
  "correct_answer": "Jimmy's family emphasized the values of honesty and hard work.",
  "wrong_answer": "Jimmy's family emphasized the values of honesty and hard work."
}
{
  "correct_answer": "Yankees",
  "wrong_answer": "Yankees"
}
{
  "correct_answer": "Jett deals with diabetes.",
  "wrong_answer": "Jett deals with diabetes."
}
{
  "correct_answer": "Brian encountered a bear while hiking.",
  "wrong_answer": "Brian encountered a bear while hiking."
}

So for at least these 13 items, the factQA field has identical values for correct_answer and wrong_answer. Is this an error in the dataset?
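
To check whether the indices listed above are the only affected items, the whole file could be scanned rather than a hand-picked subset. The following is a minimal sketch, assuming fantom_v1.json is a top-level JSON array in which each item has a factQA object with correct_answer and wrong_answer keys (which is what the jq command above implies). The helper names find_identical_answers and scan_file are hypothetical and not part of the dataset's tooling.

```python
import json


def find_identical_answers(items):
    """Return indices of items whose factQA correct_answer equals wrong_answer.

    Assumes `items` is a list of dicts, each optionally holding a "factQA" dict.
    Items without a factQA entry or without a correct_answer are skipped.
    """
    hits = []
    for index, item in enumerate(items):
        fact_qa = item.get("factQA") or {}
        correct = fact_qa.get("correct_answer")
        wrong = fact_qa.get("wrong_answer")
        if correct is not None and correct == wrong:
            hits.append(index)
    return hits


def scan_file(path):
    """Load a JSON array from `path` and return the indices of affected items."""
    with open(path, encoding="utf-8") as f:
        return find_identical_answers(json.load(f))
```

Calling scan_file("fantom_v1.json") after extracting the archive should return the list of affected indices, which can then be compared against the ones listed above.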
