Inspecting the dataset with:
wget https://storage.googleapis.com/ai2-mosaic-public/projects/fantom/fantom.tar.gz
tar xvf fantom.tar.gz
jq '.[117, 124, 141, 248, 295, 333, 339, 391, 458, 660, 683, 686, 763] | .factQA | {correct_answer, wrong_answer}' fantom_v1.json
I get:
{
"correct_answer": "Italian cuisine.",
"wrong_answer": "Italian cuisine."
}
{
"correct_answer": "The conversation topic shifted to first date ideas and tips when Piper joined the group.",
"wrong_answer": "The conversation topic shifted to first date ideas and tips when Piper joined the group."
}
{
"correct_answer": "Victor",
"wrong_answer": "Victor"
}
{
"correct_answer": "The conversation shifted to the topic of cooking and their favourite dishes to prepare when Amari joined the discussion.",
"wrong_answer": "The conversation shifted to the topic of cooking and their favourite dishes to prepare when Amari joined the discussion."
}
{
"correct_answer": "Desmond owned the pet named Mittens and Remington owned the pet named Feathers.",
"wrong_answer": "Desmond owned the pet named Mittens and Remington owned the pet named Feathers."
}
{
"correct_answer": "No, the topic of influential figures in their understanding of feminism was not revisited in the conversation with Julius.",
"wrong_answer": "No, the topic of influential figures in their understanding of feminism was not revisited in the conversation with Julius."
}
{
"correct_answer": "The conversation shifted to running and cardio workouts after Aidan joined.",
"wrong_answer": "The conversation shifted to running and cardio workouts after Aidan joined."
}
{
"correct_answer": "They started discussing the concept of intersectionality after Juan's arrival.",
"wrong_answer": "They started discussing the concept of intersectionality after Juan's arrival."
}
{
"correct_answer": "Miguel",
"wrong_answer": "Miguel"
}
{
"correct_answer": "Jimmy's family emphasized the values of honesty and hard work.",
"wrong_answer": "Jimmy's family emphasized the values of honesty and hard work."
}
{
"correct_answer": "Yankees",
"wrong_answer": "Yankees"
}
{
"correct_answer": "Jett deals with diabetes.",
"wrong_answer": "Jett deals with diabetes."
}
{
"correct_answer": "Brian encountered a bear while hiking.",
"wrong_answer": "Brian encountered a bear while hiking."
}
This means that for some items the factQA field has identical values for correct_answer and wrong_answer. Is this an error in the dataset?
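To check how widespread this is beyond the indices queried above, something like the following could list every affected item. It is only a sketch: it assumes fantom_v1.json is a top-level JSON array whose items carry a factQA object with correct_answer and wrong_answer keys (as the jq query suggests), and the helper names load_items and find_identical_answers are made up for illustration.

```python
import json


def load_items(path):
    """Load the dataset; assumes the file holds a top-level JSON array."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def find_identical_answers(items):
    """Return indices of items whose factQA correct and wrong answers are equal."""
    hits = []
    for index, item in enumerate(items):
        # Assumed layout: item["factQA"]["correct_answer"] / ["wrong_answer"].
        fact_qa = item.get("factQA") or {}
        correct = fact_qa.get("correct_answer")
        wrong = fact_qa.get("wrong_answer")
        # Skip items where the field is missing rather than counting None == None.
        if correct is not None and correct == wrong:
            hits.append(index)
    return hits
```

Calling find_identical_answers(load_items("fantom_v1.json")) after extracting the archive should return the positions of all such items, which I would expect to include the indices listed in the jq command.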