Tasks
March 27, 2021 · View on GitHub
Supported Tasks
| Name | task_name | jiant | Downloader | jiant_task_name | Misc |
|---|---|---|---|---|---|
| Argument Reasoning Comprehension | arct | ✅ | ✅ | arct | Github |
| Abductive NLI | abductive_nli | ✅ | ✅ | abductive_nli | |
| SuperGLUE Winogender Diagnostic | superglue_axg | ✅ | ✅ | superglue_axg | SuperGLUE |
| Acceptability Definiteness | acceptability_definiteness | ✅ | ✅ | acceptability_definiteness | Function Words |
| Acceptability Coord | acceptability_coord | ✅ | ✅ | acceptability_coord | Function Words |
| Acceptability EOS | acceptability_eos | ✅ | ✅ | acceptability_eos | Function Words |
| Acceptability WH Words | acceptability_whwords | ✅ | ✅ | acceptability_whwords | Function Words |
| Adversarial NLI | adversarial_nli_{round} | ✅ | ✅ | adversarial_nli | 3 rounds |
| ARC ("easy" version) | arc_easy | ✅ | ✅ | arc_easy | site |
| ARC ("challenge" version) | arc_challenge | ✅ | ✅ | arc_challenge | site |
| BoolQ | boolq | ✅ | ✅ | boolq | SuperGLUE |
| BUCC2018 | bucc2018_{lang} | ✅ | ✅ | bucc2018 | XTREME, multi-lang |
| CommitmentBank | cb | ✅ | ✅ | cb | SuperGLUE |
| CCG | ccg | ✅ | ccg | ||
| CoLA | cola | ✅ | ✅ | cola | GLUE |
| CommonsenseQA | commonsenseqa | ✅ | ✅ | commonsenseqa | |
| EP-Const | nonterminal | ✅ | nonterminal | Edge-Probing | |
| COPA | copa | ✅ | ✅ | copa | SuperGLUE |
| EP-Coref | coref | ✅ | coref | Edge-Probing | |
| Cosmos QA | cosmosqa | ✅ | ✅ | cosmosqa | |
| EP-UD | dep | ✅ | dep | Edge-Probing | |
| EP-DPR | dpr | ✅ | dpr | Edge-Probing | |
| Fever NLI | fever_nli | ✅ | ✅ | fever_nli | |
| GLUE Diagnostic | glue_diagnostics | ✅ | ✅ | glue_diagnostics | GLUE |
| HellaSwag | hellaswag | ✅ | ✅ | hellaswag | |
| MCScript2.0 | mcscript | ✅ | mcscript | data | |
| MCTACO | mctaco | ✅ | ✅ | mctaco | |
| MCTest | mctest160 or mctest500 | ✅ | ✅ | mctest160 or mctest600 | data |
| MLM | * | ✅ | * | mlm_simple | See task-specific notes. |
| MLQA | mlqa_{lang1}_{lang2} | ✅ | ✅ | mlqa | XTREME, multi-lang |
| MNLI | mnli | ✅ | ✅ | mnli | GLUE, MNLI-matched |
| MNLI-mismatched | mnli_mismatched | ✅ | ✅ | mnli_mismatched | GLUE |
| MRPC | mrpc | ✅ | ✅ | mrpc | GLUE |
| MultiRC | multirc | ✅ | ✅ | multirc | SuperGLUE |
| Mutual (standard version) | mutual | ✅ | ✅ | mutual | site |
| Mutual ("challenge" version) | mutual_plus | ✅ | ✅ | mutual_plus | site |
| Natural Questions | mrqa_natural_questions | ✅ | ✅ | mrqa_natural_questions | MRQA version of task |
| NewsQA | newsqa | ✅ | ✅ | newsqa | |
| PIQA | piqa | ✅ | ✅ | piqa | PIQA |
| QAMR | qamr | ✅ | ✅ | qamr | |
| QA-SRL | qasrl | ✅ | ✅ | qasrl | |
| QuAIL | quail | ✅ | ✅ | quail | site |
| Quoref | quoref | ✅ | ✅ | quoref | |
| EP-NER | ner | ✅ | ner | Edge-Probing | |
| PAWS-X | pawsx_{lang} | ✅ | ✅ | pawsx | XTREME, multi-lang |
| WikiAnn | panx_{lang} | ✅ | ✅ | panx | XTREME, multi-lang |
| EP-POS | pos | ✅ | pos | Edge-Probing | |
| QNLI | qnli | ✅ | ✅ | qnli | GLUE |
| QQP | qqp | ✅ | ✅ | qqp | GLUE |
| ROPES | ropes | ✅ | ✅ | ropes | |
| RACE | race | ✅ | ✅ | race | race, race_middle, race_high |
| ReCord | record | ✅ | ✅ | record | SuperGLUE |
| RTE | rte | ✅ | ✅ | rte | GLUE, SuperGLUE |
| SciTail | scitail | ✅ | ✅ | scitail | |
| SentEval: Bigram Shift | senteval_bigram_shift | ✅ | ✅ | senteval_bigram_shift | SentEval |
| SentEval: Coord Inversion | senteval_coordination_inversion | ✅ | ✅ | senteval_coordination_inversion | SentEval |
| SentEval: Obj number | senteval_obj_number | ✅ | ✅ | senteval_obj_number | SentEval |
| SentEval: Odd Man Out | senteval_odd_man_out | ✅ | ✅ | senteval_odd_man_out | SentEval |
| SentEval: Past-Present | senteval_past_present | ✅ | ✅ | senteval_past_present | SentEval |
| SentEval: Sentence Length | senteval_sentence_length | ✅ | ✅ | senteval_sentence_length | SentEval |
| SentEval: Subj Number | senteval_subj_number | ✅ | ✅ | senteval_subj_number | SentEval |
| SentEval: Top Constituents | senteval_top_constituents | ✅ | ✅ | senteval_top_constituents | SentEval |
| SentEval: Tree Depth | senteval_tree_depth | ✅ | ✅ | senteval_tree_depth | SentEval |
| SentEval: Word Content | senteval_word_content | ✅ | ✅ | senteval_word_content | SentEval |
| EP-Rel | semeval | ✅ | semeval | Edge-Probing | |
| SNLI | snli | ✅ | ✅ | snli | |
| SocialIQA | socialiqa | ✅ | ✅ | socialiqa | |
| EP-SPR1 | spr1 | ✅ | spr1 | Edge-Probing | |
| EP-SPR2 | spr2 | ✅ | spr2 | Edge-Probing | |
| SQuAD 1.1 | squad_v1 | ✅ | ✅ | squad | |
| SQuAD 2.0 | squad_v2 | ✅ | ✅ | squad | |
| EP-SRL | srl | ✅ | srl | Edge-Probing | |
| SST-2 | sst | ✅ | ✅ | sst | GLUE |
| STS-B | stsb | ✅ | ✅ | stsb | GLUE |
| SuperGLUE Broad Coverage Diagnostic | superglue_axb | ✅ | ✅ | superglue_axb | SuperGLUE |
| SWAG | swag | ✅ | ✅ | swag | |
| Tatoeba | tatoeba_{lang} | ✅ | ✅ | tatoeba | XTREME, multi-lang |
| TyDiQA | tydiqa_{lang} | ✅ | ✅ | tydiqa | XTREME, multi-lang |
| UDPOS | udpos_{lang} | ✅ | ✅ | udpos | XTREME, multi-lang |
| WiC | wic | ✅ | ✅ | wic | SuperGLUE |
| Winogrande | winogrande | ✅ | ✅ | winogrande | |
| WNLI | wnli | ✅ | ✅ | wnli | GLUE |
| WSC | wsc | ✅ | ✅ | wsc | SuperGLUE |
| XNLI | xnli_{lang} | ✅ | ✅ | xnli | XTREME, multi-lang |
| XQuAD | xquad_{lang} | ✅ | ✅ | xquad | XTREME, multi-lang |
task_name: Name-by-convention, used by downloader, and used inJiantModelto map from task names to task-models. You can change this as long as your settings are internally consistent.jiant: Whether it's supported injiant(i.e. you can train/eval on it)- Downloader: Whether you can download using the downloader.
jiant_task_name: Used to determine the programmatic behavior for the task (how to tokenize, what kind of task-model is compatible). Is tied directly to the code. See:jiant.tasks.retrieval.