Tasks

March 27, 2021 · View on GitHub

Supported Tasks

Nametask_namejiantDownloaderjiant_task_nameMisc
Argument Reasoning ComprehensionarctarctGithub
Abductive NLIabductive_nliabductive_nli
SuperGLUE Winogender Diagnosticsuperglue_axgsuperglue_axgSuperGLUE
Acceptability Definitenessacceptability_definitenessacceptability_definitenessFunction Words
Acceptability Coordacceptability_coordacceptability_coordFunction Words
Acceptability EOSacceptability_eosacceptability_eosFunction Words
Acceptability WH Wordsacceptability_whwordsacceptability_whwordsFunction Words
Adversarial NLIadversarial_nli_{round}adversarial_nli3 rounds
ARC ("easy" version)arc_easyarc_easysite
ARC ("challenge" version)arc_challengearc_challengesite
BoolQboolqboolqSuperGLUE
BUCC2018bucc2018_{lang}bucc2018XTREME, multi-lang
CommitmentBankcbcbSuperGLUE
CCGccgccg
CoLAcolacolaGLUE
CommonsenseQAcommonsenseqacommonsenseqa
EP-ConstnonterminalnonterminalEdge-Probing
COPAcopacopaSuperGLUE
EP-CorefcorefcorefEdge-Probing
Cosmos QAcosmosqacosmosqa
EP-UDdepdepEdge-Probing
EP-DPRdprdprEdge-Probing
Fever NLIfever_nlifever_nli
GLUE Diagnosticglue_diagnosticsglue_diagnosticsGLUE
HellaSwaghellaswaghellaswag
MCScript2.0mcscriptmcscriptdata
MCTACOmctacomctaco
MCTestmctest160 or mctest500mctest160 or mctest600data
MLM**mlm_simpleSee task-specific notes.
MLQAmlqa_{lang1}_{lang2}mlqaXTREME, multi-lang
MNLImnlimnliGLUE, MNLI-matched
MNLI-mismatchedmnli_mismatchedmnli_mismatchedGLUE
MRPCmrpcmrpcGLUE
MultiRCmultircmultircSuperGLUE
Mutual (standard version)mutualmutualsite
Mutual ("challenge" version)mutual_plusmutual_plussite
Natural Questionsmrqa_natural_questionsmrqa_natural_questionsMRQA version of task
NewsQAnewsqanewsqa
PIQApiqapiqaPIQA
QAMRqamrqamr
QA-SRLqasrlqasrl
QuAILquailquailsite
Quorefquorefquoref
EP-NERnernerEdge-Probing
PAWS-Xpawsx_{lang}pawsxXTREME, multi-lang
WikiAnnpanx_{lang}panxXTREME, multi-lang
EP-POSposposEdge-Probing
QNLIqnliqnliGLUE
QQPqqpqqpGLUE
ROPESropesropes
RACEraceracerace, race_middle, race_high
ReCordrecordrecordSuperGLUE
RTErterteGLUE, SuperGLUE
SciTailscitailscitail
SentEval: Bigram Shiftsenteval_bigram_shiftsenteval_bigram_shiftSentEval
SentEval: Coord Inversionsenteval_coordination_inversionsenteval_coordination_inversionSentEval
SentEval: Obj numbersenteval_obj_numbersenteval_obj_numberSentEval
SentEval: Odd Man Outsenteval_odd_man_outsenteval_odd_man_outSentEval
SentEval: Past-Presentsenteval_past_presentsenteval_past_presentSentEval
SentEval: Sentence Lengthsenteval_sentence_lengthsenteval_sentence_lengthSentEval
SentEval: Subj Numbersenteval_subj_numbersenteval_subj_numberSentEval
SentEval: Top Constituentssenteval_top_constituentssenteval_top_constituentsSentEval
SentEval: Tree Depthsenteval_tree_depthsenteval_tree_depthSentEval
SentEval: Word Contentsenteval_word_contentsenteval_word_contentSentEval
EP-RelsemevalsemevalEdge-Probing
SNLIsnlisnli
SocialIQAsocialiqasocialiqa
EP-SPR1spr1spr1Edge-Probing
EP-SPR2spr2spr2Edge-Probing
SQuAD 1.1squad_v1squad
SQuAD 2.0squad_v2squad
EP-SRLsrlsrlEdge-Probing
SST-2sstsstGLUE
STS-BstsbstsbGLUE
SuperGLUE Broad Coverage Diagnosticsuperglue_axbsuperglue_axbSuperGLUE
SWAGswagswag
Tatoebatatoeba_{lang}tatoebaXTREME, multi-lang
TyDiQAtydiqa_{lang}tydiqaXTREME, multi-lang
UDPOSudpos_{lang}udposXTREME, multi-lang
WiCwicwicSuperGLUE
Winograndewinograndewinogrande
WNLIwnliwnliGLUE
WSCwscwscSuperGLUE
XNLIxnli_{lang}xnliXTREME, multi-lang
XQuADxquad_{lang}xquadXTREME, multi-lang
  • task_name: Name-by-convention, used by downloader, and used in JiantModel to map from task names to task-models. You can change this as long as your settings are internally consistent.
  • jiant: Whether it's supported in jiant (i.e. you can train/eval on it)
  • Downloader: Whether you can download using the downloader.
  • jiant_task_name: Used to determine the programmatic behavior for the task (how to tokenize, what kind of task-model is compatible). Is tied directly to the code. See: jiant.tasks.retrieval.