Tasks

March 27, 2021 · View on GitHub

Supported Tasks

Name	`task_name`	`jiant`	Downloader	`jiant_task_name`	Misc
Argument Reasoning Comprehension	arct	✅	✅	arct	Github
Abductive NLI	abductive_nli	✅	✅	abductive_nli
SuperGLUE Winogender Diagnostic	superglue_axg	✅	✅	superglue_axg	SuperGLUE
Acceptability Definiteness	acceptability_definiteness	✅	✅	acceptability_definiteness	Function Words
Acceptability Coord	acceptability_coord	✅	✅	acceptability_coord	Function Words
Acceptability EOS	acceptability_eos	✅	✅	acceptability_eos	Function Words
Acceptability WH Words	acceptability_whwords	✅	✅	acceptability_whwords	Function Words
Adversarial NLI	`adversarial_nli_{round}`	✅	✅	adversarial_nli	3 rounds
ARC ("easy" version)	arc_easy	✅	✅	arc_easy	site
ARC ("challenge" version)	arc_challenge	✅	✅	arc_challenge	site
BoolQ	boolq	✅	✅	boolq	SuperGLUE
BUCC2018	`bucc2018_{lang}`	✅	✅	bucc2018	XTREME, multi-lang
CommitmentBank	cb	✅	✅	cb	SuperGLUE
CCG	ccg	✅		ccg
CoLA	cola	✅	✅	cola	GLUE
CommonsenseQA	commonsenseqa	✅	✅	commonsenseqa
EP-Const	nonterminal	✅		nonterminal	Edge-Probing
COPA	copa	✅	✅	copa	SuperGLUE
EP-Coref	coref	✅		coref	Edge-Probing
Cosmos QA	cosmosqa	✅	✅	cosmosqa
EP-UD	dep	✅		dep	Edge-Probing
EP-DPR	dpr	✅		dpr	Edge-Probing
Fever NLI	fever_nli	✅	✅	fever_nli
GLUE Diagnostic	glue_diagnostics	✅	✅	glue_diagnostics	GLUE
HellaSwag	hellaswag	✅	✅	hellaswag
MCScript2.0	mcscript	✅		mcscript	data
MCTACO	mctaco	✅	✅	mctaco
MCTest	mctest160 or mctest500	✅	✅	mctest160 or mctest600	data
MLM	*	✅	*	mlm_simple	See task-specific notes.
MLQA	`mlqa_{lang1}_{lang2}`	✅	✅	mlqa	XTREME, multi-lang
MNLI	mnli	✅	✅	mnli	GLUE, MNLI-matched
MNLI-mismatched	mnli_mismatched	✅	✅	mnli_mismatched	GLUE
MRPC	mrpc	✅	✅	mrpc	GLUE
MultiRC	multirc	✅	✅	multirc	SuperGLUE
Mutual (standard version)	mutual	✅	✅	mutual	site
Mutual ("challenge" version)	mutual_plus	✅	✅	mutual_plus	site
Natural Questions	mrqa_natural_questions	✅	✅	mrqa_natural_questions	MRQA version of task
NewsQA	newsqa	✅	✅	newsqa
PIQA	piqa	✅	✅	piqa	PIQA
QAMR	qamr	✅	✅	qamr
QA-SRL	qasrl	✅	✅	qasrl
QuAIL	quail	✅	✅	quail	site
Quoref	quoref	✅	✅	quoref
EP-NER	ner	✅		ner	Edge-Probing
PAWS-X	`pawsx_{lang}`	✅	✅	pawsx	XTREME, multi-lang
WikiAnn	`panx_{lang}`	✅	✅	panx	XTREME, multi-lang
EP-POS	pos	✅		pos	Edge-Probing
QNLI	qnli	✅	✅	qnli	GLUE
QQP	qqp	✅	✅	qqp	GLUE
ROPES	ropes	✅	✅	ropes
RACE	race	✅	✅	race	`race`, `race_middle`, `race_high`
ReCord	record	✅	✅	record	SuperGLUE
RTE	rte	✅	✅	rte	GLUE, SuperGLUE
SciTail	scitail	✅	✅	scitail
SentEval: Bigram Shift	senteval_bigram_shift	✅	✅	senteval_bigram_shift	SentEval
SentEval: Coord Inversion	senteval_coordination_inversion	✅	✅	senteval_coordination_inversion	SentEval
SentEval: Obj number	senteval_obj_number	✅	✅	senteval_obj_number	SentEval
SentEval: Odd Man Out	senteval_odd_man_out	✅	✅	senteval_odd_man_out	SentEval
SentEval: Past-Present	senteval_past_present	✅	✅	senteval_past_present	SentEval
SentEval: Sentence Length	senteval_sentence_length	✅	✅	senteval_sentence_length	SentEval
SentEval: Subj Number	senteval_subj_number	✅	✅	senteval_subj_number	SentEval
SentEval: Top Constituents	senteval_top_constituents	✅	✅	senteval_top_constituents	SentEval
SentEval: Tree Depth	senteval_tree_depth	✅	✅	senteval_tree_depth	SentEval
SentEval: Word Content	senteval_word_content	✅	✅	senteval_word_content	SentEval
EP-Rel	semeval	✅		semeval	Edge-Probing
SNLI	snli	✅	✅	snli
SocialIQA	socialiqa	✅	✅	socialiqa
EP-SPR1	spr1	✅		spr1	Edge-Probing
EP-SPR2	spr2	✅		spr2	Edge-Probing
SQuAD 1.1	squad_v1	✅	✅	squad
SQuAD 2.0	squad_v2	✅	✅	squad
EP-SRL	srl	✅		srl	Edge-Probing
SST-2	sst	✅	✅	sst	GLUE
STS-B	stsb	✅	✅	stsb	GLUE
SuperGLUE Broad Coverage Diagnostic	superglue_axb	✅	✅	superglue_axb	SuperGLUE
SWAG	swag	✅	✅	swag
Tatoeba	`tatoeba_{lang}`	✅	✅	tatoeba	XTREME, multi-lang
TyDiQA	`tydiqa_{lang}`	✅	✅	tydiqa	XTREME, multi-lang
UDPOS	`udpos_{lang}`	✅	✅	udpos	XTREME, multi-lang
WiC	wic	✅	✅	wic	SuperGLUE
Winogrande	winogrande	✅	✅	winogrande
WNLI	wnli	✅	✅	wnli	GLUE
WSC	wsc	✅	✅	wsc	SuperGLUE
XNLI	`xnli_{lang}`	✅	✅	xnli	XTREME, multi-lang
XQuAD	`xquad_{lang}`	✅	✅	xquad	XTREME, multi-lang

task_name: Name-by-convention, used by downloader, and used in JiantModel to map from task names to task-models. You can change this as long as your settings are internally consistent.
jiant: Whether it's supported in jiant (i.e. you can train/eval on it)
Downloader: Whether you can download using the downloader.
jiant_task_name: Used to determine the programmatic behavior for the task (how to tokenize, what kind of task-model is compatible). Is tied directly to the code. See: jiant.tasks.retrieval.