SOTAVerified

Coreference Resolution

Papers

Showing 401450 of 880 papers

TitleStatusHype
Context-aware Attention Model for Coreference Resolution0
All Roads Lead to UD: Converting Stanford and Penn Parses to English Universal Dependencies with Multilayer Annotations0
Cross-Lingual Coreference: The Case of Bulgarian and English0
What You See is What You Get: Visual Pronoun Coreference Resolution in DialoguesCode0
Ellipsis Resolution as Question Answering: An EvaluationCode0
Partially-supervised Mention Detection0
BERT for Coreference Resolution: Baselines and AnalysisCode0
WikiCREM: A Large Unsupervised Corpus for Coreference ResolutionCode0
Quoref: A Reading Comprehension Dataset with Questions Requiring Coreferential ReasoningCode1
Improving Generalization in Coreference Resolution via Adversarial Training0
Fill the GAP: Exploiting BERT for Pronoun ResolutionCode0
Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence PoolingCode0
On GAP Coreference Resolution Shared Task: Insights from the 3rd Place Solution0
End-to-End Neural Context Reconstruction in Chinese Dialogue0
Neural Mention DetectionCode0
A Hybrid Neural Network Model for Commonsense Reasoning0
WinoGrande: An Adversarial Winograd Schema Challenge at ScaleCode1
SpanBERT: Improving Pre-training by Representing and Predicting SpansCode0
Solving Hard Coreference Problems0
Knowledge-aware Pronoun Coreference ResolutionCode0
Using Thesaurus Data to Improve Coreference Resolution for Russian0
R\'esolution des cor\'ef\'erences neuronale : une approche bas\'ee sur les t\^etes (Neural coreference resolution : a head-based approach)0
D\'etection automatique de cha\^ de cor\'ef\'erence pour le fran \'ecrit: r\`egles et ressources adapt\'ees au rep\'erage de ph\'enom\`enes linguistiques sp\'ecifiques (Automatic coreference resolution for written French : rules and resources for specific linguistic phenomena)0
End-to-end Deep Reinforcement Learning Based Coreference Resolution0
Coreference Resolution with Entity EqualizationCode1
Crowdsourcing and Aggregating Nested Markable AnnotationsCode0
Wikipedia as a Resource for Text Analysis and Retrieval0
Model-based annotation of coreferenceCode0
Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary DetectionCode0
Improving Multi-turn Dialogue Modelling with Utterance ReWriterCode0
Gendered Pronoun Resolution using BERT and an extractive question answering formulationCode0
Revisiting Joint Modeling of Cross-document Entity and Event Coreference ResolutionCode0
Resolving Gendered Ambiguous Pronouns with BERTCode0
Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence PoolingCode0
Evaluating Gender Bias in Machine TranslationCode1
Evaluation of named entity coreference0
Neural Coreference Resolution with Limited Lexical Context and Explicit Mention Detection for Oral French0
Deep Cross-Lingual Coreference Resolution for Less-Resourced Languages: The Case of Basque0
Cross-lingual Incongruences in the Annotation of Coreference0
Improving Event Coreference Resolution by Learning Argument Compatibility from Unlabeled Data0
GenderQuant: Quantifying Mention-Level Genderedness0
Attention Is (not) All You Need for Commonsense ReasoningCode1
Sentence Level Representation And Language Models In The Task Of Coreference Resolution For Russian0
Incorporating Context and External Knowledge for Pronoun Coreference ResolutionCode0
A Surprisingly Robust Trick for Winograd Schema ChallengeCode1
SP-10K: A Large-scale Evaluation Set for Selectional Preference AcquisitionCode0
SocialIQA: Commonsense Reasoning about Social InteractionsCode0
Unsupervised Deep Structured Semantic Models for Commonsense Reasoning0
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual DialogCode0
Language Models are Unsupervised Multitask LearnersCode1
Show:102550
← PrevPage 9 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7SpanBERT + Cluster MergingAvg F180.2Unverified
8c2f + SpanBERT-LargeAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified