SOTAVerified

Coreference Resolution

Papers

Showing 251–275 of 880 papers

Title | Status | Hype
Adapted End-to-End Coreference Resolution System for Anaphoric Identities in Dialogues | - | 0
Towards Consistent Document-level Entity Linking: Joint Models for Entity Linking and Coreference Resolution | Code | 1
Annotation and Evaluation of Coreference Resolution in Screenplays | - | 0
Exploiting Document Structures and Cluster Consistencies for Event Coreference Resolution | - | 0
Joint Detection and Coreference Resolution of Entities and Events with Document-level Context Aggregation | - | 0
End-to-End AMR Coreference Resolution | Code | 1
Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets | - | 0
Multilingual Coreference Resolution with Harmonized Annotations | - | 0
Injecting Knowledge Base Information into End-to-End Joint Entity and Relation Extraction and Coreference Resolution | Code | 1
End-to-end Neural Coreference Resolution Revisited: A Simple yet Effective Baseline | - | 0
The MultiBERTs: BERT Reproductions for Robustness Analysis | Code | 0
Realistic Evaluation Principles for Cross-document Coreference Resolution | Code | 1
Self-supervised Dialogue Learning for Spoken Conversational Question Answering | - | 0
OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More Genres | Code | 0
Cross-document Coreference Resolution over Predicted Mentions | Code | 1
Annotating anaphoric phenomena in situated dialogue | - | 0
qxoRef 1.0: A coreference corpus and mention-pair baseline for coreference resolution in Conchucos Quechua | Code | 0
Hierarchical Graph Convolutional Networks for Jointly Resolving Cross-document Coreference of Entity and Event Mentions | - | 0
GENE: Global Event Network Embedding | Code | 0
Constrained Multi-Task Learning for Event Coreference Resolution | Code | 0
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System | Code | 1
Incorporating Syntax and Semantics in Coreference Resolution with Heterogeneous Graph Attention Network | Code | 1
CEREC: A Corpus for Entity Resolution in Email Conversations | Code | 0
CREAD: Combined Resolution of Ellipses and Anaphora in Dialogues | Code | 1
The Swedish Winogender Dataset | - | 0
Page 11 of 36

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM 540B (fine-tuned) | Accuracy | 100 | - | Unverified
2 | Vega v2 6B (KD-based prompt transfer) | Accuracy | 98.6 | - | Unverified
3 | UL2 20B (fine-tuned) | Accuracy | 98.1 | - | Unverified
4 | Turing NLR v5 XXL 5.4B (fine-tuned) | Accuracy | 97.3 | - | Unverified
5 | ST-MoE-32B 269B (fine-tuned) | Accuracy | 96.6 | - | Unverified
6 | DeBERTa-1.5B | Accuracy | 95.9 | - | Unverified
7 | T5-XXL 11B (fine-tuned) | Accuracy | 93.8 | - | Unverified
8 | ST-MoE-L 4.1B (fine-tuned) | Accuracy | 93.3 | - | Unverified
9 | RoBERTa-WinoGrande 355M | Accuracy | 90.1 | - | Unverified
10 | Flan-T5 XXL (zero-shot) | Accuracy | 89.82 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | F1 | 83.6 | - | Unverified
2 | seq2seq | F1 | 83.3 | - | Unverified
3 | ASP+T0-3B | F1 | 82.3 | - | Unverified
4 | caw-coref + RoBERTa | F1 | 81.6 | - | Unverified
5 | LingMess | F1 | 81.4 | - | Unverified
6 | wl-coref + RoBERTa | F1 | 81 | - | Unverified
7 | U-MEM + Longformer | F1 | 80.9 | - | Unverified
8 | longdoc S (OntoNotes + 60k pseudo-singletons) | F1 | 80.6 | - | Unverified
9 | G2GT SpanBERT-large reduced | F1 | 80.5 | - | Unverified
10 | G2GT SpanBERT-large overlap | F1 | 80.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | Avg F1 | 83.6 | - | Unverified
2 | seq2seq | Avg F1 | 83.3 | - | Unverified
3 | CorefQA + SpanBERT-large | Avg F1 | 83.1 | - | Unverified
4 | ASP+T0-3B | Avg F1 | 82.3 | - | Unverified
5 | wl-coref + RoBERTa | Avg F1 | 81 | - | Unverified
6 | s2e + Longformer-Large | Avg F1 | 80.3 | - | Unverified
7 | SpanBERT + Cluster Merging | Avg F1 | 80.2 | - | Unverified
8 | c2f + SpanBERT-Large | Avg F1 | 80.2 | - | Unverified
9 | CorefQA + SpanBERT-base | Avg F1 | 79.9 | - | Unverified
10 | U-MEM* + SpanBERT-large | Avg F1 | 79.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Coref-MTL | Overall F1 | 92.72 | - | Unverified
2 | ProBERT | Overall F1 | 92.5 | - | Unverified
3 | Maverick_incr | Overall F1 | 91.2 | - | Unverified
4 | Full Ensemble | Overall F1 | 90.2 | - | Unverified
5 | PeTra | F1 | 85.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | REXEL | Avg. F1 | 95.12 | - | Unverified
2 | Joint | Avg. F1 | 91.6 | - | Unverified
3 | KB-both | Avg. F1 | 91.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | F1 | 66.8 | - | Unverified
2 | longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons) | F1 | 62.5 | - | Unverified
3 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 60.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DeepStruct multi-task w/ finetune | Average F1 | 73.1 | - | Unverified
2 | DeepStruct multi-task | Average F1 | 60.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_incr | Avg F1 | 78.3 | - | Unverified
2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 78.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MTL-coref | Avg F1 | 68.2 | - | Unverified
2 | SpanBERT | Avg F1 | 64.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_incr | F1 | 88 | - | Unverified
2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 87.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BFCR + SpanBERT + Transfer Learning | CoNLL F1 | 61.4 | - | Unverified
2 | BFCR + SpanBERT | CoNLL F1 | 50.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | mT0-13B | Accuracy | 81.29 | - | Unverified
2 | BLOOMZ | Accuracy | 69.08 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | mT0-13B | Accuracy | 78.31 | - | Unverified
2 | BLOOMZ | Accuracy | 68.67 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 42.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | dali-full-anaphora | Avg F1 | 77.9 | - | Unverified