SOTAVerified

Coreference Resolution

Papers

Showing 176–200 of 880 papers

| Title | Status | Hype |
| --- | --- | --- |
| Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD | Code | 0 |
| ENPAR: Enhancing Entity and Entity Pair Representations for Joint Entity Relation Extraction | Code | 0 |
| BiPaR: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Reading Comprehension on Novels | Code | 0 |
| Entity-Level Sentiment Analysis (ELSA): An exploratory task survey | Code | 0 |
| Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence Pooling | Code | 0 |
| Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information | Code | 0 |
| GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic Evaluation | Code | 0 |
| Error-Driven Analysis of Challenges in Coreference Resolution | Code | 0 |
| Exploring Pre-Trained Transformers and Bilingual Transfer Learning for Arabic Coreference Resolution | Code | 0 |
| CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog | Code | 0 |
| How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation? | Code | 0 |
| From Text to Lexicon: Bridging the Gap between Word Embeddings and Lexical Resources | Code | 0 |
| IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMs | Code | 0 |
| Incorporating Context and External Knowledge for Pronoun Coreference Resolution | Code | 0 |
| Collecting Visually-Grounded Dialogue with A Game Of Sorts | Code | 0 |
| EasyECR: A Library for Easy Implementation and Evaluation of Event Coreference Resolution Models | Code | 0 |
| Dynamic Entity Representations in Neural Language Models | Code | 0 |
| Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolution | Code | 0 |
| Investigating Multilingual Coreference Resolution by Universal Annotations | Code | 0 |
| SP-10K: A Large-scale Evaluation Set for Selectional Preference Acquisition | Code | 0 |
| Joint Coreference Resolution and Character Linking for Multiparty Conversation | Code | 0 |
| WikiCREM: A Large Unsupervised Corpus for Coreference Resolution | Code | 0 |
| Knowledge-aware Pronoun Coreference Resolution | Code | 0 |
| Ellipsis Resolution as Question Answering: An Evaluation | Code | 0 |
| Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures | Code | 0 |

Benchmark Results

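| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PaLM 540B (fine-tuned) | Accuracy | 100 | | Unverified |
| 2 | Vega v2 6B (KD-based prompt transfer) | Accuracy | 98.6 | | Unverified |
| 3 | UL2 20B (fine-tuned) | Accuracy | 98.1 | | Unverified |
| 4 | Turing NLR v5 XXL 5.4B (fine-tuned) | Accuracy | 97.3 | | Unverified |
| 5 | ST-MoE-32B 269B (fine-tuned) | Accuracy | 96.6 | | Unverified |
| 6 | DeBERTa-1.5B | Accuracy | 95.9 | | Unverified |
| 7 | T5-XXL 11B (fine-tuned) | Accuracy | 93.8 | | Unverified |
| 8 | ST-MoE-L 4.1B (fine-tuned) | Accuracy | 93.3 | | Unverified |
| 9 | RoBERTa-WinoGrande 355M | Accuracy | 90.1 | | Unverified |
| 10 | Flan-T5 XXL (zero-shot) | Accuracy | 89.82 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Maverick_mes | F1 | 83.6 | | Unverified |
| 2 | seq2seq | F1 | 83.3 | | Unverified |
| 3 | ASP+T0-3B | F1 | 82.3 | | Unverified |
| 4 | caw-coref + RoBERTa | F1 | 81.6 | | Unverified |
| 5 | LingMess | F1 | 81.4 | | Unverified |
| 6 | wl-coref + RoBERTa | F1 | 81 | | Unverified |
| 7 | U-MEM + Longformer | F1 | 80.9 | | Unverified |
| 8 | longdoc S (OntoNotes + 60k pseudo-singletons) | F1 | 80.6 | | Unverified |
| 9 | G2GT SpanBERT-large reduced | F1 | 80.5 | | Unverified |
| 10 | G2GT SpanBERT-large overlap | F1 | 80.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Maverick_mes | Avg F1 | 83.6 | | Unverified |
| 2 | seq2seq | Avg F1 | 83.3 | | Unverified |
| 3 | CorefQA + SpanBERT-large | Avg F1 | 83.1 | | Unverified |
| 4 | ASP+T0-3B | Avg F1 | 82.3 | | Unverified |
| 5 | wl-coref + RoBERTa | Avg F1 | 81 | | Unverified |
| 6 | s2e + Longformer-Large | Avg F1 | 80.3 | | Unverified |
| 7 | SpanBERT + Cluster Merging | Avg F1 | 80.2 | | Unverified |
| 8 | c2f + SpanBERT-Large | Avg F1 | 80.2 | | Unverified |
| 9 | CorefQA + SpanBERT-base | Avg F1 | 79.9 | | Unverified |
| 10 | U-MEM* + SpanBERT-large | Avg F1 | 79.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Coref-MTL | Overall F1 | 92.72 | | Unverified |
| 2 | ProBERT | Overall F1 | 92.5 | | Unverified |
| 3 | Maverick_incr | Overall F1 | 91.2 | | Unverified |
| 4 | Full Ensemble | Overall F1 | 90.2 | | Unverified |
| 5 | PeTra | F1 | 85.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | REXEL | Avg. F1 | 95.12 | | Unverified |
| 2 | Joint | Avg. F1 | 91.6 | | Unverified |
| 3 | KB-both | Avg. F1 | 91.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Maverick_mes | F1 | 66.8 | | Unverified |
| 2 | longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons) | F1 | 62.5 | | Unverified |
| 3 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 60.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DeepStruct multi-task w/ finetune | Average F1 | 73.1 | | Unverified |
| 2 | DeepStruct multi-task | Average F1 | 60.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Maverick_incr | Avg F1 | 78.3 | | Unverified |
| 2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 78.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MTL-coref | Avg F1 | 68.2 | | Unverified |
| 2 | SpanBERT | Avg F1 | 64.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Maverick_incr | F1 | 88 | | Unverified |
| 2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 87.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BFCR + SpanBERT + Transfer Learning | CoNLL F1 | 61.4 | | Unverified |
| 2 | BFCR + SpanBERT | CoNLL F1 | 50.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | mT0-13B | Accuracy | 81.29 | | Unverified |
| 2 | BLOOMZ | Accuracy | 69.08 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | mT0-13B | Accuracy | 78.31 | | Unverified |
| 2 | BLOOMZ | Accuracy | 68.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 42.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | dali-full-anaphora | Avg F1 | 77.9 | | Unverified |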