SOTAVerified

Coreference Resolution

Papers

Showing 101-150 of 880 papers

Title | Status | Hype
Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Code | 1
Longtonotes: OntoNotes with Longer Coreference Chains | Code | 1
Autoregressive Structured Prediction with Language Models | Code | 1
F-coref: Fast, Accurate and Easy to Use Coreference Resolution | Code | 1
Improving Coreference Resolution by Learning Entity-Level Distributed Representations | Code | 0
AmbiCoref: Evaluating Human and Model Sensitivity to Ambiguous Coreference | Code | 0
Improving Multi-turn Dialogue Modelling with Utterance ReWriter | Code | 0
Asking and Answering Questions to Extract Event-Argument Structures | Code | 0
A Simple Method for Commonsense Reasoning | Code | 0
How Language Models Prioritize Contextual Grammatical Cues? | Code | 0
Improving Span Representation for Domain-adapted Coreference Resolution | Code | 0
Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction Paradigm | Code | 0
He Said, She Said: Style Transfer for Shifting the Perspective of Dialogues | Code | 0
Are Large Language Models Robust Coreference Resolvers? | Code | 0
GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic Evaluation | Code | 0
A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution | Code | 0
A Causal Inference Method for Reducing Gender Bias in Word Embedding Relations | Code | 0
Graph Refinement for Coreference Resolution | Code | 0
How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation? | Code | 0
Incorporating Centering Theory into Neural Coreference Resolution | Code | 0
Gender Bias in Neural Natural Language Processing | Code | 0
From Text to Lexicon: Bridging the Gap between Word Embeddings and Lexical Resources | Code | 0
Free the Plural: Unrestricted Split-Antecedent Anaphora Resolution | Code | 0
Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence Pooling | Code | 0
Findings of the Third Shared Task on Multilingual Coreference Resolution | Code | 0
IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMs | Code | 0
Findings of the Shared Task on Multilingual Coreference Resolution | Code | 0
Fill the GAP: Exploiting BERT for Pronoun Resolution | Code | 0
Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference | Code | 0
Gendered Pronoun Resolution using BERT and an extractive question answering formulation | Code | 0
Exploring Span Representations in Neural Coreference Resolution | Code | 0
A Controlled Reevaluation of Coreference Resolution Models | Code | 0
Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD | Code | 0
Exploring Pre-Trained Transformers and Bilingual Transfer Learning for Arabic Coreference Resolution | Code | 0
Exophoric Pronoun Resolution in Dialogues with Topic Regularization | Code | 0
A Hybrid Neural Network Model for Commonsense Reasoning | Code | 0
Expletives in Universal Dependency Treebanks | Code | 0
SP-10K: A Large-scale Evaluation Set for Selectional Preference Acquisition | Code | 0
Event Coreference Resolution for Contentious Politics Events | Code | 0
Exploring Multi-Modal Representations for Ambiguity Detection & Coreference Resolution in the SIMMC 2.0 Challenge | Code | 0
ezCoref: Towards Unifying Annotation Guidelines for Coreference Resolution | Code | 0
GENE: Global Event Network Embedding | Code | 0
Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic Information | Code | 0
ENPAR:Enhancing Entity and Entity Pair Representations for Joint Entity Relation Extraction | Code | 0
End-to-end Neural Coreference Resolution | Code | 0
Entity-Level Sentiment Analysis (ELSA): An exploratory task survey | Code | 0
EasyECR: A Library for Easy Implementation and Evaluation of Event Coreference Resolution Models | Code | 0
Ellipsis Resolution as Question Answering: An Evaluation | Code | 0
A Brief Survey and Comparative Study of Recent Development of Pronoun Coreference Resolution | Code | 0
Dynamic Entity Representations in Neural Language Models | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM 540B (fine-tuned) | Accuracy | 100 | | Unverified
2 | Vega v2 6B (KD-based prompt transfer) | Accuracy | 98.6 | | Unverified
3 | UL2 20B (fine-tuned) | Accuracy | 98.1 | | Unverified
4 | Turing NLR v5 XXL 5.4B (fine-tuned) | Accuracy | 97.3 | | Unverified
5 | ST-MoE-32B 269B (fine-tuned) | Accuracy | 96.6 | | Unverified
6 | DeBERTa-1.5B | Accuracy | 95.9 | | Unverified
7 | T5-XXL 11B (fine-tuned) | Accuracy | 93.8 | | Unverified
8 | ST-MoE-L 4.1B (fine-tuned) | Accuracy | 93.3 | | Unverified
9 | RoBERTa-WinoGrande 355M | Accuracy | 90.1 | | Unverified
10 | Flan-T5 XXL (zero-shot) | Accuracy | 89.82 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | F1 | 83.6 | | Unverified
2 | seq2seq | F1 | 83.3 | | Unverified
3 | ASP+T0-3B | F1 | 82.3 | | Unverified
4 | caw-coref + RoBERTa | F1 | 81.6 | | Unverified
5 | LingMess | F1 | 81.4 | | Unverified
6 | wl-coref + RoBERTa | F1 | 81 | | Unverified
7 | U-MEM + Longformer | F1 | 80.9 | | Unverified
8 | longdoc S (OntoNotes + 60k pseudo-singletons) | F1 | 80.6 | | Unverified
9 | G2GT SpanBERT-large reduced | F1 | 80.5 | | Unverified
10 | G2GT SpanBERT-large overlap | F1 | 80.2 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | Avg F1 | 83.6 | | Unverified
2 | seq2seq | Avg F1 | 83.3 | | Unverified
3 | CorefQA + SpanBERT-large | Avg F1 | 83.1 | | Unverified
4 | ASP+T0-3B | Avg F1 | 82.3 | | Unverified
5 | wl-coref + RoBERTa | Avg F1 | 81 | | Unverified
6 | s2e + Longformer-Large | Avg F1 | 80.3 | | Unverified
7 | SpanBERT + Cluster Merging | Avg F1 | 80.2 | | Unverified
8 | c2f + SpanBERT-Large | Avg F1 | 80.2 | | Unverified
9 | CorefQA + SpanBERT-base | Avg F1 | 79.9 | | Unverified
10 | U-MEM* + SpanBERT-large | Avg F1 | 79.6 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Coref-MTL | Overall F1 | 92.72 | | Unverified
2 | ProBERT | Overall F1 | 92.5 | | Unverified
3 | Maverick_incr | Overall F1 | 91.2 | | Unverified
4 | Full Ensemble | Overall F1 | 90.2 | | Unverified
5 | PeTra | F1 | 85.3 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | REXEL | Avg. F1 | 95.12 | | Unverified
2 | Joint | Avg. F1 | 91.6 | | Unverified
3 | KB-both | Avg. F1 | 91.5 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | F1 | 66.8 | | Unverified
2 | longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons) | F1 | 62.5 | | Unverified
3 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 60.3 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | DeepStruct multi-task w/ finetune | Average F1 | 73.1 | | Unverified
2 | DeepStruct multi-task | Average F1 | 60.6 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Maverick_incr | Avg F1 | 78.3 | | Unverified
2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 78.2 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MTL-coref | Avg F1 | 68.2 | | Unverified
2 | SpanBERT | Avg F1 | 64.6 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Maverick_incr | F1 | 88 | | Unverified
2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 87.6 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | BFCR + SpanBERT + Transfer Learning | CoNLL F1 | 61.4 | | Unverified
2 | BFCR + SpanBERT | CoNLL F1 | 50.4 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | mT0-13B | Accuracy | 81.29 | | Unverified
2 | BLOOMZ | Accuracy | 69.08 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | mT0-13B | Accuracy | 78.31 | | Unverified
2 | BLOOMZ | Accuracy | 68.67 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 42.9 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | dali-full-anaphora | Avg F1 | 77.9 | | Unverified