SOTAVerified

Coreference Resolution

Papers

Showing 150 of 880 papers

TitleStatusHype
CORE-KG: An LLM-Driven Knowledge Graph Construction Framework for Human Smuggling Networks0
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic StructuresCode0
Multimodal Coreference Resolution for Chinese Social Media Dialogues: Dataset and Benchmark Approach0
Long-context Non-factoid Question Answering in Indic LanguagesCode0
RAKG:Document-level Retrieval Augmented Knowledge Graph ConstructionCode3
Cross-Document Contextual Coreference Resolution in Knowledge Graphs0
A Rule Based Solution to Co-reference Resolution in Clinical Text0
LegalCore: A Dataset for Legal Documents Event Coreference ResolutionCode0
DR.GAP: Mitigating Bias in Large Language Models using Gender-Aware Prompting with Demonstration and Reasoning0
Reverse Probing: Evaluating Knowledge Transfer via Finetuned Task Embeddings for Coreference Resolution0
The Role of Natural Language Processing Tasks in Automatic Literary Character Network ConstructionCode0
IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark for LLMsCode0
Data-driven Coreference-based Ontology BuildingCode0
Findings of the Third Shared Task on Multilingual Coreference ResolutionCode0
Solving the Challenge Set without Solving the Task: On Winograd Schemas as a Test of Pronominal Coreference ResolutionCode0
How Language Models Prioritize Contextual Grammatical Cues?Code0
CorPipe at CRAC 2024: Predicting Zero Mentions from Raw TextCode0
Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding0
J2N -- Nominal Adjective Identification and its ApplicationCode0
WinoPron: Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical CaseCode0
Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUDCode0
Maverick: Efficient and Accurate Coreference Resolution Defying Recent TrendsCode2
Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language ModelsCode0
Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic InformationCode0
Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction ParadigmCode0
SEAM: A Stochastic Benchmark for Multi-Document Tasks0
Contrastive Entity Coreference and Disambiguation for Historical Texts0
Major Entity Identification: A Generalizable Alternative to Coreference ResolutionCode0
EasyECR: A Library for Easy Implementation and Evaluation of Event Coreference Resolution ModelsCode0
Ranking LLMs by compression0
ThaiCoref: Thai Coreference Resolution DatasetCode0
Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language ModelsCode1
Persian Pronoun Resolution: Leveraging Neural Networks and Language Models0
Labeling supervised fine-tuning data with the scaling lawCode7
Transforming Dutch: Debiasing Dutch Coreference Resolution Systems for Non-binary PronounsCode0
Asking and Answering Questions to Extract Event-Argument StructuresCode0
REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity LinkingCode1
Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality EnsemblesCode0
Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge DistillationCode0
A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference ResolutionCode0
KoCoNovel: Annotated Dataset of Character Coreference in Korean NovelsCode0
A Controlled Reevaluation of Coreference Resolution ModelsCode0
Linear Cross-document Event Coreference Resolution with X-AMRCode0
SPLICE: A Singleton-Enhanced PipeLIne for Coreference REsolutionCode0
Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations0
Multilingual Coreference Resolution in Low-resource South Asian LanguagesCode0
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries0
GUMsley: Evaluating Entity Salience in Summarization for 12 English Genres0
From Dialogue to Diagram: Task and Relationship Extraction from Natural Language for Accelerated Business Process Prototyping0
Towards Transparency in Coreference Resolution: A Quantum-Inspired ApproachCode0
Show:102550
← PrevPage 1 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7c2f + SpanBERT-LargeAvg F180.2Unverified
8SpanBERT + Cluster MergingAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified