SOTAVerified

Coreference Resolution

Papers

Showing 2650 of 880 papers

TitleStatusHype
A Case Study for Compliance as Code with Graphs and Language Models: Public release of the Regulatory Knowledge GraphCode1
ELIT: Emory Language and Information ToolkitCode1
2∗n is better than n^2: Decomposing Event Coreference Resolution into Two Tractable ProblemsCode1
A Cluster Ranking Model for Full Anaphora ResolutionCode1
Active Learning for Coreference Resolution using Discrete AnnotationCode1
DocAMR: Multi-Sentence AMR Representation and EvaluationCode1
2 * n is better than n^2: Decomposing Event Coreference Resolution into Two Tractable ProblemsCode1
DWIE: an entity-centric dataset for multi-task document-level information extractionCode1
End-to-End AMR Coreference ResolutionCode1
Exploring the Benefits of Training Expert Language Models over Instruction TuningCode1
A Context-Dependent Gated Module for Incorporating Symbolic Semantics into Event Coreference ResolutionCode1
A Structured Span SelectorCode1
Cross-document Event Coreference Search: Task, Dataset and ModelingCode1
CDLM: Cross-Document Language ModelingCode1
Attention Is (not) All You Need for Commonsense ReasoningCode1
Generalizing Cross-Document Event Coreference Resolution Across Multiple CorporaCode1
A sequence-to-sequence approach for document-level relation extractionCode1
Cross-document Misinformation Detection based on Event Graph ReasoningCode1
Deep contextualized word representationsCode1
CoRefi: A Crowd Sourcing Suite for Coreference AnnotationCode1
A Surprisingly Robust Trick for Winograd Schema ChallengeCode1
An Annotated Dataset of Coreference in English LiteratureCode1
BERT-based Cohesion Analysis of Japanese TextsCode1
A Hybrid Rule-Based and Neural Coreference Resolution System with an Evaluation on Dutch LiteratureCode1
CorefQA: Coreference Resolution as Query-based Span PredictionCode1
Show:102550
← PrevPage 2 of 36Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7c2f + SpanBERT-LargeAvg F180.2Unverified
8SpanBERT + Cluster MergingAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified