SOTAVerified

Coreference Resolution

Papers

Showing 301325 of 880 papers

TitleStatusHype
Descending-Path Convolution Kernel for Syntactic Structures0
Different Flavors of GUM: Evaluating Genre and Sentence Type Effects on Multilayer Corpus Annotation Quality0
BART goes multilingual: The UniTN / Essex submission to the CoNLL-2012 Shared Task0
Disambiguating Entities Referred by Web Endpoints using Tree Ensembles0
Dependency parsing representation effects on the accuracy of semantic applications --- an example of an inflective language0
Discourse as a Function of Event: Profiling Discourse Structure in News Articles around the Main Event0
Better Call the Plumber: Orchestrating Dynamic Information Extraction Pipelines0
Discovering Implicit Discourse Relations Through Brown Cluster Pair Representation and Coreference Patterns0
Distributional Semantics for Resolving Bridging Mentions0
DIT: Summarisation and Semantic Expansion in Evaluating Semantic Similarity0
Bacteria Biotope Detection, Ontology-based Normalization, and Relation Extraction using Syntactic Rules0
Docforia: A Multilayer Document Model0
Beyond Plain Spatial Knowledge: Determining Where Entities Are and Are Not Located, and For How Long0
Domain Adaptation for Coreference Resolution: An Adaptive Ensemble Approach0
Domain Adaptation of Coreference Resolution for Radiology Reports0
Domain Adaptation with Active Learning for Coreference Resolution0
Domain-Specific Coreference Resolution with Lexicalized Features0
Domain-specific vs. Uniform Modeling for Coreference Resolution0
Analysis of Coreference Relations in the Biomedical Literature0
DramaCoref: A Hybrid Coreference Resolution System for German Theater Plays0
DR.GAP: Mitigating Bias in Large Language Models using Gender-Aware Prompting with Demonstration and Reasoning0
Deep Reinforcement Learning for NLP0
Anatomy of OntoGUM—Adapting GUM to the OntoNotes Scheme to Evaluate Robustness of SOTA Coreference Algorithms0
Dynamic Knowledge-Base Alignment for Coreference Resolution0
Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema0
Show:102550
← PrevPage 13 of 36Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7SpanBERT + Cluster MergingAvg F180.2Unverified
8c2f + SpanBERT-LargeAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified