SOTAVerified

Coreference Resolution

Papers

Showing 801-825 of 880 papers

Title | Status | Hype
Deep Cross-Lingual Coreference Resolution for Less-Resourced Languages: The Case of Basque |  | 0
Deep Learning Embeddings for Discontinuous Linguistic Units |  | 0
Deep Neural Networks for Coreference Resolution for Polish |  | 0
Deep Reinforcement Learning for NLP |  | 0
Dependency parsing representation effects on the accuracy of semantic applications: an example of an inflective language |  | 0
Descending-Path Convolution Kernel for Syntactic Structures |  | 0
Detecting Scenes in Fiction: A new Segmentation Task |  | 0
Detecting Subevent Structure for Event Coreference Resolution |  | 0
Détection automatique de chaînes de coréférence pour le français écrit : règles et ressources adaptées au repérage de phénomènes linguistiques spécifiques (Automatic coreference resolution for written French: rules and resources for specific linguistic phenomena) |  | 0
Détection de coréférences de bout en bout en français (End-to-end coreference resolution for French) |  | 0
Deterministic Coreference Resolution Based on Entity-Centric, Precision-Ranked Rules |  | 0
DialogRE^C+: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in Dialogs |  | 0
Different Flavors of GUM: Evaluating Genre and Sentence Type Effects on Multilayer Corpus Annotation Quality |  | 0
Disambiguating Entities Referred by Web Endpoints using Tree Ensembles |  | 0
Discontinuous Genitives in Hindi/Urdu |  | 0
Discourse as a Function of Event: Profiling Discourse Structure in News Articles around the Main Event |  | 0
Discovering Implicit Discourse Relations Through Brown Cluster Pair Representation and Coreference Patterns |  | 0
Distributional Semantics for Resolving Bridging Mentions |  | 0
DIT: Summarisation and Semantic Expansion in Evaluating Semantic Similarity |  | 0
Docforia: A Multilayer Document Model |  | 0
Domain Adaptation for Coreference Resolution: An Adaptive Ensemble Approach |  | 0
Domain Adaptation of Coreference Resolution for Radiology Reports |  | 0
Domain Adaptation with Active Learning for Coreference Resolution |  | 0
Domain-Specific Coreference Resolution with Lexicalized Features |  | 0
Domain-specific vs. Uniform Modeling for Coreference Resolution |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PaLM 540B (fine-tuned) | Accuracy | 100 |  | Unverified
2 | Vega v2 6B (KD-based prompt transfer) | Accuracy | 98.6 |  | Unverified
3 | UL2 20B (fine-tuned) | Accuracy | 98.1 |  | Unverified
4 | Turing NLR v5 XXL 5.4B (fine-tuned) | Accuracy | 97.3 |  | Unverified
5 | ST-MoE-32B 269B (fine-tuned) | Accuracy | 96.6 |  | Unverified
6 | DeBERTa-1.5B | Accuracy | 95.9 |  | Unverified
7 | T5-XXL 11B (fine-tuned) | Accuracy | 93.8 |  | Unverified
8 | ST-MoE-L 4.1B (fine-tuned) | Accuracy | 93.3 |  | Unverified
9 | RoBERTa-WinoGrande 355M | Accuracy | 90.1 |  | Unverified
10 | Flan-T5 XXL (zero-shot) | Accuracy | 89.82 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | F1 | 83.6 |  | Unverified
2 | seq2seq | F1 | 83.3 |  | Unverified
3 | ASP+T0-3B | F1 | 82.3 |  | Unverified
4 | caw-coref + RoBERTa | F1 | 81.6 |  | Unverified
5 | LingMess | F1 | 81.4 |  | Unverified
6 | wl-coref + RoBERTa | F1 | 81 |  | Unverified
7 | U-MEM + Longformer | F1 | 80.9 |  | Unverified
8 | longdoc S (OntoNotes + 60k pseudo-singletons) | F1 | 80.6 |  | Unverified
9 | G2GT SpanBERT-large reduced | F1 | 80.5 |  | Unverified
10 | G2GT SpanBERT-large overlap | F1 | 80.2 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | Avg F1 | 83.6 |  | Unverified
2 | seq2seq | Avg F1 | 83.3 |  | Unverified
3 | CorefQA + SpanBERT-large | Avg F1 | 83.1 |  | Unverified
4 | ASP+T0-3B | Avg F1 | 82.3 |  | Unverified
5 | wl-coref + RoBERTa | Avg F1 | 81 |  | Unverified
6 | s2e + Longformer-Large | Avg F1 | 80.3 |  | Unverified
7 | SpanBERT + Cluster Merging | Avg F1 | 80.2 |  | Unverified
8 | c2f + SpanBERT-Large | Avg F1 | 80.2 |  | Unverified
9 | CorefQA + SpanBERT-base | Avg F1 | 79.9 |  | Unverified
10 | U-MEM* + SpanBERT-large | Avg F1 | 79.6 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Coref-MTL | Overall F1 | 92.72 |  | Unverified
2 | ProBERT | Overall F1 | 92.5 |  | Unverified
3 | Maverick_incr | Overall F1 | 91.2 |  | Unverified
4 | Full Ensemble | Overall F1 | 90.2 |  | Unverified
5 | PeTra | F1 | 85.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | REXEL | Avg. F1 | 95.12 |  | Unverified
2 | Joint | Avg. F1 | 91.6 |  | Unverified
3 | KB-both | Avg. F1 | 91.5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_mes | F1 | 66.8 |  | Unverified
2 | longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons) | F1 | 62.5 |  | Unverified
3 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 60.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DeepStruct multi-task w/ finetune | Average F1 | 73.1 |  | Unverified
2 | DeepStruct multi-task | Average F1 | 60.6 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_incr | Avg F1 | 78.3 |  | Unverified
2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 78.2 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MTL-coref | Avg F1 | 68.2 |  | Unverified
2 | SpanBERT | Avg F1 | 64.6 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Maverick_incr | F1 | 88 |  | Unverified
2 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 87.6 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BFCR + SpanBERT + Transfer Learning | CoNLL F1 | 61.4 |  | Unverified
2 | BFCR + SpanBERT | CoNLL F1 | 50.4 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | mT0-13B | Accuracy | 81.29 |  | Unverified
2 | BLOOMZ | Accuracy | 69.08 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | mT0-13B | Accuracy | 78.31 |  | Unverified
2 | BLOOMZ | Accuracy | 68.67 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | longdoc S (OntoNotes + PreCo + LitBank) | F1 | 42.9 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | dali-full-anaphora | Avg F1 | 77.9 |  | Unverified