SOTAVerified

Coreference Resolution

Papers

Showing 150 of 880 papers

TitleStatusHype
Labeling supervised fine-tuning data with the scaling lawCode7
Pythia: A Suite for Analyzing Large Language Models Across Training and ScalingCode6
Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice PerspectiveCode4
N-Grammer: Augmenting Transformers with latent n-gramsCode4
RAKG:Document-level Retrieval Augmented Knowledge Graph ConstructionCode3
Scaling Instruction-Finetuned Language ModelsCode3
ST-MoE: Designing Stable and Transferable Sparse Expert ModelsCode3
Finetuned Language Models Are Zero-Shot LearnersCode3
Language Models are Few-Shot LearnersCode3
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingCode3
Attention Is All You NeedCode3
Maverick: Efficient and Accurate Coreference Resolution Defying Recent TrendsCode2
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-TuningCode2
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale InstructionsCode2
Hungry Hungry Hippos: Towards Language Modeling with State Space ModelsCode2
Crosslingual Generalization through Multitask FinetuningCode2
Ask Me Anything: A simple strategy for prompting language modelsCode2
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq ModelCode2
PaLM: Scaling Language Modeling with PathwaysCode2
DeBERTa: Decoding-enhanced BERT with Disentangled AttentionCode2
Exploring the Limits of Transfer Learning with a Unified Text-to-Text TransformerCode2
Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language ModelsCode1
REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity LinkingCode1
Seq2seq is All You Need for Coreference ResolutionCode1
2∗n is better than n^2: Decomposing Event Coreference Resolution into Two Tractable ProblemsCode1
2 * n is better than n^2: Decomposing Event Coreference Resolution into Two Tractable ProblemsCode1
Radar de Parité: An NLP system to measure gender representation in French news storiesCode1
Exploring the Benefits of Training Expert Language Models over Instruction TuningCode1
A Case Study for Compliance as Code with Graphs and Language Models: Public release of the Regulatory Knowledge GraphCode1
Autoregressive Structured Prediction with Language ModelsCode1
Cross-document Event Coreference Search: Task, Dataset and ModelingCode1
Longtonotes: OntoNotes with Longer Coreference ChainsCode1
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot LearnersCode1
GRAVL-BERT: Graphical Visual-Linguistic Representations for Multimodal Coreference ResolutionCode1
F-coref: Fast, Accurate and Easy to Use Coreference ResolutionCode1
Knowledge Extraction From Texts Based on WikidataCode1
End-to-End Chinese Speaker IdentificationCode1
Cross-document Misinformation Detection based on Event Graph ReasoningCode1
Personal Entity, Concept, and Named Entity Linking in ConversationsCode1
VD-PCR: Improving Visual Dialog with Pronoun Coreference ResolutionCode1
LingMess: Linguistically Informed Multi Expert Scorers for Coreference ResolutionCode1
DeepStruct: Pretraining of Language Models for Structure PredictionCode1
UL2: Unifying Language Learning ParadigmsCode1
A Structured Span SelectorCode1
A sequence-to-sequence approach for document-level relation extractionCode1
Incorporating Constituent Syntax for Coreference ResolutionCode1
DocAMR: Multi-Sentence AMR Representation and EvaluationCode1
A Hybrid Rule-Based and Neural Coreference Resolution System with an Evaluation on Dutch LiteratureCode1
On Generalization in Coreference ResolutionCode1
Word-Level Coreference ResolutionCode1
Show:102550
← PrevPage 1 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7c2f + SpanBERT-LargeAvg F180.2Unverified
8SpanBERT + Cluster MergingAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified