SOTAVerified

Coreference Resolution

Papers

Showing 151200 of 880 papers

TitleStatusHype
How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation?Code0
Improving Multi-turn Dialogue Modelling with Utterance ReWriterCode0
GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic EvaluationCode0
Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence PoolingCode0
Graph Refinement for Coreference ResolutionCode0
Character Identification on Multiparty Conversation: Identifying Mentions of Characters in TV ShowsCode0
CHAMP: Efficient Annotation and Consolidation of Cluster HierarchiesCode0
CEREC: A Corpus for Entity Resolution in Email ConversationsCode0
A Brief Survey and Comparative Study of Recent Development of Pronoun Coreference ResolutionCode0
Improving Span Representation for Domain-adapted Coreference ResolutionCode0
Findings of the Third Shared Task on Multilingual Coreference ResolutionCode0
Fill the GAP: Exploiting BERT for Pronoun ResolutionCode0
An Empirical Study of Chinese Name Matching and ApplicationsCode0
Exploring Pre-Trained Transformers and Bilingual Transfer Learning for Arabic Coreference ResolutionCode0
Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence PoolingCode0
Exploring Span Representations in Neural Coreference ResolutionCode0
ezCoref: Towards Unifying Annotation Guidelines for Coreference ResolutionCode0
Focus on what matters: Applying Discourse Coherence Theory to Cross Document CoreferenceCode0
Exophoric Pronoun Resolution in Dialogues with Topic RegularizationCode0
Event Coreference Resolution for Contentious Politics EventsCode0
Expletives in Universal Dependency TreebanksCode0
Evaluating Coreference Resolvers on Community-based Question Answering: From Rule-based to State of the ArtCode0
Findings of the Shared Task on Multilingual Coreference ResolutionCode0
CAW-coref: Conjunction-Aware Word-level Coreference ResolutionCode0
Error-Driven Analysis of Challenges in Coreference ResolutionCode0
Event Coreference Data (Almost) for Free: Mining Hyperlinks from Online NewsCode0
Exploring Multi-Modal Representations for Ambiguity Detection & Coreference Resolution in the SIMMC 2.0 ChallengeCode0
From Text to Lexicon: Bridging the Gap between Word Embeddings and Lexical ResourcesCode0
Enhancing Cross-Document Event Coreference Resolution by Discourse Structure and Semantic InformationCode0
Gender Bias in Neural Natural Language ProcessingCode0
Gendered Pronoun Resolution using BERT and an extractive question answering formulationCode0
GENE: Global Event Network EmbeddingCode0
BiPaR: A Bilingual Parallel Dataset for Multilingual and Cross-lingual Reading Comprehension on NovelsCode0
ENPAR:Enhancing Entity and Entity Pair Representations for Joint Entity Relation ExtractionCode0
End-to-end Neural Coreference ResolutionCode0
He Said, She Said: Style Transfer for Shifting the Perspective of DialoguesCode0
Entity-Level Sentiment Analysis (ELSA): An exploratory task surveyCode0
Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUDCode0
Free the Plural: Unrestricted Split-Antecedent Anaphora ResolutionCode0
Collecting Visually-Grounded Dialogue with A Game Of SortsCode0
Incorporating Centering Theory into Neural Coreference ResolutionCode0
Dynamic Entity Representations in Neural Language ModelsCode0
EasyECR: A Library for Easy Implementation and Evaluation of Event Coreference Resolution ModelsCode0
WikiCREM: A Large Unsupervised Corpus for Coreference ResolutionCode0
A Hybrid Neural Network Model for Commonsense ReasoningCode0
Investigating Multilingual Coreference Resolution by Universal AnnotationsCode0
COMET-M: Reasoning about Multiple Events in Complex SentencesCode0
Jigg: A Framework for an Easy Natural Language Processing PipelineCode0
Does referent predictability affect the choice of referential form? A computational approach using masked coreference resolutionCode0
Dialogue Meaning Representation for Task-Oriented Dialogue SystemsCode0
Show:102550
← PrevPage 4 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7SpanBERT + Cluster MergingAvg F180.2Unverified
8c2f + SpanBERT-LargeAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified