SOTAVerified

Coreference Resolution

Papers

Showing 151200 of 880 papers

TitleStatusHype
Releasing the CRaQAn (Coreference Resolution in Question-Answering): An open-source dataset and dataset creation methodology using instruction-following models0
ÚFAL CorPipe at CRAC 2023: Larger Context Improves Multilingual Coreference ResolutionCode0
CHAMP: Efficient Annotation and Consolidation of Cluster HierarchiesCode0
Investigating Multilingual Coreference Resolution by Universal AnnotationsCode0
CorefPrompt: Prompt-based Event Coreference Resolution by Measuring Event Type and Argument CompatibilitiesCode0
Towards Harmful Erotic Content Detection through Coreference-Driven Contextual Analysis0
Semi-supervised multimodal coreference resolution in image narrationsCode0
Filling in the Gaps: Efficient Event Coreference Resolution using Graph Autoencoder Networks0
CAW-coref: Conjunction-Aware Word-level Coreference ResolutionCode0
A Survey of Document-Level Information Extraction0
Incorporating Singletons and Mention-based Features in Coreference Resolution via Multi-task Learning for Better GeneralizationCode0
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing0
Collecting Visually-Grounded Dialogue with A Game Of SortsCode0
RGAT: A Deeper Look into Syntactic Dependency Information for Coreference Resolution0
Gender-specific Machine Translation with Large Language Models0
Generalised Winograd Schema and its Contextuality0
PronounFlow: A Hybrid Approach for Calibrating Pronouns in Sentences0
DialogRE^C+: An Extension of DialogRE to Investigate How Much Coreference Helps Relation Extraction in Dialogs0
Athena 2.0: Discourse and User Modeling in Open Domain Dialogue0
Similarity-based Memory Enhanced Joint Entity and Relation ExtractionCode0
Better Handling Coreference Resolution in Aspect Level Sentiment Classification by Fine-Tuning Language Models0
SimpleMTOD: A Simple Language Model for Multimodal Task-Oriented Dialogue with Symbolic Scene Representation0
Improving Automatic Quotation Attribution in Literary Novels0
How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation?Code0
GENTLE: A Genre-Diverse Multilayer Challenge Set for English NLP and Linguistic EvaluationCode0
Light Coreference Resolution for Russian with Hierarchical Discourse FeaturesCode0
Examining risks of racial biases in NLP tools for child protective services0
Parallel Data Helps Neural Entity Coreference Resolution0
Sentence-Incremental Neural Coreference ResolutionCode0
COMET-M: Reasoning about Multiple Events in Complex SentencesCode0
Comparing Humans and Models on a Similar Scale: Towards Cognitive Gender Bias Evaluation in Coreference ResolutionCode0
Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective0
Are Large Language Models Robust Coreference Resolvers?Code0
PaLM 2 Technical ReportCode0
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance0
Entity-Level Sentiment Analysis (ELSA): An exploratory task surveyCode0
BenCoref: A Multi-Domain Dataset of Nominal Phrases and Pronominal Reference AnnotationsCode0
Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling PerspectiveCode0
Variational Quantum Classifiers for Natural-Language Text0
What happens before and after: Multi-Event Commonsense in Event Coreference ResolutionCode0
Evaluating and Improving the Coreference Capabilities of Machine Translation Models0
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns0
AmbiCoref: Evaluating Human and Model Sensitivity to Ambiguous CoreferenceCode0
SMDDH: Singleton Mention detection using Deep Learning in Hindi Text0
Ensemble Transfer Learning for Multilingual Coreference Resolution0
Hybrid Rule-Neural Coreference Resolution System based on Actor-Critic Learning0
Neural Coreference Resolution based on Reinforcement Learning0
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding SystemsCode0
Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study0
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE0
Show:102550
← PrevPage 4 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7SpanBERT + Cluster MergingAvg F180.2Unverified
8c2f + SpanBERT-LargeAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified