SOTAVerified

Coreference Resolution

Papers

Showing 801850 of 880 papers

TitleStatusHype
COMET-M: Reasoning about Multiple Events in Complex SentencesCode0
A Unified Approach to Entity-Centric Context Tracking in Social ConversationsCode0
A Tidy Data Model for Natural Language Processing using cleanNLPCode0
How Reasonable are Common-Sense Reasoning Tasks: A Case-Study on the Winograd Schema Challenge and SWAGCode0
Improving Span Representation for Domain-adapted Coreference ResolutionCode0
Incorporating Centering Theory into Neural Coreference ResolutionCode0
Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic StructuresCode0
Incorporating Context and External Knowledge for Pronoun Coreference ResolutionCode0
Variation in Coreference Strategies across Genres and Production MediaCode0
OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More GenresCode0
Incorporating Singletons and Mention-based Features in Coreference Resolution via Multi-task Learning for Better GeneralizationCode0
OpenEL: An Annotated Corpus for Entity Linking and Discourse in Open Domain DialogueCode0
Dialogue Meaning Representation for Task-Oriented Dialogue SystemsCode0
Deep Reinforcement Learning for Mention-Ranking Coreference ModelsCode0
SDNet: Contextualized Attention-based Deep Network for Conversational Question AnsweringCode0
AmbiCoref: Evaluating Human and Model Sensitivity to Ambiguous CoreferenceCode0
A Causal Inference Method for Reducing Gender Bias in Word Embedding RelationsCode0
Collecting Visually-Grounded Dialogue with A Game Of SortsCode0
Second Order WinoBias (SoWinoBias) Test Set for Latent Gender Bias Detection in Coreference ResolutionCode0
Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling PerspectiveCode0
Investigating Multilingual Coreference Resolution by Universal AnnotationsCode0
Paraphrasing vs Coreferring: Two Sides of the Same CoinCode0
ParCorFull2.0: a Parallel Corpus Annotated with Full CoreferenceCode0
Adapting Coreference Resolution Models through Active LearningCode0
PARMA: A Predicate Argument AlignerCode0
WikiCREM: A Large Unsupervised Corpus for Coreference ResolutionCode0
Triad-based Neural Network for Coreference ResolutionCode0
The Gap on GAP: Tackling the Problem of Differing Data Distributions in Bias-Measuring DatasetsCode0
J2N -- Nominal Adjective Identification and its ApplicationCode0
Jigg: A Framework for an Easy Natural Language Processing PipelineCode0
PCR4ALL: A Comprehensive Evaluation Benchmark for Pronoun Coreference Resolution in EnglishCode0
Joint Coreference Resolution and Character Linking for Multiparty ConversationCode0
PDFAnno: a Web-based Linguistic Annotation Tool for PDF DocumentsCode0
Gendered Ambiguous Pronouns Shared Task: Boosting Model Confidence by Evidence PoolingCode0
The Knowref Coreference Corpus: Removing Gender and Number Cues for Difficult Pronominal Anaphora ResolutionCode0
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual DialogCode0
PeTra: A Sparsely Supervised Memory Model for People TrackingCode0
Visual Coreference Resolution in Visual Dialog using Neural Module NetworksCode0
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding SystemsCode0
Twitter at the Grammys: A Social Media Corpus for Entity Linking and DisambiguationCode0
Placing (Historical) Facts on a Timeline: A Classification cum Coref Resolution ApproachCode0
Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary DetectionCode0
Joint Semantic Analysis with Document-Level Cross-Task Coherence RewardsCode0
Knowledge-aware Pronoun Coreference ResolutionCode0
Data-driven Coreference-based Ontology BuildingCode0
Semi-supervised multimodal coreference resolution in image narrationsCode0
The MultiBERTs: BERT Reproductions for Robustness AnalysisCode0
Predicate Argument Alignment using a Global Coherence ModelCode0
KoCoNovel: Annotated Dataset of Character Coreference in Korean NovelsCode0
Sentence-Incremental Neural Coreference ResolutionCode0
Show:102550
← PrevPage 17 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaLM 540B (fine-tuned)Accuracy100Unverified
2Vega v2 6B (KD-based prompt transfer)Accuracy98.6Unverified
3UL2 20B (fine-tuned)Accuracy98.1Unverified
4Turing NLR v5 XXL 5.4B (fine-tuned)Accuracy97.3Unverified
5ST-MoE-32B 269B (fine-tuned)Accuracy96.6Unverified
6DeBERTa-1.5BAccuracy95.9Unverified
7T5-XXL 11B (fine-tuned)Accuracy93.8Unverified
8ST-MoE-L 4.1B (fine-tuned)Accuracy93.3Unverified
9RoBERTa-WinoGrande 355MAccuracy90.1Unverified
10Flan-T5 XXL (zero -shot)Accuracy89.82Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF183.6Unverified
2seq2seqF183.3Unverified
3ASP+T0-3BF182.3Unverified
4caw-coref + RoBERTaF181.6Unverified
5LingMessF181.4Unverified
6wl-coref + RoBERTaF181Unverified
7U-MEM + LongformerF180.9Unverified
8longdoc S (OntoNotes + 60k pseudo-singletons)F180.6Unverified
9G2GT SpanBERT-large reducedF180.5Unverified
10G2GT SpanBERT-large overlapF180.2Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesAvg F183.6Unverified
2seq2seqAvg F183.3Unverified
3CorefQA + SpanBERT-largeAvg F183.1Unverified
4ASP+T0-3BAvg F182.3Unverified
5wl-coref + RoBERTaAvg F181Unverified
6s2e + Longformer-LargeAvg F180.3Unverified
7SpanBERT + Cluster MergingAvg F180.2Unverified
8c2f + SpanBERT-LargeAvg F180.2Unverified
9CorefQA + SpanBERT-baseAvg F179.9Unverified
10U-MEM* + SpanBERT-largeAvg F179.6Unverified
#ModelMetricClaimedVerifiedStatus
1Coref-MTLOverall F192.72Unverified
2ProBERTOverall F192.5Unverified
3Maverick_incrOverall F191.2Unverified
4Full EnsembleOverall F190.2Unverified
5PeTraF185.3Unverified
#ModelMetricClaimedVerifiedStatus
1REXELAvg. F195.12Unverified
2JointAvg. F191.6Unverified
3KB-bothAvg. F191.5Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_mesF166.8Unverified
2longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)F162.5Unverified
3longdoc S (OntoNotes + PreCo + LitBank)F160.3Unverified
#ModelMetricClaimedVerifiedStatus
1DeepStruct multi-task w/ finetuneAverage F173.1Unverified
2DeepStruct multi-taskAverage F160.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrAvg F178.3Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F178.2Unverified
#ModelMetricClaimedVerifiedStatus
1MTL-corefAvg F168.2Unverified
2SpanBERTAvg F164.6Unverified
#ModelMetricClaimedVerifiedStatus
1Maverick_incrF188Unverified
2longdoc S (OntoNotes + PreCo + LitBank)F187.6Unverified
#ModelMetricClaimedVerifiedStatus
1BFCR + SpanBERT + Transfer LearningCoNLL F161.4Unverified
2BFCR + SpanBERTCoNLL F150.4Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy81.29Unverified
2BLOOMZAccuracy69.08Unverified
#ModelMetricClaimedVerifiedStatus
1mT0-13BAccuracy78.31Unverified
2BLOOMZAccuracy68.67Unverified
#ModelMetricClaimedVerifiedStatus
1longdoc S (OntoNotes + PreCo + LitBank)F142.9Unverified
#ModelMetricClaimedVerifiedStatus
1dali-full-anaphoraAvg F177.9Unverified