SOTAVerified

Sentence

Papers

Showing 801–825 of 10,752 papers

Title | Status | Hype
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue | - | 0
Take its Essence, Discard its Dross! Debiasing for Toxic Language Detection via Counterfactual Causal Effect | Code | 0
What Are Large Language Models Mapping to in the Brain? A Case Against Over-Reliance on Brain Scores | Code | 0
Hybrid-Learning Video Moment Retrieval across Multi-Domain Labels | - | 0
The Power of Summary-Source Alignments | Code | 0
Evaluating Distributed Representations for Multi-Level Lexical Semantics: A Research Proposal | - | 0
Diversifying Query: Region-Guided Transformer for Temporal Sentence Grounding | Code | 0
Reward-based Input Construction for Cross-document Relation Extraction | Code | 1
Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization | Code | 1
PGA-SciRE: Harnessing LLM on Data Augmentation for Enhancing Scientific Relation Extraction | - | 0
AutoBreach: Universal and Adaptive Jailbreaking with Efficient Wordplay-Guided Optimization | - | 0
Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification | Code | 0
MTEB-French: Resources for French Sentence Embedding Evaluation and Analysis | - | 0
ROAST: Review-level Opinion Aspect Sentiment Target Joint Detection for ABSA | Code | 0
ANAH: Analytical Annotation of Hallucinations in Large Language Models | Code | 2
Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals | Code | 0
Contextual Position Encoding: Learning to Count What's Important | - | 0
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification | - | 0
Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective | - | 0
Faithful Chart Summarization with ChaTS-Pi | - | 0
Active Use of Latent Constituency Representation in both Humans and Large Language Models | Code | 0
Edinburgh Clinical NLP at MEDIQA-CORR 2024: Guiding Large Language Models with Hints | - | 0
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence | - | 0
The Multi-Range Theory of Translation Quality Measurement: MQM scoring models and Statistical Quality Control | - | 0
iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers | Code | 0

No leaderboard results yet.