SOTAVerified

Sentence

Papers

Showing 15511600 of 10752 papers

TitleStatusHype
HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection0
Improving Retrieval-Augmented Neural Machine Translation with Monolingual Data0
The Distribution of Dependency Distance and Hierarchical Distance in Contemporary Written Japanese and Its Influencing Factors0
20min-XD: A Comparable Corpus of Swiss News ArticlesCode0
Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues0
Do Large Language Models know who did what to whom?0
Information Leakage of Sentence Embeddings via Generative Embedding Inversion AttacksCode0
FinTextSim: Enhancing Financial Text Analysis with BERTopic0
Describe Anything: Detailed Localized Image and Video Captioning0
FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender BinarityCode0
Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends0
sEEG-based Encoding for Sentence Retrieval: A Contrastive Learning Approach to Brain-Language Alignment0
Automatic Text Summarization (ATS) for Research Documents in Sorani Kurdish0
Disentangling Linguistic Features with Dimension-Wise Analysis of Vector Embeddings0
A Baseline for Self-state Identification and Classification in Mental Health Data: CLPsych 2025 Task0
ViClaim: A Multilingual Multilabel Dataset for Automatic Claim Detection in Videos0
Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models0
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation0
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation0
An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation0
SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data0
Dynamik: Syntactically-Driven Dynamic Font Sizing for Emphasis of Key InformationCode0
LLMs Can Achieve High-quality Simultaneous Machine Translation as Efficiently as OfflineCode0
Do LLMs Understand Your Translations? Evaluating Paragraph-level MT with Question AnsweringCode0
RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News TextsCode0
Topic mining based on fine-tuning Sentence-BERT and LDA0
DoCIA: An Online Document-Level Context Incorporation Agent for Speech TranslationCode0
Generative AI Enhanced Financial Risk Management Information RetrievalCode0
Coarse-to-Fine Semantic Communication Systems for Text Transmission0
TransforMerger: Transformer-based Voice-Gesture Fusion for Robust Human-Robot Communication0
Foundations and Evaluations in NLP0
SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching0
An extension of linear self-attention for in-context learning0
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery0
A Framework for Lightweight Responsible Prompting Recommendation0
MultiClaimNet: A Massively Multilingual Dataset of Fact-Checked Claim Clusters0
Beyond Single-Sentence Prompts: Upgrading Value Alignment Benchmarks with Dialogues and Stories0
Negation: A Pink Elephant in the Large Language Models' Room?0
Generating Synthetic Oracle Datasets to Analyze Noise Impact: A Study on Building Function Classification Using Tweets0
A Retrieval-Based Approach to Medical Procedure Matching in Romanian0
BeLightRec: A lightweight recommender system enhanced with BERT0
BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation0
Beyond Relevance: An Adaptive Exploration-Based Framework for Personalized Recommendations0
SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings0
Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions0
Collaborative Temporal Consistency Learning for Point-supervised Natural Language Video Localization0
CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement0
Leveraging Human Production-Interpretation Asymmetries to Test LLM Cognitive Plausibility0
FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs0
TROVE: A Challenge for Fine-Grained Text Provenance via Source Sentence Tracing and Relationship Classification0
Show:102550
← PrevPage 32 of 216Next →

No leaderboard results yet.