SOTAVerified

Sentence

Papers

Showing 1451-1500 of 10752 papers

Title | Status | Hype
Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features | | 0
Advancing STT for Low-Resource Real-World Speech | | 0
Factors affecting the in-context learning abilities of LLMs for dialogue state tracking | | 0
Multimodal Representation Alignment for Cross-modal Information Retrieval | | 0
Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU | | 0
Neural Responses to Affective Sentences Reveal Signatures of Depression | | 0
ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT | | 0
Static Word Embeddings for Sentence Semantic Representation | | 0
A MISMATCHED Benchmark for Scientific Natural Language Inference | Code | 0
SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages? | | 0
Mechanistic Decomposition of Sentence Representations | | 0
Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark | | 0
A Statistical Physics of Language Model Reasoning | | 0
IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator | | 0
The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing | Code | 0
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching | | 0
EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | | 0
Efficient Text Encoders for Labor Market Analysis | | 0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | Code | 0
MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering | | 0
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport | Code | 0
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs | | 0
Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings | | 0
NegVQA: Can Vision Language Models Understand Negation? | | 0
StressTest: Can YOUR Speech LM Handle the Stress? | | 0
Towards Robust Assessment of Pathological Voices via Combined Low-Level Descriptors and Foundation Model Representations | | 0
Causal Distillation: Transferring Structured Explanations from Large to Compact Language Models | | 0
Token-level Accept or Reject: A Micro Alignment Approach for Large Language Models | Code | 0
Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling | | 0
Inconsistent Tokenizations Cause Language Models to be Perplexed by Japanese Grammar | | 0
MVP: Multi-source Voice Pathology detection | Code | 0
Analyzing Political Bias in LLMs via Target-Oriented Sentiment Classification | | 0
SCIRGC: Multi-Granularity Citation Recommendation and Citation Sentence Preference Alignment | | 0
Research on feature fusion and multimodal patent text based on graph attention network | | 0
Dependency Parsing is More Parameter-Efficient with Normalization | | 0
Model Enumeration of Two-Variable Logic with Quadratic Delay Complexity | Code | 0
AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection | Code | 0
Rethinking the Understanding Ability across LLMs through Mutual Information | | 0
Learning to Explain: Prototype-Based Surrogate Models for LLM Classification | | 0
Unveiling Dual Quality in Product Reviews: An NLP-Based Approach | | 0
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection | | 0
ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning | | 0
Building a Functional Machine Translation Corpus for Kpelle | | 0
MedScore: Factuality Evaluation of Free-Form Medical Answers | Code | 0
PD^3: A Project Duplication Detection Framework via Adapted Multi-Agent Debate | | 0
Multi-Scale Probabilistic Generation Theory: A Hierarchical Framework for Interpreting Large Language Models | | 0
Memorization or Reasoning? Exploring the Idiom Understanding of LLMs | | 0
A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP | Code | 0
SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning | | 0
LLMs Are Not Scorers: Rethinking MT Evaluation with Generation-Based Methods | Code | 0
