SOTAVerified

Sentence

Papers

Showing 1-50 of 10752 papers

Title | Status | Hype
From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment | - | 0
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts | - | 0
Mitigating Object Hallucinations via Sentence-Level Early Intervention | Code | 1
AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles | Code | 0
Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation | Code | 0
Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis | - | 0
FinAI-BERT: A Transformer-Based Model for Sentence-Level Detection of AI Disclosures in Financial Reports | Code | 0
DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning | - | 0
skLEP: A Slovak General Language Understanding Benchmark | Code | 0
Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval | - | 0
Predicting Readiness to Engage in Psychotherapy of People with Chronic Pain Based on their Pain-Related Narratives Saar | - | 0
Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation | - | 0
Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization | - | 0
Dense Video Captioning using Graph-based Sentence Summarization | - | 0
Semantic similarity estimation for domain specific data using BERT and other techniques | - | 0
Thought Anchors: Which LLM Reasoning Steps Matter? | Code | 2
End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data | - | 0
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Code | 2
PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction | - | 0
Thunder-NUBench: A Benchmark for LLMs' Sentence-Level Negation Understanding | - | 0
Towards Fairness Assessment of Dutch Hate Speech Detection | - | 0
A Hybrid Multi-Agent Prompting Approach for Simplifying Complex Sentences | - | 0
StepProof: Step-by-step verification of natural language mathematical proofs | Code | 0
NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors | Code | 0
Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences? | Code | 0
Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering | - | 0
Factors affecting the in-context learning abilities of LLMs for dialogue state tracking | - | 0
Multimodal Representation Alignment for Cross-modal Information Retrieval | - | 0
Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features | - | 0
Advancing STT for Low-Resource Real-World Speech | - | 0
Swiss Parliaments Corpus Re-Imagined (SPC_R): Enhanced Transcription with RAG-based Correction and Predicted BLEU | - | 0
Neural Responses to Affective Sentences Reveal Signatures of Depression | - | 0
ConECT Dataset: Overcoming Data Scarcity in Context-Aware E-Commerce MT | - | 0
Static Word Embeddings for Sentence Semantic Representation | - | 0
SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages? | - | 0
A MISMATCHED Benchmark for Scientific Natural Language Inference | Code | 0
Mechanistic Decomposition of Sentence Representations | - | 0
A Statistical Physics of Language Model Reasoning | - | 0
Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark | - | 0
TokAlign: Efficient Vocabulary Adaptation via Token Alignment | Code | 1
IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator | - | 0
The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing | Code | 0
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching | - | 0
EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG | - | 0
Efficient Text Encoders for Labor Market Analysis | - | 0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | Code | 0
MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering | - | 0
Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective | Code | 1
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs | - | 0
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport | Code | 0

No leaderboard results yet.