SOTAVerified

Language Modeling

Papers

Showing 36013650 of 14182 papers

TitleStatusHype
DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning0
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response TheoryCode0
CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment0
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation0
Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions0
Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model0
Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning0
Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning0
Ensembling Sparse Autoencoders0
Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition0
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language ModelCode0
ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMsCode0
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model EditingCode0
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors0
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling0
Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution0
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring0
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives0
Rank-K: Test-Time Reasoning for Listwise RerankingCode0
Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency0
Exploring Graph Representations of Logical Forms for Language ModelingCode0
CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing0
FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation0
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation0
Automated Journalistic Questions: A New Method for Extracting 5W1H in French0
Too Long, Didn't Model: Decomposing LLM Long-Context Understanding With NovelsCode0
sudoLLM : On Multi-role Alignment of Language Models0
TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring0
Structured Agent Distillation for Large Language Model0
Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising0
MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow0
Improve Language Model and Brain Alignment via Associative MemoryCode0
Large Language Model-Driven Distributed Integrated Multimodal Sensing and Semantic Communications0
TinyAlign: Boosting Lightweight Vision-Language Models by Mitigating Modal Alignment Bottlenecks0
A Physics-Inspired Optimizer: Velocity Regularized Adam0
SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models0
VocalAgent: Large Language Models for Vocal Health Diagnostics with Safety-Aware Evaluation0
A*-Decoding: Token-Efficient Inference Scaling0
VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection0
ReSW-VL: Representation Learning for Surgical Workflow Analysis Using Vision-Language Model0
Sat2Sound: A Unified Framework for Zero-Shot Soundscape Mapping0
Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation0
Structure-Aware Corpus Construction and User-Perception-Aligned Metrics for Large-Language-Model Code Completion0
CMLFormer: A Dual Decoder Transformer with Switching Point Learning for Code-Mixed Language Modeling0
Krikri: Advancing Open Large Language Models for Greek0
SpatialLLM: From Multi-modality Data to Urban Spatial IntelligenceCode0
Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice0
IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment0
Show:102550
← PrevPage 73 of 284Next →

No leaderboard results yet.