SOTAVerified

Language Modeling

Papers

Showing 34013450 of 14182 papers

TitleStatusHype
MLorc: Momentum Low-rank Compression for Large Language Model Adaptation0
Hybrid AI for Responsive Multi-Turn Online Conversations with Novel Dynamic Routing and Feedback Adaptation0
Parameter Efficient Fine Tuning Llama 3.1 for Answering Arabic Legal Questions: A Case Study on Jordanian LawsCode0
Why Gradients Rapidly Increase Near the End of Training0
Self-Challenging Language Model Agents0
HouseTS: A Large-Scale, Multimodal Spatiotemporal U.S. Housing Dataset0
NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction0
EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG0
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document ParsingCode0
A Large Language Model-Supported Threat Modeling Framework for Transportation Cyber-Physical Systems0
Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation0
CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer0
Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model TranslationsCode0
Goal-Aware Identification and Rectification of Misinformation in Multi-Agent SystemsCode0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems0
Drop Dropout on Single-Epoch Language Model PretrainingCode0
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization0
Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization0
Probing the Robustness Properties of Neural Speech CodecsCode0
Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings0
Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering0
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction0
How much do language models memorize?0
Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking0
Interpreting Large Text-to-Image Diffusion Models with Dictionary LearningCode0
GradPower: Powering Gradients for Faster Language Model Pre-Training0
FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model EvaluationCode0
From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning0
TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data SynthesisCode0
Transformers Are Universally Consistent0
Intuitionistic Fuzzy Sets for Large Language Model Data Annotation: A Novel Approach to Side-by-Side Preference Labeling0
Circuit Stability Characterizes Language Model GeneralizationCode0
HardTests: Synthesizing High-Quality Test Cases for LLM Coding0
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and FinetuningCode0
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation0
MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform0
Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreementCode0
Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model0
SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference0
FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model CompressionCode0
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners0
An Empirical Study of Federated Prompt Learning for Vision Language Model0
Discriminative Policy Optimization for Token-Level Reward ModelsCode0
Beam-Guided Knowledge Replay for Knowledge-Rich Image Captioning using Vision-Language Model0
VLM-RRT: Vision Language Model Guided RRT Search for Autonomous UAV Navigation0
Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data0
Reducing Latency in LLM-Based Natural Language Commands Processing for Robot Navigation0
Spoken Language Modeling with Duration-Penalized Self-Supervised UnitsCode0
Learning Parametric Distributions from Samples and PreferencesCode0
Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training0
Show:102550
← PrevPage 69 of 284Next →

No leaderboard results yet.