SOTAVerified

Language Modeling

Papers

Showing 3451-3500 of 14182 papers

| Title | Status | Hype |
|---|---|---|
| Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training | | 0 |
| Position: Federated Foundation Language Model Post-Training Should Focus on Open-Source Models | | 0 |
| Reducing Latency in LLM-Based Natural Language Commands Processing for Robot Navigation | | 0 |
| Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | | 0 |
| Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | Code | 0 |
| ATLAS: Learning to Optimally Memorize the Context at Test Time | | 0 |
| Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model | | 0 |
| TrackVLA: Embodied Visual Tracking in the Wild | | 0 |
| SeG-SR: Integrating Semantic Knowledge into Remote Sensing Image Super-Resolution via Vision-Language Model | Code | 0 |
| Actor-Critic based Online Data Mixing For Language Model Pre-Training | | 0 |
| SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference | | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | | 0 |
| FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression | Code | 0 |
| Large Language Model Meets Constraint Propagation | | 0 |
| CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model Agents | Code | 0 |
| BugWhisperer: Fine-Tuning LLMs for SoC Hardware Vulnerability Detection | | 0 |
| ICH-Qwen: A Large Language Model Towards Chinese Intangible Cultural Heritage | | 0 |
| BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL | | 0 |
| LLM-ODDR: A Large Language Model Framework for Joint Order Dispatching and Driver Repositioning | | 0 |
| Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | | 0 |
| Speech as a Multimodal Digital Phenotype for Multi-Task LLM-based Mental Health Prediction | | 0 |
| EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles | | 0 |
| Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image Generation | Code | 0 |
| Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment | | 0 |
| Conversational Alignment with Artificial Intelligence in Context | | 0 |
| GateNLP at SemEval-2025 Task 10: Hierarchical Three-Step Prompting for Multilingual Narrative Classification | Code | 0 |
| Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems | | 0 |
| A Tool for Generating Exceptional Behavior Tests With Large Language Models | Code | 0 |
| NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding | | 0 |
| A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems | | 0 |
| CLUE: Neural Networks Calibration via Learning Uncertainty-Error alignment | | 0 |
| VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models | | 0 |
| Improving Brain-to-Image Reconstruction via Fine-Grained Text Bridging | | 0 |
| Incorporating LLMs for Large-Scale Urban Complex Mobility Simulation | | 0 |
| CFP-Gen: Combinatorial Functional Protein Generation via Diffusion Language Models | Code | 0 |
| 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model | | 0 |
| StreamLink: Large-Language-Model Driven Distributed Data Engineering System | | 0 |
| Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones | Code | 0 |
| Creativity in LLM-based Multi-Agent Systems: A Survey | | 0 |
| Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion | | 0 |
| HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling | | 0 |
| Automated Privacy Information Annotation in Large Language Model Interactions | Code | 0 |
| The Multilingual Divide and Its Impact on Global AI Safety | | 0 |
| Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework | | 0 |
| A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction | | 0 |
| PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective | | 0 |
| ChemHAS: Hierarchical Agent Stacking for Enhancing Chemistry Tools | | 0 |
| Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective | | 0 |
| What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models | | 0 |
| Ankh3: Multi-Task Pretraining with Sequence Denoising and Completion Enhances Protein Representations | | 0 |

No leaderboard results yet.