SOTAVerified

Language Modeling

Papers

Showing 16761700 of 14182 papers

TitleStatusHype
Auditing Prompt Caching in Language Model APIsCode0
Implicit Language Models are RNNs: Balancing Parallelization and ExpressivityCode1
AppVLM: A Lightweight Vision Language Model for Online App Control0
Steel-LLM:From Scratch to Open Source -- A Personal Journey in Building a Chinese-Centric LLMCode4
K-ON: Stacking Knowledge On the Head Layer of Large Language Model0
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought TemplatesCode4
Recent Advances in Discrete Speech Tokens: A Review0
Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation0
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation LearningCode1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoECode1
Rationalization Models for Text-to-SQL0
μnit Scaling: Simple and Scalable FP8 LLM Training0
HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language ModelsCode0
Investigating Compositional Reasoning in Time Series Foundation Models0
Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform0
Effective Black-Box Multi-Faceted Attacks Breach Vision Large Language Model Guardrails0
Enabling Autoregressive Models to Fill In Masked Tokens0
Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education0
Certifying Language Model Robustness with Fuzzed Randomized Smoothing: An Efficient Defense Against Backdoor Attacks0
ScaffoldGPT: A Scaffold-based GPT Model for Drug Optimization0
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot ControlCode1
RECOVER: Designing a Large Language Model-based Remote Patient Monitoring System for Postoperative Gastrointestinal Cancer Care0
UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and UnderstandingCode1
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech SystemCode11
Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging0
Show:102550
← PrevPage 68 of 568Next →

No leaderboard results yet.