SOTAVerified

Language Modeling

Papers

Showing 101150 of 14182 papers

TitleStatusHype
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter ExpertsCode5
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language TasksCode5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUsCode5
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language ModelingCode5
Efficient Streaming Language Models with Attention SinksCode5
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to PosttrainingCode5
Fast Inference from Transformers via Speculative DecodingCode5
MarS: a Financial Market Simulation Engine Powered by Generative Foundation ModelCode5
Trajectory Prediction Meets Large Language Models: A SurveyCode5
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative DecodingCode5
Weakly Supervised Detection of Hallucinations in LLM ActivationsCode5
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining ResearchCode5
StarVector: Generating Scalable Vector Graphics Code from Images and TextCode5
DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZCode5
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language ModelingCode5
SpeechAlign: Aligning Speech Generation to Human PreferencesCode5
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music GenerationCode5
Large Language Model based Multi-Agents: A Survey of Progress and ChallengesCode5
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal AttentionCode5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue AbilitiesCode5
The Rise and Potential of Large Language Model Based Agents: A SurveyCode5
LAB: Large-Scale Alignment for ChatBotsCode5
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-ExpertsCode5
Rethinking LLM Language Adaptation: A Case Study on Chinese MixtralCode5
InstructPix2Pix: Learning to Follow Image Editing InstructionsCode5
KBLaM: Knowledge Base augmented Language ModelCode5
Repetition Improves Language Model EmbeddingsCode5
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge AdaptationCode5
Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec modelsCode5
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and BeyondCode5
CodeGen2: Lessons for Training LLMs on Programming and Natural LanguagesCode5
4th PVUW MeViS 3rd Place Report: Sa2VACode5
Assessing Language Model Deployment with Risk CardsCode5
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement LearningCode5
CogAgent: A Visual Language Model for GUI AgentsCode5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPUCode5
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language ModelCode5
Ovis: Structural Embedding Alignment for Multimodal Large Language ModelCode5
Improving Text-To-Audio Models with Synthetic CaptionsCode5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language ModelsCode5
Randomized Autoregressive Visual GenerationCode5
Show-o2: Improved Native Unified Multimodal ModelsCode5
N-Grammer: Augmenting Transformers with latent n-gramsCode4
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?Code4
Galactica: A Large Language Model for ScienceCode4
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel ObjectsCode4
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement LearningCode4
Flamingo: a Visual Language Model for Few-Shot LearningCode4
Show:102550
← PrevPage 3 of 284Next →

No leaderboard results yet.