SOTAVerified

Language Modeling

Papers

Showing 51100 of 14182 papers

TitleStatusHype
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code ProcessingCode7
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language ModelsCode7
VITA: Towards Open-Source Interactive Omni Multimodal LLMCode7
Mixture-of-Agents Enhances Large Language Model CapabilitiesCode7
Scalable MatMul-free Language ModelingCode7
Adaptive In-conversation Team Building for Language Model AgentsCode7
Dynamic data sampler for cross-language transfer learning in large language modelsCode7
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese UnderstandingCode7
xLSTM: Extended Long Short-Term MemoryCode7
Labeling supervised fine-tuning data with the scaling lawCode7
Chronos: Learning the Language of Time SeriesCode7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsCode7
EAGLE: Speculative Sampling Requires Rethinking Feature UncertaintyCode7
VMamba: Visual State Space ModelCode7
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learningCode7
DSPy: Compiling Declarative Language Model Calls into Self-Improving PipelinesCode7
Neural Codec Language Models are Zero-Shot Text to Speech SynthesizersCode7
Elixir: Train a Large Language Model on a Small GPU ClusterCode7
AudioLM: a Language Modeling Approach to Audio GenerationCode7
SGLang: Efficient Execution of Structured Language Model ProgramsCode6
Mamba: Linear-Time Sequence Modeling with Selective State SpacesCode6
Mistral 7BCode6
NEFTune: Noisy Embeddings Improve Instruction FinetuningCode6
Qwen Technical ReportCode6
Efficient Memory Management for Large Language Model Serving with PagedAttentionCode6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across LanguagesCode6
FlashAttention-2: Faster Attention with Better Parallelism and Work PartitioningCode6
Extending Context Window of Large Language Models via Positional InterpolationCode6
FinGPT: Open-Source Financial Large Language ModelsCode6
Simple and Controllable Music GenerationCode6
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
Direct Preference Optimization: Your Language Model is Secretly a Reward ModelCode6
Gorilla: Large Language Model Connected with Massive APIsCode6
A Survey of Large Language ModelsCode6
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model SocietyCode6
A Watermark for Large Language ModelsCode6
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming LanguagesCode6
GLM-130B: An Open Bilingual Pre-trained ModelCode6
CodeGen: An Open Large Language Model for Code with Multi-Turn Program SynthesisCode6
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsCode6
Show-o2: Improved Native Unified Multimodal ModelsCode5
Trajectory Prediction Meets Large Language Models: A SurveyCode5
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to PosttrainingCode5
4th PVUW MeViS 3rd Place Report: Sa2VACode5
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement LearningCode5
InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music GenerationCode5
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training ParadigmsCode5
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge AdaptationCode5
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
Randomized Autoregressive Visual GenerationCode5
Show:102550
← PrevPage 2 of 284Next →

No leaderboard results yet.