SOTAVerified

Language Modeling

Papers

Showing 51-100 of 14182 papers

Title | Status | Hype
VITA: Towards Open-Source Interactive Omni Multimodal LLM | Code | 7
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers | Code | 7
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets | Code | 7
Tulu 3: Pushing Frontiers in Open Language Model Post-Training | Code | 7
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Code | 7
Mixture-of-Agents Enhances Large Language Model Capabilities | Code | 7
MagicQuill: An Intelligent Interactive Image Editing System | Code | 7
Simulating 500 million years of evolution with a language model | Code | 7
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought | Code | 7
Large Language Model Agent: A Survey on Methodology, Applications and Challenges | Code | 7
Dynamic data sampler for cross-language transfer learning in large language models | Code | 7
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty | Code | 7
Elixir: Train a Large Language Model on a Small GPU Cluster | Code | 7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | Code | 7
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Code | 7
Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Code | 7
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines | Code | 7
Chinese-Vicuna: A Chinese Instruction-following Llama-based Model | Code | 7
Scalable MatMul-free Language Modeling | Code | 7
Efficient Memory Management for Large Language Model Serving with PagedAttention | Code | 6
A Watermark for Large Language Models | Code | 6
Gorilla: Large Language Model Connected with Massive APIs | Code | 6
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Code | 6
GLM-130B: An Open Bilingual Pre-trained Model | Code | 6
NEFTune: Noisy Embeddings Improve Instruction Finetuning | Code | 6
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Code | 6
A Survey of Large Language Models | Code | 6
Mistral 7B | Code | 6
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Code | 6
FinGPT: Open-Source Financial Large Language Models | Code | 6
Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Code | 6
Extending Context Window of Large Language Models via Positional Interpolation | Code | 6
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Code | 6
Simple and Controllable Music Generation | Code | 6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Code | 6
SGLang: Efficient Execution of Structured Language Model Programs | Code | 6
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | Code | 6
Qwen Technical Report | Code | 6
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Code | 6
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model | Code | 5
Ovis: Structural Embedding Alignment for Multimodal Large Language Model | Code | 5
InstructPix2Pix: Learning to Follow Image Editing Instructions | Code | 5
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Code | 5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | Code | 5
CogVLM: Visual Expert for Pretrained Language Models | Code | 5
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation | Code | 5
MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Code | 5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Code | 5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Code | 5

No leaderboard results yet.