SOTAVerified

Language Modeling

Papers

Showing 51-100 of 14,182 papers

Title | Status | Hype
Large Language Model Agent: A Survey on Methodology, Applications and Challenges | Code | 7
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Code | 7
From Bytes to Ideas: Language Modeling with Autoregressive U-Nets | Code | 7
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | Code | 7
MagicQuill: An Intelligent Interactive Image Editing System | Code | 7
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty | Code | 7
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought | Code | 7
Large Concept Models: Language Modeling in a Sentence Representation Space | Code | 7
Simulating 500 million years of evolution with a language model | Code | 7
Scaling Speech-Text Pre-training with Synthetic Interleaved Data | Code | 7
Dynamic data sampler for cross-language transfer learning in large language models | Code | 7
Chinese-Vicuna: A Chinese Instruction-following Llama-based Model | Code | 7
Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model | Code | 7
Tulu 3: Pushing Frontiers in Open Language Model Post-Training | Code | 7
Scalable MatMul-free Language Modeling | Code | 7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | Code | 7
Elixir: Train a Large Language Model on a Small GPU Cluster | Code | 7
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers | Code | 7
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Code | 7
Efficient Memory Management for Large Language Model Serving with PagedAttention | Code | 6
A Watermark for Large Language Models | Code | 6
Gorilla: Large Language Model Connected with Massive APIs | Code | 6
GLM-130B: An Open Bilingual Pre-trained Model | Code | 6
NEFTune: Noisy Embeddings Improve Instruction Finetuning | Code | 6
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration | Code | 6
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Code | 6
A Survey of Large Language Models | Code | 6
FinGPT: Open-Source Financial Large Language Models | Code | 6
Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6
Extending Context Window of Large Language Models via Positional Interpolation | Code | 6
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Code | 6
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages | Code | 6
SGLang: Efficient Execution of Structured Language Model Programs | Code | 6
Simple and Controllable Music Generation | Code | 6
CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Code | 6
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | Code | 6
Qwen Technical Report | Code | 6
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Code | 6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Code | 6
Mistral 7B | Code | 6
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model | Code | 5
Ovis: Structural Embedding Alignment for Multimodal Large Language Model | Code | 5
InstructPix2Pix: Learning to Follow Image Editing Instructions | Code | 5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | Code | 5
CogVLM: Visual Expert for Pretrained Language Models | Code | 5
MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Code | 5
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation | Code | 5
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Code | 5
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Code | 5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Code | 5

No leaderboard results yet.