SOTAVerified

Language Modeling

Papers

Showing 5175 of 14182 papers

TitleStatusHype
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code ProcessingCode7
VITA: Towards Open-Source Interactive Omni Multimodal LLMCode7
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language ModelsCode7
Mixture-of-Agents Enhances Large Language Model CapabilitiesCode7
Scalable MatMul-free Language ModelingCode7
Adaptive In-conversation Team Building for Language Model AgentsCode7
Dynamic data sampler for cross-language transfer learning in large language modelsCode7
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese UnderstandingCode7
xLSTM: Extended Long Short-Term MemoryCode7
Labeling supervised fine-tuning data with the scaling lawCode7
Chronos: Learning the Language of Time SeriesCode7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsCode7
EAGLE: Speculative Sampling Requires Rethinking Feature UncertaintyCode7
VMamba: Visual State Space ModelCode7
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learningCode7
DSPy: Compiling Declarative Language Model Calls into Self-Improving PipelinesCode7
Neural Codec Language Models are Zero-Shot Text to Speech SynthesizersCode7
Elixir: Train a Large Language Model on a Small GPU ClusterCode7
AudioLM: a Language Modeling Approach to Audio GenerationCode7
SGLang: Efficient Execution of Structured Language Model ProgramsCode6
Mamba: Linear-Time Sequence Modeling with Selective State SpacesCode6
Mistral 7BCode6
NEFTune: Noisy Embeddings Improve Instruction FinetuningCode6
Qwen Technical ReportCode6
Efficient Memory Management for Large Language Model Serving with PagedAttentionCode6
Show:102550
← PrevPage 3 of 568Next →

No leaderboard results yet.