SOTAVerified

Language Modeling

Papers

Showing 71-80 of 14182 papers

Title | Status | Hype
Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Code | 6
Mistral 7B | Code | 6
NEFTune: Noisy Embeddings Improve Instruction Finetuning | Code | 6
Qwen Technical Report | Code | 6
Efficient Memory Management for Large Language Model Serving with PagedAttention | Code | 6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Code | 6
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Code | 6
Extending Context Window of Large Language Models via Positional Interpolation | Code | 6
FinGPT: Open-Source Financial Large Language Models | Code | 6
Simple and Controllable Music Generation | Code | 6

No leaderboard results yet.