SOTAVerified

Language Modeling

Papers

Showing 26-50 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model | Code | 9 |
| Visually Descriptive Language Model for Vector Graphics Reasoning | Code | 9 |
| CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion | Code | 9 |
| LawGPT: A Chinese Legal Knowledge-Enhanced Large Language Model | Code | 9 |
| DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence | Code | 9 |
| Arcee's MergeKit: A Toolkit for Merging Large Language Models | Code | 9 |
| Yi: Open Foundation Models by 01.AI | Code | 9 |
| Language agents achieve superhuman synthesis of scientific knowledge | Code | 9 |
| Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models | Code | 9 |
| Perception Encoder: The best visual embeddings are not at the output of the network | Code | 8 |
| Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition | Code | 8 |
| Large Language Model Agent: A Survey on Methodology, Applications and Challenges | Code | 7 |
| AutoTrain: No-code training for state-of-the-art models | Code | 7 |
| AudioLM: a Language Modeling Approach to Audio Generation | Code | 7 |
| Chronos: Learning the Language of Time Series | Code | 7 |
| MagicQuill: An Intelligent Interactive Image Editing System | Code | 7 |
| FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Code | 7 |
| Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model | Code | 7 |
| PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation | Code | 7 |
| Dynamic data sampler for cross-language transfer learning in large language models | Code | 7 |
| DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines | Code | 7 |
| EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty | Code | 7 |
| Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers | Code | 7 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Code | 7 |
| Labeling supervised fine-tuning data with the scaling law | Code | 7 |

No leaderboard results yet.