SOTAVerified

Large Language Model

Papers

Showing 1-50 of 6097 papers

Title | Status | Hype
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria Reranking | Code | 13
Autonomous Agents for Collaborative Task under Information Asymmetry | Code | 13
Zep: A Temporal Knowledge Graph Architecture for Agent Memory | Code | 12
OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation | Code | 11
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | Code | 11
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System | Code | 11
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation | Code | 11
HybridFlow: A Flexible and Efficient RLHF Framework | Code | 11
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens | Code | 11
Scaling Synthetic Data Creation with 1,000,000,000 Personas | Code | 11
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence | Code | 11
MiniCPM4: Ultra-Efficient LLMs on End Devices | Code | 9
SkyReels-V2: Infinite-length Film Generative Model | Code | 9
AutoAgent: A Fully-Automated and Zero-Code Framework for LLM Agents | Code | 9
Moshi: a speech-text foundation model for real-time dialogue | Code | 9
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Code | 9
PowerInfer-2: Fast Large Language Model Inference on a Smartphone | Code | 9
LawGPT: A Chinese Legal Knowledge-Enhanced Large Language Model | Code | 9
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion | Code | 9
FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models | Code | 9
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models | Code | 9
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Code | 9
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Code | 9
Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition | Code | 8
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning | Code | 7
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development | Code | 7
Large Language Model Agent: A Survey on Methodology, Applications and Challenges | Code | 7
Qwen2.5-Omni Technical Report | Code | 7
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! | Code | 7
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation | Code | 7
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Code | 7
OASIS: Open Agent Social Interaction Simulations with One Million Agents | Code | 7
MagicQuill: An Intelligent Interactive Image Editing System | Code | 7
AutoTrain: No-code training for state-of-the-art models | Code | 7
aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Code | 7
mPLUG-Owl3: Towards Long Image-Sequence Understanding in Multi-Modal Large Language Models | Code | 7
VITA: Towards Open-Source Interactive Omni Multimodal LLM | Code | 7
Mixture-of-Agents Enhances Large Language Model Capabilities | Code | 7
Adaptive In-conversation Team Building for Language Model Agents | Code | 7
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | Code | 7
Labeling supervised fine-tuning data with the scaling law | Code | 7
SoftTiger: A Clinical Foundation Model for Healthcare Workflows | Code | 7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | Code | 7
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning | Code | 7
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models | Code | 7
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena | Code | 7
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models | Code | 7
Elixir: Train a Large Language Model on a Small GPU Cluster | Code | 7
Qwen Technical Report | Code | 6
Efficient Memory Management for Large Language Model Serving with PagedAttention | Code | 6

No leaderboard results yet.