SOTAVerified

Language Modeling

Papers

Showing 301350 of 14182 papers

TitleStatusHype
PaliGemma 2: A Family of Versatile VLMs for TransferCode3
OVLW-DETR: Open-Vocabulary Light-Weighted Detection TransformerCode3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent SystemsCode3
Partially Rewriting a Transformer in Natural LanguageCode3
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at ScaleCode3
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language ModelsCode3
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video GenerationCode3
Order Matters: Sequence to sequence for setsCode3
On the Efficiency of NLP-Inspired Methods for Tabular Deep LearningCode3
DPLM-2: A Multimodal Diffusion Protein Language ModelCode3
OpenGraph: Towards Open Graph Foundation ModelsCode3
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive SurveyCode3
Discovering Language Model Behaviors with Model-Written EvaluationsCode3
Multi-objective Asynchronous Successive HalvingCode3
MultiModal-GPT: A Vision and Language Model for Dialogue with HumansCode3
GLM: General Language Model Pretraining with Autoregressive Blank InfillingCode3
Multimodal Table UnderstandingCode3
Diffusion-LM Improves Controllable Text GenerationCode3
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language ModelsCode3
Multi-agent Architecture Search via Agentic SupernetCode3
MoMA: Multimodal LLM Adapter for Fast Personalized Image GenerationCode3
MotionGPT: Human Motion as a Foreign LanguageCode3
OceanGPT: A Large Language Model for Ocean Science TasksCode3
Deep Learning and LLM-based Methods Applied to Stellar Lightcurve ClassificationCode3
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text IntegrationCode3
Datasheet for the PileCode3
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context AccurayCode3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
MeshXL: Neural Coordinate Field for Generative 3D Foundation ModelsCode3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
Ola: Pushing the Frontiers of Omni-Modal Language ModelCode3
Cramming: Training a Language Model on a Single GPU in One DayCode3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMsCode3
A Survey on the Optimization of Large Language Model-based AgentsCode3
A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback LearningCode3
A Survey on the Memory Mechanism of Large Language Model based AgentsCode3
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model AgentsCode3
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsCode3
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse AutoencodersCode3
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language ModelCode3
Diffusion Language Models Are Versatile Protein LearnersCode3
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language ModelsCode3
A Survey on Large Language Model Acceleration based on KV Cache ManagementCode3
Lifelong Learning of Large Language Model based Agents: A RoadmapCode3
Agent Workflow MemoryCode3
Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software ImprovementCode3
Llemma: An Open Language Model For MathematicsCode3
Large Language Model-Brained GUI Agents: A SurveyCode3
LaViDa: A Large Diffusion Language Model for Multimodal UnderstandingCode3
Large Language Model based Long-tail Query Rewriting in Taobao SearchCode3
Show:102550
← PrevPage 7 of 284Next →

No leaderboard results yet.