SOTAVerified

Language Modeling

Papers

Showing 301350 of 14182 papers

TitleStatusHype
Order Matters: Sequence to sequence for setsCode3
OVLW-DETR: Open-Vocabulary Light-Weighted Detection TransformerCode3
A Systematic Evaluation of Large Language Models of CodeCode3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-AgentsCode3
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language ModelsCode3
PaliGemma 2: A Family of Versatile VLMs for TransferCode3
A Survey on the Optimization of Large Language Model-based AgentsCode3
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMsCode3
A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback LearningCode3
A Survey on the Memory Mechanism of Large Language Model based AgentsCode3
A Survey on Large Language Model Acceleration based on KV Cache ManagementCode3
On the Efficiency of NLP-Inspired Methods for Tabular Deep LearningCode3
OpenGraph: Towards Open Graph Foundation ModelsCode3
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at ScaleCode3
OceanGPT: A Large Language Model for Ocean Science TasksCode3
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive SurveyCode3
Ola: Pushing the Frontiers of Omni-Modal Language ModelCode3
DPLM-2: A Multimodal Diffusion Protein Language ModelCode3
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video GenerationCode3
Multi-objective Asynchronous Successive HalvingCode3
A Smart Multimodal Healthcare Copilot with Powerful LLM ReasoningCode3
Discovering Language Model Behaviors with Model-Written EvaluationsCode3
MotionGPT: Human Motion as a Foreign LanguageCode3
Multi-agent Architecture Search via Agentic SupernetCode3
Diffusion Language Models Are Versatile Protein LearnersCode3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
Diffusion-LM Improves Controllable Text GenerationCode3
MeshXL: Neural Coordinate Field for Generative 3D Foundation ModelsCode3
MoMA: Multimodal LLM Adapter for Fast Personalized Image GenerationCode3
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RLCode3
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language ModelsCode3
Deep Learning and LLM-based Methods Applied to Stellar Lightcurve ClassificationCode3
Datasheet for the PileCode3
Longformer: The Long-Document TransformerCode3
A Phylogenetic Approach to Genomic Language ModelingCode3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API CallsCode3
Data Filtering NetworksCode3
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context AccurayCode3
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model PromptsCode3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
MultiModal-GPT: A Vision and Language Model for Dialogue with HumansCode3
Multimodal Table UnderstandingCode3
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model AgentsCode3
Cramming: Training a Language Model on a Single GPU in One DayCode3
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language ModelCode3
Llemma: An Open Language Model For MathematicsCode3
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsCode3
Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language ModelCode3
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse AutoencodersCode3
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text IntegrationCode3
Show:102550
← PrevPage 7 of 284Next →

No leaderboard results yet.