SOTAVerified

Language Modeling

Papers

Showing 201-250 of 14182 papers

Title | Status | Hype
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Code | 4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Code | 4
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content? | Code | 4
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model | Code | 4
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Code | 4
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation | Code | 4
GLIPv2: Unifying Localization and Vision-Language Understanding | Code | 4
GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Code | 4
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step | Code | 4
LISA: Reasoning Segmentation via Large Language Model | Code | 4
Generative Representational Instruction Tuning | Code | 4
Gated Delta Networks: Improving Mamba2 with Delta Rule | Code | 4
Reasoning with Language Model is Planning with World Model | Code | 4
The Llama 3 Herd of Models | Code | 4
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model | Code | 4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Code | 4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Code | 4
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks | Code | 4
RewardBench: Evaluating Reward Models for Language Modeling | Code | 4
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models | Code | 4
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Code | 4
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Code | 4
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Code | 4
R1-Onevision: An Open-Source Multimodal Large Language Model Capable of Deep Reasoning | Code | 4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Code | 4
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text | Code | 4
Flamingo: a Visual Language Model for Few-Shot Learning | Code | 4
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series | Code | 4
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment | Code | 4
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Code | 4
RaTEScore: A Metric for Radiology Report Generation | Code | 4
Optimizing Prompts for Text-to-Image Generation | Code | 4
Partition Generative Modeling: Masked Modeling Without Masks | Code | 4
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data | Code | 4
AutoWebGLM: A Large Language Model-based Web Navigating Agent | Code | 4
OLMoE: Open Mixture-of-Experts Language Models | Code | 4
Efficient Post-training Quantization with FP8 Formats | Code | 4
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | Code | 4
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct | Code | 4
Galactica: A Large Language Model for Science | Code | 4
N-Grammer: Augmenting Transformers with latent n-grams | Code | 4
Phoenix: Democratizing ChatGPT across Languages | Code | 4
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Code | 4
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Code | 3
GLM: General Language Model Pretraining with Autoregressive Blank Infilling | Code | 3
Multi-objective Asynchronous Successive Halving | Code | 3
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans | Code | 3
DPLM-2: A Multimodal Diffusion Protein Language Model | Code | 3
Multimodal Table Understanding | Code | 3
MotionGPT: Human Motion as a Foreign Language | Code | 3
Page 5 of 284

No leaderboard results yet.