SOTAVerified

Language Modeling

Papers

Showing 201–225 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| R1-Onevision: An Open-Source Multimodal Large Language Model Capable of Deep Reasoning | Code | 4 |
| Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Code | 4 |
| BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | Code | 4 |
| Galactica: A Large Language Model for Science | Code | 4 |
| FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Code | 4 |
| Gated Delta Networks: Improving Mamba2 with Delta Rule | Code | 4 |
| GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Code | 4 |
| Flamingo: a Visual Language Model for Few-Shot Learning | Code | 4 |
| BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Code | 4 |
| BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text | Code | 4 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Code | 4 |
| Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Code | 4 |
| Image Fusion via Vision-Language Model | Code | 4 |
| RaTEScore: A Metric for Radiology Report Generation | Code | 4 |
| Phoenix: Democratizing ChatGPT across Languages | Code | 4 |
| Partition Generative Modeling: Masked Modeling Without Masks | Code | 4 |
| Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Code | 4 |
| Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference | Code | 4 |
| Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data | Code | 4 |
| OLMoE: Open Mixture-of-Experts Language Models | Code | 4 |
| Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step | Code | 4 |
| Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention | Code | 4 |
| Optimizing Prompts for Text-to-Image Generation | Code | 4 |
| N-Grammer: Augmenting Transformers with latent n-grams | Code | 4 |
| AutoWebGLM: A Large Language Model-based Web Navigating Agent | Code | 4 |

No leaderboard results yet.