SOTAVerified

Language Modeling

Papers

Showing 151200 of 14182 papers

TitleStatusHype
Language Model Beats Diffusion -- Tokenizer is Key to Visual GenerationCode4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal LearningCode4
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth ApproachCode4
SEED-Story: Multimodal Long Story Generation with Large Language ModelCode4
Self-Play Preference Optimization for Language Model AlignmentCode4
Sailor: Open Language Models for South-East AsiaCode4
Safurai 001: New Qualitative Approach for Code LLM EvaluationCode4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language ModelingCode4
AnyGPT: Unified Multimodal LLM with Discrete Sequence ModelingCode4
INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank AdaptationCode4
LISA++: An Improved Baseline for Reasoning Segmentation with Large Language ModelCode4
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 smallCode4
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent ExplorationCode4
RewardBench: Evaluating Reward Models for Language ModelingCode4
ImgEdit: A Unified Image Editing Dataset and BenchmarkCode4
Image Fusion via Vision-Language ModelCode4
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain KnowledgeCode4
ChatHaruhi: Reviving Anime Character in Reality via Large Language ModelCode4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMsCode4
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation GenerationCode4
RaTEScore: A Metric for Radiology Report GenerationCode4
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought TemplatesCode4
SNAC: Multi-Scale Neural Audio CodecCode4
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?Code4
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language ModelCode4
GigaAM: Efficient Self-Supervised Learner for Speech RecognitionCode4
GLIPv2: Unifying Localization and Vision-Language UnderstandingCode4
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
Phoenix: Democratizing ChatGPT across LanguagesCode4
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language ModelsCode4
BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelCode4
Galactica: A Large Language Model for ScienceCode4
Generative Representational Instruction TuningCode4
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language ModelsCode4
FoundationPose: Unified 6D Pose Estimation and Tracking of Novel ObjectsCode4
Partition Generative Modeling: Masked Modeling Without MasksCode4
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical TextCode4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement LearningCode4
OLMoE: Open Mixture-of-Experts Language ModelsCode4
Beyond Reward Hacking: Causal Rewards for Large Language Model AlignmentCode4
Optimizing Prompts for Text-to-Image GenerationCode4
AgentGym: Evolving Large Language Model-based Agents across Diverse EnvironmentsCode4
N-Grammer: Augmenting Transformers with latent n-gramsCode4
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language ModelsCode4
Reasoning with Language Model is Planning with World ModelCode4
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat DataCode4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language ModelsCode4
Flamingo: a Visual Language Model for Few-Shot LearningCode4
MutaPLM: Protein Language Modeling for Mutation Explanation and EngineeringCode4
Show:102550
← PrevPage 4 of 284Next →

No leaderboard results yet.