SOTAVerified

Language Modeling

Papers

Showing 401450 of 14182 papers

TitleStatusHype
Reasoning with Language Model Prompting: A SurveyCode3
Discovering Language Model Behaviors with Model-Written EvaluationsCode3
Prompting Is Programming: A Query Language for Large Language ModelsCode3
Human-level play in the game of Diplomacy by combining language models with strategic reasoningCode3
What Language Model to Train if You Have One Million GPU Hours?Code3
Diffusion-LM Improves Controllable Text GenerationCode3
A Systematic Evaluation of Large Language Models of CodeCode3
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language ModelCode3
Datasheet for the PileCode3
8-bit Optimizers via Block-wise QuantizationCode3
Finetuned Language Models Are Zero-Shot LearnersCode3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingCode3
Evaluating Large Language Models Trained on CodeCode3
Multi-objective Asynchronous Successive HalvingCode3
GLM: General Language Model Pretraining with Autoregressive Blank InfillingCode3
Prefix-Tuning: Optimizing Continuous Prompts for GenerationCode3
PGL at TextGraphs 2020 Shared Task: Explanation Regeneration using Language and Graph Learning MethodsCode3
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language UnderstandingCode3
Language Models are Few-Shot LearnersCode3
Conformer: Convolution-augmented Transformer for Speech RecognitionCode3
Revisiting Pre-Trained Models for Chinese Natural Language ProcessingCode3
Longformer: The Long-Document TransformerCode3
Semi-Supervised Speech Recognition via Local Prior MatchingCode3
Universal Language Model Fine-tuning for Text ClassificationCode3
Order Matters: Sequence to sequence for setsCode3
Open Source Planning & Control System with Language Agents for Autonomous Scientific DiscoveryCode2
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal AlignmentCode2
OctoThinker: Mid-training Incentivizes Reinforcement Learning ScalingCode2
Language Modeling by Language ModelsCode2
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation BoosterCode2
Watermarking Autoregressive Image GenerationCode2
BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation ModelsCode2
Reasoning-Table: Exploring Reinforcement Learning for Table ReasoningCode2
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and GenerationCode2
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language ModelsCode2
ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RLCode2
Zero-Shot Vision Encoder Grafting via LLM SurrogatesCode2
Improved Representation Steering for Language ModelsCode2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization AlgorithmsCode2
WINA: Weight Informed Neuron Activation for Accelerating Large Language Model InferenceCode2
DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and UnderstandingCode2
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel DecodingCode2
Structure-Aligned Protein Language ModelCode2
CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code GenerationCode2
Efficient Speech Language Modeling via Energy Distance in Continuous Latent SpaceCode2
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement LearningCode2
SLOT: Sample-specific Language Model Optimization at Test-timeCode2
Demystifying and Enhancing the Efficiency of Large Language Model Based Search AgentsCode2
LifelongAgentBench: Evaluating LLM Agents as Lifelong LearnersCode2
WorldPM: Scaling Human Preference ModelingCode2
Show:102550
← PrevPage 9 of 284Next →

No leaderboard results yet.