SOTAVerified

Language Modeling

Papers

Showing 12011225 of 14182 papers

TitleStatusHype
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
Simulating Rumor Spreading in Social Networks using LLM AgentsCode1
Fine-Tuning Discrete Diffusion Models with Policy Gradient MethodsCode1
Speculative Ensemble: Fast Large Language Model Ensemble via SpeculationCode1
Low-Rank Adapting Models for Sparse AutoencodersCode1
Scalable-Softmax Is Superior for AttentionCode1
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-TrainingCode1
2SSP: A Two-Stage Framework for Structured Pruning of LLMsCode1
RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token ReprogrammingsCode1
Atla Selene Mini: A General Purpose Evaluation ModelCode1
Ocean-OCR: Towards General OCR Application via a Vision-Language ModelCode1
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from TransformerCode1
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace EditingCode1
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model CritiquesCode1
Enhancing Biomedical Relation Extraction with DirectionalityCode1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language ModelCode1
Glinthawk: A Two-Tiered Architecture for Offline LLM InferenceCode1
EndoChat: Grounded Multimodal Large Language Model for Endoscopic SurgeryCode1
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language ModelCode1
LAVCap: LLM-based Audio-Visual Captioning using Optimal TransportCode1
WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher LearningCode1
Gandalf the Red: Adaptive Security for LLMsCode1
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene UnderstandingCode1
VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware SparsificationCode1
Merging Feed-Forward Sublayers for Compressed TransformersCode1
Show:102550
← PrevPage 49 of 568Next →

No leaderboard results yet.