SOTAVerified

Language Modeling

Papers

Showing 12011250 of 14182 papers

TitleStatusHype
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
Simulating Rumor Spreading in Social Networks using LLM AgentsCode1
Fine-Tuning Discrete Diffusion Models with Policy Gradient MethodsCode1
Speculative Ensemble: Fast Large Language Model Ensemble via SpeculationCode1
Low-Rank Adapting Models for Sparse AutoencodersCode1
Scalable-Softmax Is Superior for AttentionCode1
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-TrainingCode1
2SSP: A Two-Stage Framework for Structured Pruning of LLMsCode1
RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token ReprogrammingsCode1
Atla Selene Mini: A General Purpose Evaluation ModelCode1
Ocean-OCR: Towards General OCR Application via a Vision-Language ModelCode1
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from TransformerCode1
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model CritiquesCode1
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace EditingCode1
Enhancing Biomedical Relation Extraction with DirectionalityCode1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language ModelCode1
Glinthawk: A Two-Tiered Architecture for Offline LLM InferenceCode1
EndoChat: Grounded Multimodal Large Language Model for Endoscopic SurgeryCode1
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language ModelCode1
LAVCap: LLM-based Audio-Visual Captioning using Optimal TransportCode1
WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher LearningCode1
Gandalf the Red: Adaptive Security for LLMsCode1
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene UnderstandingCode1
VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware SparsificationCode1
Merging Feed-Forward Sublayers for Compressed TransformersCode1
Segmenting Text and Learning Their Rewards for Improved RLHF in Language ModelCode1
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model EvaluationCode1
Establishing baselines for generative discovery of inorganic crystalsCode1
Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration DecodingCode1
Rethinking Addressing in Language Models via Contexualized Equivariant Positional EncodingCode1
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsCode1
TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language EnvironmentCode1
Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive DefenseCode1
Facilitating large language model Russian adaptation with Learned Embedding PropagationCode1
No Preference Left Behind: Group Distributional Preference OptimizationCode1
An Engorgio Prompt Makes Large Language Model Babble onCode1
Learning to engineer protein flexibilityCode1
Brain-to-Text Benchmark '24: Lessons LearnedCode1
Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain TestingCode1
Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language ModelCode1
MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and DetectionCode1
Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language ModelsCode1
ConfliBERT: A Language Model for Political ConflictCode1
Cal-DPO: Calibrated Direct Preference Optimization for Language Model AlignmentCode1
Autonomous Microscopy Experiments through Large Language Model AgentsCode1
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language ModelingCode1
EscapeBench: Pushing Language Models to Think Outside the BoxCode1
SnakModel: Lessons Learned from Training an Open Danish Large Language ModelCode1
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image SegmentationCode1
Large Language Models as Realistic Microservice Trace GeneratorsCode1
Show:102550
← PrevPage 25 of 284Next →

No leaderboard results yet.