SOTAVerified

Language Modeling

Papers

Showing 31513200 of 14182 papers

TitleStatusHype
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models0
PLaMo-100B: A Ground-Up Language Model Designed for Japanese Proficiency0
Plug-and-Play Performance Estimation for LLM Services without Relying on Labeled DataCode0
OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring ModelingCode2
Q-VLM: Post-training Quantization for Large Vision-Language ModelsCode2
CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression0
More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed RoutingCode0
Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models0
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked TextCode2
Evolutionary Contrastive Distillation for Language Model Alignment0
Efficient Reinforcement Learning with Large Language Model Priors0
Disease Entity Recognition and Normalization is Improved with Large Language Model Derived Synthetic Normalized Mentions0
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model PromptingCode1
Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student RevisionsCode0
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMsCode2
Recent advancements in LLM Red-Teaming: Techniques, Defenses, and Ethical Considerations0
Generating long-horizon stock "buy" signals with a neural language model0
QuAILoRA: Quantization-Aware Initialization for LoRA0
Exploring Prompt Engineering: A Systematic Review with SWOT Analysis0
TinyClick: Single-Turn Agent for Empowering GUI Automation0
Enhancing Vision-Language Model Pre-training with Image-text Pair Pruning Based on Word FrequencyCode0
AuditWen:An Open-Source Large Language Model for AuditCode1
Exploring Efficient Foundational Multi-modal Models for Video Summarization0
Multi-Task Program Error Repair and Explanatory Diagnosis0
β-calibration of Language Model Confidence Scores for Generative QA0
Large Language Model Compression with Neural Architecture Search0
Pixtral 12BCode11
Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning0
Sylber: Syllabic Embedding Representation of Speech from Raw AudioCode2
Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures0
Towards Interpreting Visual Information Processing in Vision-Language ModelsCode2
TinyEmo: Scaling down Emotional Reasoning via Metric ProjectionCode0
Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling0
FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding0
Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models0
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM UnlearningCode1
Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear ComplexityCode0
Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara0
Application of NotebookLM, a Large Language Model with Retrieval-Augmented Generation, for Lung Cancer Staging0
Applying Refusal-Vector Ablation to Llama 3.1 70B Agents0
BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile ManipulationCode2
Enhancing SPARQL Generation by Triplet-order-sensitive Pre-trainingCode0
ParallelSpec: Parallel Drafter for Efficient Speculative Decoding0
Multi-Session Client-Centered Treatment Outcome Evaluation in Psychotherapy0
Training-free Diffusion Model Alignment with Sampling DemonsCode1
FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning0
Accelerated Preference Optimization for Large Language Model Alignment0
Retrieving, Rethinking and Revising: The Chain-of-Verification Can Improve Retrieval Augmented Generation0
Think While You Generate: Discrete Diffusion with Planned DenoisingCode2
RL, but don't do anything I wouldn't doCode0
Show:102550
← PrevPage 64 of 284Next →

No leaderboard results yet.