SOTAVerified

Language Modeling

Papers

Showing 42764300 of 14182 papers

TitleStatusHype
Language Model Personalization via Reward Factorization0
From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models0
Look Before You Leap: Using Serialized State Machine for Language Conditioned Robotic Manipulation0
DETQUS: Decomposition-Enhanced Transformers for QUery-focused Summarization0
Is Your Video Language Model a Reliable Judge?0
IDEA Prune: An Integrated Enlarge-and-Prune Pipeline in Generative Language Model Pretraining0
Leveraging Approximate Caching for Faster Retrieval-Augmented Generation0
LLM-based Iterative Approach to Metamodeling in Automotive0
Frequency Autoregressive Image Generation with Continuous Tokens0
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance0
QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation0
Unveiling Biases in AI: ChatGPT's Political Economy Perspectives and Human Comparisons0
SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding0
The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy0
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks0
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model CompressionCode0
Measuring temporal effects of agent knowledge by date-controlled tool use0
Wanda++: Pruning Large Language Models via Regional GradientsCode0
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining0
TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction0
AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services0
Better Process Supervision with Bi-directional Rewarding Signals0
KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney DiseaseCode0
From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design0
Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges0
Show:102550
← PrevPage 172 of 568Next →

No leaderboard results yet.