SOTAVerified

Large Language Model

Papers

Showing 1051–1075 of 6097 papers

| Title | Status | Hype |
| --- | --- | --- |
| Modifying Large Language Model Post-Training for Diverse Creative Writing | Code | 2 |
| Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent | | 0 |
| Improving Quantization with Post-Training Model Expansion | | 0 |
| Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks | | 0 |
| Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation | | 0 |
| Language Models May Verbatim Complete Text They Were Not Explicitly Trained On | | 0 |
| Variance Control via Weight Rescaling in LLM Pre-training | Code | 0 |
| CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Code | 2 |
| How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities | Code | 0 |
| Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms | | 0 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Code | 1 |
| Entropy-based Exploration Conduction for Multi-step Reasoning | | 0 |
| The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination | Code | 1 |
| Cultural Alignment in Large Language Models Using Soft Prompt Tuning | | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | | 0 |
| LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | | 0 |
| Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture | | 0 |
| Using Language Models to Decipher the Motivation Behind Human Behaviors | | 0 |
| ChatGPT and U(X): A Rapid Review on Measuring the User Experience | | 0 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Code | 4 |
| Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 | | 0 |
| Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs | | 0 |
| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Code | 3 |
| Probing the topology of the space of tokens with structured prompts | | 0 |
| Robust Transmission of Punctured Text with Large Language Model-based Recovery | | 0 |
Page 43 of 244

No leaderboard results yet.