SOTAVerified

Language Modeling

Papers

Showing 3876-3900 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | | 0 |
| ParamΔ for Direct Weight Mixing: Post-Train Large Language Model at Zero Cost | | 0 |
| Planning with Diffusion Models for Target-Oriented Dialogue Systems | | 0 |
| Monte Carlo Planning with Large Language Model for Text-Based Game Agents | | 0 |
| Improving Significant Wave Height Prediction Using Chronos Models | | 0 |
| SplitReason: Learning To Offload Reasoning | | 0 |
| In-Context Learning can distort the relationship between sequence likelihoods and biological fitness | | 0 |
| Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion | | 0 |
| FaceInsight: A Multimodal Large Language Model for Face Perception | | 0 |
| Do It For Me vs. Do It With Me: Investigating User Perceptions of Different Paradigms of Automation in Copilots for Feature-Rich Software | | 0 |
| LLMs meet Federated Learning for Scalable and Secure IoT Management | | 0 |
| Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 | | 0 |
| DATETIME: A new benchmark to measure LLM translation and reasoning capabilities | Code | 0 |
| Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models | | 0 |
| What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns | Code | 0 |
| Large Language Model Empowered Privacy-Protected Framework for PHI Annotation in Clinical Notes | | 0 |
| Enhancing TCR-Peptide Interaction Prediction with Pretrained Language Models and Molecular Representations | | 0 |
| RepliBench: Evaluating the Autonomous Replication Capabilities of Language Model Agents | | 0 |
| Kuwain 1.5B: An Arabic SLM via Language Injection | | 0 |
| Speculative Sampling via Exponential Races | | 0 |
| LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning | | 0 |
| EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models | | 0 |
| Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions | | 0 |
| Virology Capabilities Test (VCT): A Multimodal Virology Q&A Benchmark | Code | 0 |
| PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines | | 0 |
Page 156 of 568

No leaderboard results yet.