SOTAVerified

Large Language Model

Papers

Showing 10511100 of 6097 papers

TitleStatusHype
Modifying Large Language Model Post-Training for Diverse Creative WritingCode2
Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation0
Language Models May Verbatim Complete Text They Were Not Explicitly Trained On0
Improving Quantization with Post-Training Model Expansion0
Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent0
Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks0
Variance Control via Weight Rescaling in LLM Pre-trainingCode0
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application VulnerabilitiesCode2
How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing CapabilitiesCode0
Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms0
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future DirectionsCode1
Entropy-based Exploration Conduction for Multi-step Reasoning0
Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture0
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data ContaminationCode1
Using Language Models to Decipher the Motivation Behind Human Behaviors0
Cultural Alignment in Large Language Models Using Soft Prompt Tuning0
LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates0
DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs0
ChatGPT and U(X): A Rapid Review on Measuring the User Experience0
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement LearningCode4
Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R10
Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs0
GenM^3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation0
Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection0
Robust Transmission of Punctured Text with Large Language Model-based Recovery0
Probing the topology of the space of tokens with structured prompts0
LEGION: Learning to Ground and Explain for Synthetic Image Detection0
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models0
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation0
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning TasksCode3
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual SettingsCode0
Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication0
Gender and content bias in Large Language Models: a case study on Google Gemini 2.0 Flash Experimental0
Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations0
Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search0
MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments0
Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation0
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability0
Engineering Scientific Assistants using Interactive Structured Induction of Programs0
Gricean Norms as a Basis for Effective CollaborationCode0
The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations0
KVShare: An LLM Service System with Efficient and Effective Multi-Tenant KV Cache Reuse0
HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model0
AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications0
Pensez: Less Data, Better Reasoning -- Rethinking French LLM0
Mitigating KV Cache Competition to Enhance User Experience in LLM Inference0
Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model0
Knowledge-Aware Iterative Retrieval for Multi-Agent Systems0
PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing0
Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models0
Show:102550
← PrevPage 22 of 122Next →

No leaderboard results yet.