SOTAVerified

Large Language Model

Papers

Showing 826–850 of 6097 papers

| Title | Status | Hype |
|---|---|---|
| Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach | | 0 |
| HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks | Code | 1 |
| Position: The Most Expensive Part of an LLM should be its Training Data | | 0 |
| Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures | | 0 |
| Towards Conversational AI for Human-Machine Collaborative MLOps | | 0 |
| Recommending Clinical Trials for Online Patient Cases using Artificial Intelligence | | 0 |
| GraphicBench: A Planning Benchmark for Graphic Design with Language Agents | | 0 |
| A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | | 0 |
| Video Summarization with Large Language Models | | 0 |
| When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers | | 0 |
| Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content | | 0 |
| ReZero: Enhancing LLM search ability by trying one-more-time | | 0 |
| Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning | Code | 3 |
| Learning to Be A Doctor: Searching for Effective Medical Agent Architectures | | 0 |
| The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections | | 0 |
| Evaluation Report on MCP Servers | Code | 3 |
| Transferable text data distillation by trajectory matching | | 0 |
| A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science | | 0 |
| Investigating cybersecurity incidents using large language models in latest-generation wireless networks | | 0 |
| LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks | Code | 0 |
| Mavors: Multi-granularity Video Representation for Multimodal Large Language Model | | 0 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Code | 2 |
| InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models | | 0 |
| LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement | | 0 |
| Automated Testing of COBOL to Java Transformation | | 0 |
Page 34 of 244

No leaderboard results yet.