SOTAVerified

Large Language Model

Papers

Showing 801850 of 6097 papers

TitleStatusHype
System of Agentic AI for the Discovery of Metal-Organic Frameworks0
PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting0
Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization0
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving SafetyCode0
Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt GenerationCode0
Scaling sparse feature circuit finding for in-context learning0
RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines0
Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback0
Causal-Copilot: An Autonomous Causal Analysis Agent0
ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images0
Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge0
Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural IntegrationCode1
Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacksCode0
Retrieval-Augmented Generation with Conflicting EvidenceCode1
DIDS: Domain Impact-aware Data Sampling for Large Language Model Training0
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery0
SkyReels-V2: Infinite-length Film Generative ModelCode9
Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep ClassificationCode0
SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction UnderstandingCode1
Mixer Metaphors: audio interfaces for non-musical applications0
BitNet b1.58 2B4T Technical Report0
Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM0
Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification0
AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly DetectionCode1
Generative Recommendation with Continuous-Token Diffusion0
Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach0
HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design TasksCode1
Position: The Most Expensive Part of an LLM should be its Training Data0
Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures0
Towards Conversational AI for Human-Machine Collaborative MLOps0
Recommending Clinical Trials for Online Patient Cases using Artificial Intelligence0
GraphicBench: A Planning Benchmark for Graphic Design with Language Agents0
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports0
Video Summarization with Large Language Models0
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers0
Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content0
ReZero: Enhancing LLM search ability by trying one-more-time0
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement LearningCode3
Learning to Be A Doctor: Searching for Effective Medical Agent Architectures0
The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections0
Evaluation Report on MCP ServersCode3
Transferable text data distillation by trajectory matching0
A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science0
Investigating cybersecurity incidents using large language models in latest-generation wireless networks0
LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current BenchmarksCode0
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model0
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models0
LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement0
Automated Testing of COBOL to Java Transformation0
Show:102550
← PrevPage 17 of 122Next →

No leaderboard results yet.