SOTAVerified

Large Language Model

Papers

Showing 21012150 of 6097 papers

TitleStatusHype
Bottom-Up Synthesis of Knowledge-Grounded Task-Oriented Dialogues with Iteratively Self-Refined Prompts0
Manipulating Multimodal Agents via Cross-Modal Prompt Injection0
Large Language Model Enhanced Particle Swarm Optimization for Hyperparameter Tuning for Deep Learning Models0
Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management0
SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation0
FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference0
PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting0
Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization0
High-Throughput LLM inference on Heterogeneous Clusters0
System of Agentic AI for the Discovery of Metal-Organic Frameworks0
Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods0
Large Language Bayes0
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving SafetyCode0
Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt GenerationCode0
RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines0
Scaling sparse feature circuit finding for in-context learning0
ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images0
Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback0
Causal-Copilot: An Autonomous Causal Analysis Agent0
Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacksCode0
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery0
Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep ClassificationCode0
DIDS: Domain Impact-aware Data Sampling for Large Language Model Training0
Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge0
Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures0
Position: The Most Expensive Part of an LLM should be its Training Data0
Towards Conversational AI for Human-Machine Collaborative MLOps0
Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification0
Generative Recommendation with Continuous-Token Diffusion0
BitNet b1.58 2B4T Technical Report0
Mixer Metaphors: audio interfaces for non-musical applications0
Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach0
Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM0
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers0
The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections0
Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content0
Video Summarization with Large Language Models0
ReZero: Enhancing LLM search ability by trying one-more-time0
Recommending Clinical Trials for Online Patient Cases using Artificial Intelligence0
Learning to Be A Doctor: Searching for Effective Medical Agent Architectures0
GraphicBench: A Planning Benchmark for Graphic Design with Language Agents0
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports0
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models0
Investigating cybersecurity incidents using large language models in latest-generation wireless networks0
SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning0
GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction0
Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design0
SUMART: SUMmARizing Translation from Wordy to Concise Expression0
Automated Testing of COBOL to Java Transformation0
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model0
Show:102550
← PrevPage 43 of 122Next →

No leaderboard results yet.