SOTAVerified

Large Language Model

Papers

Showing 101150 of 6097 papers

TitleStatusHype
A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction0
Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection0
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics LearningCode2
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image GenerationCode3
Evolving Prompts In-Context: An Open-ended, Self-replicating PerspectiveCode1
Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms0
Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation BoosterCode2
Mechanistic Interpretability in the Presence of Architectural ObfuscationCode0
JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent0
DreamJourney: Perpetual View Generation with Video Diffusion Models0
Can Generated Images Serve as a Viable Modality for Text-Centric Multimodal Learning?0
Research on Model Parallelism and Data Parallelism Optimization Methods in Large Language Model-Based Recommendation Systems0
Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models0
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For DrivingCode1
OmniReflect: Discovering Transferable Constitutions for LLM agents via Neuro-Symbolic Reflections0
Challenges in Grounding Language in the Real World0
Computational Approaches to Understanding Large Language Model Impact on Writing and Information Ecosystems0
The Condition Number as a Scale-Invariant Proxy for Information Encoding in Neural UnitsCode1
Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support0
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language ModelsCode1
LLMs in Coding and their Impact on the Commercial Software Engineering Landscape0
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling ResearchCode1
AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System NeedCode0
Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks0
RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World EnvironmentsCode0
video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language ModelsCode2
deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses0
SonicVerse: Multi-Task Learning for Music Feature-Informed CaptioningCode2
LLM Agent for Hyper-Parameter Optimization0
DisProtEdit: Exploring Disentangled Representations for Multi-Attribute Protein Editing0
Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition0
FEAST: A Flexible Mealtime-Assistance System Towards In-the-Wild Personalization0
Utility-Driven Speculative Decoding for Mixture-of-Experts0
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM0
Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning0
Unified Software Engineering agent as AI Software Engineer0
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs0
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue AgentsCode0
RMIT-ADM+S at the SIGIR 2025 LiveRAG ChallengeCode1
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR0
CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model0
Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems0
Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs0
ProfiLLM: An LLM-Based Framework for Implicit Profiling of Chatbot Users0
EmoNews: A Spoken Dialogue System for Expressive News ConversationsCode0
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection0
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech ModelCode5
VIS-Shepherd: Constructing Critic for LLM-based Data Visualization GenerationCode0
SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation0
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries0
Show:102550
← PrevPage 3 of 122Next →

No leaderboard results yet.