SOTAVerified

Large Language Model

Papers

Showing 201–250 of 6097 papers

Title | Status | Hype
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis | Code | 1
WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis | | 0
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Code | 1
MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning | | 0
CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models | | 0
SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models | | 0
From Pixels to Graphs: using Scene and Knowledge Graphs for HD-EPIC VQA Challenge | | 0
Your Agent Can Defend Itself against Backdoor Attacks | | 0
LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs | | 0
Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data | | 0
G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Code | 3
LLM Unlearning Should Be Form-Independent | | 0
JavelinGuard: Low-Cost Transformer Architectures for LLM Security | | 0
MiniCPM4: Ultra-Efficient LLMs on End Devices | Code | 9
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO | | 0
Statistical Hypothesis Testing for Auditing Robustness in Language Models | | 0
Event-Priori-Based Vision-Language Model for Efficient Visual Understanding | | 0
An Intelligent Fault Self-Healing Mechanism for Cloud AI Systems via Integration of Large Language Models and Deep Reinforcement Learning | | 0
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions | Code | 2
SpatialLM: Training Large Language Models for Structured Indoor Modeling | | 0
Language-Grounded Hierarchical Planning and Execution with Multi-Robot 3D Scene Graphs | | 0
Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations | | 0
How Benchmark Prediction from Fewer Data Misses the Mark | Code | 0
QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA | | 0
Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph | Code | 0
AnnoDPO: Protein Functional Annotation Learning with Direct Preference Optimization | Code | 0
Speech Recognition on TV Series with Video-guided Post-Correction | | 0
Contextual Experience Replay for Self-Improvement of Language Agents | | 0
An Agentic Framework for Autonomous Metamaterial Modeling and Inverse Design | | 0
RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks | | 0
AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search | Code | 0
PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time | | 0
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Code | 1
SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms | | 0
Prompting Wireless Networks: Reinforced In-Context Learning for Power Control | | 0
Hierarchical Debate-Based Large Language Model (LLM) for Complex Task Planning of 6G Network Management | | 0
Training-Free Query Optimization via LLM-Based Plan Similarity | | 0
Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage | | 0
HeavyWater and SimplexWater: Watermarking Low-Entropy Text Distributions | Code | 0
ScriptDoctor: Automatic Generation of PuzzleScript Games via Large Language Models and Tree Search | | 0
Voice Impression Control in Zero-Shot TTS | | 0
Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias | Code | 1
Customizing Speech Recognition Model with Large Language Model Feedback | | 0
Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic Data | Code | 1
E-bike agents: Large Language Model-Driven E-Bike Accident Analysis and Severity Prediction | | 0
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development | Code | 7
DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models | | 0
Interpretable Multimodal Framework for Human-Centered Street Assessment: Integrating Visual-Language Models for Perceptual Urban Diagnostics | | 0
Parking, Perception, and Retail: Street-Level Determinants of Community Vitality in Harbin | | 0
HALoS: Hierarchical Asynchronous Local SGD over Slow Networks for Geo-Distributed Large Language Model Training | Code | 0
Page 5 of 122

No leaderboard results yet.