SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2035120400 of 474278 papers

TitleStatusHype
Improved Representation Steering for Language ModelsCode2
Conditional Diffusion Models with Classifier-Free Gibbs-like GuidanceCode0
StreamLink: Large-Language-Model Driven Distributed Data Engineering System0
The Role of AI in Early Detection of Life-Threatening Diseases: A Retinal Imaging Perspective0
Can we Debias Social Stereotypes in AI-Generated Images? Examining Text-to-Image Outputs and User Perceptions0
Unpaired Image-to-Image Translation for Segmentation and Signal Unmixing0
IKMo: Image-Keyframed Motion Generation with Trajectory-Pose Conditioned Motion Diffusion Model0
Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective0
FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial InformationCode1
Privacy-Preserving Chest X-ray Report Generation via Multimodal Federated Learning with ViT and GPT-20
Label-free Super-Resolution Microvessel Color Flow Imaging with Ultrasound0
Graph Neural Network Aided Detection for the Multi-User Multi-Dimensional Index Modulated Uplink0
What happens when generative AI models train recursively on each others' generated outputs?0
Multi-Mode Process Control Using Multi-Task Inverse Reinforcement Learning0
A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction0
A Joint Reconstruction-Triplet Loss Autoencoder Approach Towards Unseen Attack Detection in IoV Networks0
PrivATE: Differentially Private Confidence Intervals for Average Treatment Effects0
Red-Teaming Text-to-Image Systems by Rule-based Preference Modeling0
Unveiling Impact of Frequency Components on Membership Inference Attacks for Diffusion Models0
AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models0
Responsible Data Stewardship: Generative AI and the Digital Waste Problem0
Public Discourse Sandbox: Facilitating Human and AI Digital Communication Research0
Position is Power: System Prompts as a Mechanism of Bias in Large Language Models (LLMs)0
Fairness in Federated Learning: Fairness for Whom?0
Beyond Explainability: The Case for AI Validation0
RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models0
Time-Series Learning for Proactive Fault Prediction in Distributed Systems with Deep Neural Structures0
InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling0
Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing0
Efficient Diffusion Models for Symmetric Manifolds0
Scheduling with Uncertain Holding Costs and its Application to Content Moderation0
Quantum Machine Learning in Healthcare: Evaluating QNN and QSVM Models0
CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians0
Be Decisive: Noise-Induced Layouts for Multi-Subject Generation0
CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects0
A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment0
Creativity in LLM-based Multi-Agent Systems: A Survey0
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions0
Supervised Contrastive Learning for Ordinal Engagement Measurement0
Large Language Models Miss the Multi-Agent Mark0
Recurrent Neural Operators: Stable Long-Term PDE Prediction0
Fog Intelligence for Network Anomaly Detection0
Diagnosing and Resolving Cloud Platform Instability with Multi-modal RAG LLMs0
Do Betting Markets Sense a Goal Coming? Evidence from the German Bundesliga0
Visual Loop Closure Detection Through Deep Graph Consensus0
MIND-Stack: Modular, Interpretable, End-to-End Differentiability for Autonomous Navigation0
PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation0
Spatial RoboGrasp: Generalized Robotic Grasping Control Policy0
Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning0
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation0
Show:102550
← PrevPage 408 of 9486Next →