SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1255112600 of 474278 papers

TitleStatusHype
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
Warehouse Spatial Question Answering with LLM AgentCode1
WhisperKit: On-device Real-time ASR with Billion-Scale Transformers0
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation0
WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph ModelingCode1
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data ContaminationCode1
Iceberg: Enhancing HLS Modeling with Synthetic DataCode0
REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at OnceCode1
A Simple Approximate Bayesian Inference Neural Surrogate for Stochastic Petri Net ModelsCode0
MLAR: Multi-layer Large Language Model-based Robotic Process Automation Applicant Tracking0
Wavelet-Enhanced Neural ODE and Graph Attention for Interpretable Energy Forecasting0
Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance0
On Gradual Semantics for Assumption-Based ArgumentationCode0
Text-Visual Semantic Constrained AI-Generated Image Quality AssessmentCode1
Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures0
Overcoming catastrophic forgetting in neural networks0
Bridging Robustness and Generalization Against Word Substitution Attacks in NLP via the Growth Bound Matrix ApproachCode0
LifelongPR: Lifelong knowledge fusion for point cloud place recognition based on replay and prompt learningCode0
IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-ResolutionCode1
VoTranhAbyssCoreMicro and PoliticalCore: A Unified Framework for Simulating Complex Economic and Political DynamicsCode0
Predictive Modeling: BIM Command Recommendation Based on Large-scale Usage LogsCode0
TinyTroupe: An LLM-powered Multiagent Persona Simulation ToolkitCode0
DRPCA-Net: Make Robust PCA Great Again for Infrared Small Target DetectionCode0
Auto-Regressively Generating Multi-View Consistent ImagesCode0
SeqCSIST: Sequential Closely-Spaced Infrared Small Target UnmixingCode0
EyeSeg: An Uncertainty-Aware Eye Segmentation Framework for AR/VRCode0
Hear-Your-Click: Interactive Object-Specific Video-to-Audio GenerationCode0
ViSP: A PPO-Driven Framework for Sarcasm Generation with Contrastive LearningCode0
When Schrödinger Bridge Meets Real-World Image Dehazing with Unpaired TrainingCode0
Generative Cognitive DiagnosisCode0
Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal InteractionsCode0
Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive0
Landmark Detection for Medical Images using a General-purpose Segmentation Model0
Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation0
Federated Learning with Graph-Based Aggregation for Traffic Forecasting0
Lightweight Federated Learning over Wireless Edge Networks0
Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks0
Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding0
VST-Pose: A Velocity-Integrated Spatiotem-poral Attention Network for Human WiFi Pose EstimationCode0
FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise0
Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI0
Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges0
DRAGD: A Federated Unlearning Data Reconstruction Attack Based on Gradient Differences0
Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language ModelsCode0
KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection0
BitParticle: Partializing Sparse Dual-Factors to Build Quasi-Synchronizing MAC Arrays for Energy-efficient DNNs0
AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs)Code0
Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs0
Fast3D: Accelerating 3D Multi-modal Large Language Models for Efficient 3D Scene UnderstandingCode0
WellPINN: Accurate Well Representation for Transient Fluid Pressure Diffusion in Subsurface Reservoirs with Physics-Informed Neural NetworksCode0
Show:102550
← PrevPage 252 of 9486Next →