SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 64516500 of 177340 papers

TitleStatusHype
Xplique: A Deep Learning Explainability ToolboxCode2
Neural Networks for ChessCode2
Control Industrial Automation System with Large Language Model AgentsCode2
Accurate Computation of Quantum Excited States with Neural NetworksCode2
CRNet: A Detail-Preserving Network for Unified Image Restoration and Enhancement TaskCode2
PIDNet: A Real-time Semantic Segmentation Network Inspired by PID ControllersCode2
Graph-Aware Isomorphic Attention for Adaptive Dynamics in TransformersCode2
Bayesian Flow NetworksCode2
Spatio-Temporal Few-Shot Learning via Diffusive Neural Network GenerationCode2
VeCLIP: Improving CLIP Training via Visual-enriched CaptionsCode2
Distillation Quantification for Large Language ModelsCode2
DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous DrivingCode2
Polyper: Boundary Sensitive Polyp SegmentationCode2
Evaluating Neural Networks Architectures for Spring Reverb ModellingCode2
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane ExtrapolationCode2
Fully-inductive Node Classification on Arbitrary GraphsCode2
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image SynthesisCode2
Motif Channel Opened in a White-Box: Stereo Matching via Motif Correlation GraphCode2
LEAD: Large Foundation Model for EEG-Based Alzheimer's Disease DetectionCode2
SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-ImprovementCode2
Spatial-Temporal Large Language Model for Traffic PredictionCode2
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level CodeCode2
MeshDiffusion: Score-based Generative 3D Mesh ModelingCode2
PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy ReductionCode2
Invisible Image Watermarks Are Provably Removable Using Generative AICode2
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and GenerationCode2
Revisiting Classifier: Transferring Vision-Language Models for Video RecognitionCode2
evosax: JAX-based Evolution StrategiesCode2
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D PerceptionCode2
MemBench: Towards More Comprehensive Evaluation on the Memory of LLM-based AgentsCode2
TorchFX: A modern approach to Audio DSP with PyTorch and GPU accelerationCode2
Cross-domain Neural Pitch and Periodicity EstimationCode2
Sinkhorn Distance Minimization for Knowledge DistillationCode2
Precise Zero-Shot Dense Retrieval without Relevance LabelsCode2
Diffusion Bridge Implicit ModelsCode2
Capturing and Animation of Body and Clothing from Monocular VideoCode2
CDFormer:When Degradation Prediction Embraces Diffusion Model for Blind Image Super-ResolutionCode2
Immersive Neural Graphics PrimitivesCode2
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast DecodingCode2
Reducing Energy Bloat in Large Model TrainingCode2
Cheetah: Bridging the Gap Between Machine Learning and Particle Accelerator Physics with High-Speed, Differentiable SimulationsCode2
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction FollowingCode2
Neural Potential Field for Obstacle-Aware Local Motion PlanningCode2
Unlocking the Hidden Potential of CLIP in Generalizable Deepfake DetectionCode2
Growing Steerable Neural Cellular AutomataCode2
Unleashing Text-to-Image Diffusion Models for Visual PerceptionCode2
LLM Agents Making Agent ToolsCode2
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance SegmentationCode2
Bringing Old Films Back to LifeCode2
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series ClassificationCode2
Show:102550
← PrevPage 130 of 3547Next →