SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 16051–16100 of 474278 papers

Title | Status | Hype
Energy-Efficient Deep Learning for Traffic Classification on Microcontrollers |  | 0
Deep Learning-based Multi Project InP Wafer Simulation for Unsupervised Surface Defect Detection |  | 0
Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework |  | 0
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos |  | 0
GenWorld: Towards Detecting AI-generated Real-world Simulation Videos |  | 0
InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model |  | 0
Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches |  | 0
Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop |  | 0
Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts |  | 0
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills |  | 0
Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning |  | 0
Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding |  | 0
Graph Neural Networks for Automatic Addition of Optimizing Components in Printed Circuit Board Schematics | Code | 0
Spurious Rewards: Rethinking Training Signals in RLVR | Code | 3
StepProof: Step-by-step verification of natural language mathematical proofs | Code | 0
Monitoring Decomposition Attacks in LLMs with Lightweight Sequential Monitors | Code | 0
Unsupervised Deformable Image Registration with Structural Nonparametric Smoothing | Code | 0
Foundation Models for Causal Inference via Prior-Data Fitted Networks |  | 0
Saturation Self-Organizing Map | Code | 0
Data-Driven Prediction of Dynamic Interactions Between Robot Appendage and Granular Material |  | 0
RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding |  | 0
EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence |  | 0
Viability of Future Actions: Robust Safety in Reinforcement Learning via Entropy Regularization | Code | 0
SlotPi: Physics-informed Object-centric Reasoning Models | Code | 0
Learning Chaotic Dynamics with Neuromorphic Network Dynamics | Code | 0
TexTailor: Customized Text-aligned Texturing via Effective Resampling | Code | 0
Augmenting Large Language Models with Static Code Analysis for Automated Code Quality Improvements |  | 0
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation | Code | 3
SoK: Evaluating Jailbreak Guardrails for Large Language Models | Code | 1
Low-Barrier Dataset Collection with Real Human Body for Interactive Per-Garment Virtual Try-On | Code | 1
CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation | Code | 2
A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Code | 1
Understanding In-Context Learning on Structured Manifolds: Bridging Attention to Kernel Methods |  | 0
Execution Guided Line-by-Line Code Generation | Code | 2
QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction | Code | 2
Hessian Geometry of Latent Space in Generative Models | Code | 1
TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Similarity Tree | Code | 3
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Code | 1
SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks | Code | 1
GeoCAD: Local Geometry-Controllable CAD Generation | Code | 0
Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres | Code | 0
ConStyX: Content Style Augmentation for Generalizable Medical Image Segmentation | Code | 0
EQA-RM: A Generative Embodied Reward Model with Test-time Scaling | Code | 0
HalLoc: Token-level Localization of Hallucinations for Vision Language Models | Code | 0
Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles | Code | 1
VideoDeepResearch: Long Video Understanding With Agentic Tool Using | Code | 2
The Diffusion Duality | Code | 3
Conversational Search: From Fundamentals to Frontiers in the LLM Era |  | 0
BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP | Code | 1
Unsupervised Protoform Reconstruction through Parsimonious Rule-guided Heuristics and Evolutionary Search | Code | 0