SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 15101-15150 of 474,278 papers

Title | Status | Hype
The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products | Code | 1
Self-Supervised Enhancement for Depth from a Lightweight ToF Sensor with Monocular Images | Code | 1
Tady: A Neural Disassembler without Structural Constraint Violations | Code | 1
Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers | Code | 1
Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs | Code | 1
Probing Deep into Temporal Profile Makes the Infrared Small Target Detector Much Better | Code | 1
TCANet: A Temporal Convolutional Attention Network for Motor Imagery EEG Decoding | Code | 1
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies | Code | 1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Code | 1
SmartHome-Bench: A Comprehensive Benchmark for Video Anomaly Detection in Smart Homes Using Multi-Modal Large Language Models | Code | 1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Code | 1
BSA: Ball Sparse Attention for Large-scale Geometries | Code | 1
Domain Generalization for Person Re-identification: A Survey Towards Domain-Agnostic Person Matching | Code | 1
Real-Time Per-Garment Virtual Try-On with Temporal Consistency for Loose-Fitting Garments | Code | 1
Vectorized Sparse Second-Order Forward Automatic Differentiation for Optimal Control Direct Methods | Code | 1
Schema-R1: A reasoning training approach for schema linking in Text-to-SQL Task | Code | 1
Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution | Code | 1
Dynamic Grid Trading Strategy: From Zero Expectation to Market Outperformance | Code | 1
Self-supervised Learning of Echocardiographic Video Representations via Online Cluster Distillation | Code | 1
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models | Code | 1
Recursive KalmanNet: Deep Learning-Augmented Kalman Filtering for State Estimation with Consistent Uncertainty Quantification | Code | 1
DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents | Code | 1
SIMSHIFT: A Benchmark for Adapting Neural Surrogates to Distribution Shifts | Code | 1
PRO-V: An Efficient Program Generation Multi-Agent System for Automatic RTL Verification | Code | 1
ICME 2025 Grand Challenge on Video Super-Resolution for Video Conferencing | Code | 1
Visual Pre-Training on Unlabeled Images using Reinforcement Learning | Code | 1
Diffusion-Based Electrocardiography Noise Quantification via Anomaly Detection | Code | 1
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation | Code | 1
Dual‑detector Re‑optimization for Federated Weakly Supervised Video Anomaly Detection Via Adaptive Dynamic Recursive Mapping | Code | 1
SoK: Evaluating Jailbreak Guardrails for Large Language Models | Code | 1
Probably Approximately Correct Labels | Code | 1
Low-Barrier Dataset Collection with Real Human Body for Interactive Per-Garment Virtual Try-On | Code | 1
PyLO: Towards Accessible Learned Optimizers in PyTorch | Code | 1
A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Code | 1
ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change Queries | Code | 1
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Code | 1
TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora | Code | 1
SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks | Code | 1
Hessian Geometry of Latent Space in Generative Models | Code | 1
BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP | Code | 1
Principled Approaches for Extending Neural Architectures to Function Spaces for Operator Learning | Code | 1
Towards Robust Multimodal Emotion Recognition under Missing Modalities and Distribution Shifts | Code | 1
DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Transformer and Mamba | Code | 1
It's Not the Target, It's the Background: Rethinking Infrared Small Target Detection via Deep Patch-Free Low-Rank Representations | Code | 1
Farseer: A Refined Scaling Law in Large Language Models | Code | 1
Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles | Code | 1
NoLoCo: No-all-reduce Low Communication Training Method for Large Models | Code | 1
Anti-Aliased 2D Gaussian Splatting | Code | 1
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier | Code | 1
Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization | Code | 1
Page 303 of 9486