SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 12301–12350 of 474278 papers

Title | Status | Hype
Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility | - | 0
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios | - | 0
BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images | - | 0
Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark | - | 0
Towards Autonomous Riding: A Review of Perception, Planning, and Control in Intelligent Two-Wheelers | - | 0
Context-Aware Search and Retrieval Over Erasure Channels | - | 0
A Survey of Deep Learning for Geometry Problem Solving | Code | 0
Analytic estimation of parameters of stochastic volatility diffusion models with exponential-affine characteristic function for currency option pricing | Code | 0
Distributional Reinforcement Learning on Path-dependent Options | - | 0
Self-Adaptive and Robust Federated Spectrum Sensing without Benign Majority for Cellular Networks | - | 0
Site-Level Fine-Tuning with Progressive Layer Freezing: Towards Robust Prediction of Bronchopulmonary Dysplasia from Day-1 Chest Radiographs in Extremely Preterm Infants | - | 0
FADE: Adversarial Concept Erasure in Flow Models | - | 0
Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos | - | 0
MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks | - | 0
Trustworthy Tree-based Machine Learning by MoS_2 Flash-based Analog CAM with Inherent Soft Boundaries | - | 0
Distributed Resilient State Estimation and Control with Strategically Implemented Security Measures | - | 0
SEPose: A Synthetic Event-based Human Pose Estimation Dataset for Pedestrian Monitoring | - | 0
Novel Approach to Dual-Channel Estimation in Integrated Sensing and Communications for 6G | - | 0
Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning | - | 0
Kevin: Multi-Turn RL for Generating CUDA Kernels | - | 0
Looking for Fairness in Recommender Systems | - | 0
FORTRESS: Function-composition Optimized Real-Time Resilient Structural Segmentation via Kolmogorov-Arnold Enhanced Spatial Attention Networks | Code | 0
Imbalanced Regression Pipeline Recommendation | Code | 0
PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning | Code | 0
CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels | Code | 0
Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints | Code | 0
InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing | Code | 1
Integrated Switched Capacitor Array and Synchronous Charge Extraction with Adaptive Hybrid MPPT for Piezoelectric Harvesters | - | 0
AFPM: Alignment-based Frame Patch Modeling for Cross-Dataset EEG Decoding | - | 0
Similarity-Guided Diffusion for Contrastive Sequential Recommendation | - | 0
MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM | - | 0
RegCL: Continual Adaptation of Segment Anything Model via Model Merging | - | 0
SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation | - | 0
A Fuzzy Approach to Project Success: Measuring What Matters | Code | 0
MindJourney: Test-Time Scaling with World Models for Spatial Reasoning | - | 0
Catching Bid-rigging Cartels with Graph Attention Neural Networks | - | 0
Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker | Code | 0
DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression | Code | 0
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation | Code | 0
Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions | Code | 0
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Code | 0
Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI | Code | 3
Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows | Code | 0
Watch, Listen, Understand, Mislead: Tri-modal Adversarial Attacks on Short Videos for Content Appropriateness Evaluation | - | 0
Learning What Matters: Probabilistic Task Selection via Mutual Information for Model Finetuning | - | 0
Choosing the Better Bandit Algorithm under Data Sharing: When Do A/B Experiments Work? | Code | 0
SpatialTrackerV2: 3D Point Tracking Made Easy | Code | 4
Assay2Mol: large language model-based drug design using BioAssay context | Code | 0
Describe Anything Model for Visual Question Answering on Text-rich Images | Code | 1
PhysX: Physical-Grounded 3D Asset Generation | Code | 3
Page 247 of 9486