SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 97519775 of 474278 papers

TitleStatusHype
CCNeXt: An Effective Self-Supervised Stereo Depth Estimation ApproachCode0
Convolutional Set TransformerCode0
TY-RIST: Tactical YOLO Tricks for Real-time Infrared Small Target DetectionCode0
A Two-Stage Strategy for Mitosis Detection Using Improved YOLO11x Proposals and ConvNeXt ClassificationCode0
Neptune-X: Active X-to-Maritime Generation for Universal Maritime Object DetectionCode0
Gradient-based multi-focus image fusion with focus-aware saliency enhancementCode0
AutoIntent: AutoML for Text Classification0
X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought ReasoningCode0
Enrich-on-Graph: Query-Graph Alignment for Complex Reasoning with LLM EnrichingCode0
In AI Sweet Harmony: Sociopragmatic Guardrail Bypasses and Evaluation-Awareness in OpenAI gpt-oss-20bCode0
IoT-MCP: Bridging LLMs and IoT Systems Through Model Context ProtocolCode0
WDformer: A Wavelet-based Differential Transformer Model for Time Series ForecastingCode0
Residual Off-Policy RL for Finetuning Behavior Cloning Policies0
AUDDT: Audio Unified Deepfake Detection Benchmark ToolkitCode0
VISION: Prompting Ocean Vertical Velocity Reconstruction from Incomplete ObservationsCode0
Understanding and Enhancing Mask-Based Pretraining towards Universal RepresentationsCode0
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open ResourcesCode0
RIS-LAD: A Benchmark and Model for Referring Low-Altitude Drone Image SegmentationCode0
Unlocking Financial Insights: An advanced Multimodal Summarization with Multimodal Output Framework for Financial Advisory VideosCode0
Searching for Privacy Risks in LLM Agents via Simulation0
CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts0
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models0
CLIPin: A Non-contrastive Plug-in to CLIP for Multimodal Semantic AlignmentCode0
LIMI: Less is More for Agency0
MAPO: Mixed Advantage Policy Optimization0
Show:102550
← PrevPage 391 of 18972Next →