SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 93519400 of 177340 papers

TitleStatusHype
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMsCode2
SimPhony: A Device-Circuit-Architecture Cross-Layer Modeling and Simulation Framework for Heterogeneous Electronic-Photonic AI SystemCode2
Abstractive Summarization of Spoken andWritten Instructions with BERTCode2
Controlling Length in Image CaptioningCode2
An Inverse Scaling Law for CLIP TrainingCode2
Recipro-CAM: Fast gradient-free visual explanations for convolutional neural networksCode2
Focal Loss for Dense Object DetectionCode2
A Synthetic Dataset for Personal Attribute InferenceCode2
Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy VideoCode2
Q-Insight: Understanding Image Quality via Visual Reinforcement LearningCode2
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion ModelsCode2
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement LearningCode2
Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakesCode2
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object DetectionCode2
Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at ScaleCode2
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question AnsweringCode2
Swin Transformer: Hierarchical Vision Transformer using Shifted WindowsCode2
Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency ConsistencyCode2
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series ForecastingCode2
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
ABodyBuilder3: Improved and scalable antibody structure predictionsCode2
TrustRAG: Enhancing Robustness and Trustworthiness in RAGCode2
Scaling Language-Image Pre-training via MaskingCode2
LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and CosmologyCode2
TODS: An Automated Time Series Outlier Detection SystemCode2
LLMs in the Imaginarium: Tool Learning through Simulated Trial and ErrorCode2
A Survey of Machine UnlearningCode2
Dynamic Spatial Propagation Network for Depth CompletionCode2
OctoThinker: Mid-training Incentivizes Reinforcement Learning ScalingCode2
Perception Test: A Diagnostic Benchmark for Multimodal Video ModelsCode2
RITA: a Study on Scaling Up Generative Protein Sequence ModelsCode2
Multi-target stain normalization for histology slidesCode2
MedS^3: Towards Medical Small Language Models with Self-Evolved Slow ThinkingCode2
Long-term Frame-Event Visual Tracking: Benchmark Dataset and BaselineCode2
ChaCha for Online AutoMLCode2
Graph-based Topology Reasoning for Driving ScenesCode2
SegFix: Model-Agnostic Boundary Refinement for SegmentationCode2
Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view ReconstructionCode2
TrafficGPT: An LLM Approach for Open-Set Encrypted Traffic ClassificationCode2
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language ModelsCode2
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language ModelsCode2
Map It Anywhere (MIA): Empowering Bird's Eye View Mapping using Large-scale Public DataCode2
Probability density estimation for sets of large graphs with respect to spectral information using stochastic block modelsCode2
MTAD: Tools and Benchmarks for Multivariate Time Series Anomaly DetectionCode2
One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text PromptsCode2
OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape GenerationCode2
LeanDojo: Theorem Proving with Retrieval-Augmented Language ModelsCode2
LongVLM: Efficient Long Video Understanding via Large Language ModelsCode2
Geometry-Informed Neural NetworksCode2
MOROCCO: Model Resource Comparison FrameworkCode2
Show:102550
← PrevPage 188 of 3547Next →