SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2165121700 of 474278 papers

TitleStatusHype
Re-Mix: Optimizing Data Mixtures for Large Scale Imitation LearningCode1
Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language ModelsCode1
Uncovering Knowledge Gaps in Radiology Report Generation Models through Knowledge GraphsCode1
SelEx: Self-Expertise in Fine-Grained Generalized Category DiscoveryCode1
I2EBench: A Comprehensive Benchmark for Instruction-based Image EditingCode1
DIAGen: Diverse Image Augmentation with Generative ModelsCode1
LSM-YOLO: A Compact and Effective ROI Detector for Medical DetectionCode1
General targeted machine learning for modern causal mediation analysisCode1
Automated Machine Learning in InsuranceCode1
PHEVA: A Privacy-preserving Human-centric Video Anomaly Detection DatasetCode1
An Embedding is Worth a Thousand Noisy LabelsCode1
Efficient fine-tuning of 37-level GraphCast with the Canadian global deterministic analysisCode1
Revisiting Image Captioning Training Paradigm via Direct CLIP-based OptimizationCode1
Affine steerers for structured keypoint descriptionCode1
Text3DAug -- Prompted Instance Augmentation for LiDAR PerceptionCode1
1-Bit FQT: Pushing the Limit of Fully Quantized Training to 1-bitCode1
AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic FrameworkCode1
Spoken-Term Discovery using Discrete Speech UnitsCode1
MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image ClassificationCode1
SONICS: Synthetic Or Not -- Identifying Counterfeit SongsCode1
DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian SplattingCode1
A Lightweight Insulator Defect Detection Model Based on Drone ImagesCode1
CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper InfluenceCode1
GenFormer -- Generated Images are All You Need to Improve Robustness of Transformers on Small DatasetsCode1
LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRUCode1
LoG-VMamba: Local-Global Vision Mamba for Medical Image SegmentationCode1
Cascaded Temporal Updating Network for Efficient Video Super-ResolutionCode1
ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language ModelsCode1
Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action LocalizationCode1
CF-KAN: Kolmogorov-Arnold Network-based Collaborative Filtering to Mitigate Catastrophic Forgetting in Recommender SystemsCode1
LexBoost: Improving Lexical Document Retrieval with Nearest NeighborsCode1
Capturing Homogeneous Influence among Students: Hypergraph Cognitive Diagnosis for Intelligent Education SystemsCode1
Time Series Analysis for Education: Methods, Applications, and Future DirectionsCode1
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular DepthCode1
Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!Code1
Item-Difficulty-Aware Learning Path Recommendation: From a Real Walking PerspectiveCode1
Control-Informed Reinforcement Learning for Chemical ProcessesCode1
HRGraph: Leveraging LLMs for HR Data Knowledge Graphs with Information Propagation-based Job RecommendationCode1
Symbolic Working Memory Enhances Language Models for Complex Rule ApplicationCode1
Balancing Diversity and Risk in LLM Sampling: How to Select Your Method and Parameter for Open-Ended Text GenerationCode1
Localize-and-Stitch: Efficient Model Merging via Sparse Task ArithmeticCode1
Procedural Synthesis of Synthesizable MoleculesCode1
Variational Autoencoder for Anomaly Detection: A Comparative StudyCode1
Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion ModelCode1
ReactZyme: A Benchmark for Enzyme-Reaction PredictionCode1
Online Continuous Generalized Category DiscoveryCode1
ParGo: Bridging Vision-Language with Partial and Global ViewsCode1
Functional Tensor Decompositions for Physics-Informed Neural NetworksCode1
S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control PointsCode1
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation ModelsCode1
Show:102550
← PrevPage 434 of 9486Next →