SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 16351–16400 of 474278 papers

Title | Status | Hype
Towards Accurate and Interpretable Neuroblastoma Diagnosis via Contrastive Multi-scale Pathological Image Analysis | Code | 1
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations | Code | 1
p2smi: A Python Toolkit for Peptide FASTA-to-SMILES Conversion and Molecular Property Analysis | Code | 1
WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion | Code | 1
Learning to Attribute with Attention | Code | 1
Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling | Code | 1
Meta-Learning and Knowledge Discovery based Physics-Informed Neural Network for Remaining Useful Life Prediction | Code | 1
Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping | Code | 1
U-Shape Mamba: State Space Model for faster diffusion | Code | 1
A Deep Learning-Based Supervised Transfer Learning Framework for DOA Estimation with Array Imperfections | Code | 1
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning | Code | 1
Filter2Noise: Interpretable Self-Supervised Single-Image Denoising for Low-Dose CT with Attention-Guided Bilateral Filtering | Code | 1
Bayesian continual learning and forgetting in neural networks | Code | 1
Towards Scale-Aware Low-Light Enhancement via Structure-Guided Transformer Design | Code | 1
FocusTrack: A Self-Adaptive Local Sampling Algorithm for Efficient Anti-UAV Tracking | Code | 1
Compile Scene Graphs with Reinforcement Learning | Code | 1
HDBFormer: Efficient RGB-D Semantic Segmentation with A Heterogeneous Dual-Branch Framework | Code | 1
STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings | Code | 1
MIB: A Mechanistic Interpretability Benchmark | Code | 1
Hierarchical Feature Learning for Medical Point Clouds via State Space Model | Code | 1
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction | Code | 1
TTRD3: Texture Transfer Residual Denoising Dual Diffusion Model for Remote Sensing Image Super-Resolution | Code | 1
UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty | Code | 1
GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration | Code | 1
Hierarchical Vector Quantized Graph Autoencoder with Annealing-Based Code Selection | Code | 1
Mask Image Watermarking | Code | 1
Post-pre-training for Modality Alignment in Vision-Language Foundation Models | Code | 1
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results | Code | 1
Retrieval-Augmented Generation with Conflicting Evidence | Code | 1
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance | Code | 1
Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation | Code | 1
Building Russian Benchmark for Evaluation of Information Retrieval Models | Code | 1
Personalized Text-to-Image Generation with Auto-Regressive Models | Code | 1
CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented Generation | Code | 1
Graph Learning at Scale: Characterizing and Optimizing Pre-Propagation GNNs | Code | 1
Collaborative Perception Datasets for Autonomous Driving: A Review | Code | 1
TimeCapsule: Solving the Jigsaw Puzzle of Long-Term Time Series Forecasting with Compressed Predictive Representations | Code | 1
Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs | Code | 1
Towards Lossless Token Pruning in Late-Interaction Retrieval Models | Code | 1
Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond | Code | 1
Data-efficient LLM Fine-tuning for Code Generation | Code | 1
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models | Code | 1
Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Code | 1
SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding | Code | 1
ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition | Code | 1
NTIRE 2025 Challenge on Event-Based Image Deblurring: Methods and Results | Code | 1
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Code | 1
VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate | Code | 1
RadMamba: Efficient Human Activity Recognition through Radar-based Micro-Doppler-Oriented Mamba State-Space Model | Code | 1
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT? | Code | 1
Page 328 of 9486