SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 16151–16200 of 474,278 papers

Title | Status | Hype
RAIL: Region-Aware Instructive Learning for Semi-Supervised Tooth Segmentation in CBCT | Code | 1
Geospatial Mechanistic Interpretability of Large Language Models | Code | 1
Learning-based Homothetic Tube MPC | Code | 1
OSUniverse: Benchmark for Multimodal GUI-navigation AI Agents | Code | 1
Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation | Code | 1
Panoramic Out-of-Distribution Segmentation | Code | 1
Blending 3D Geometry and Machine Learning for Multi-View Stereopsis | Code | 1
Fixed-Length Dense Fingerprint Representation | Code | 1
Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map | Code | 1
WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch | Code | 1
IndicSQuAD: A Comprehensive Multilingual Question Answering Dataset for Indic Languages | Code | 1
CombiBench: Benchmarking LLM Capability for Combinatorial Mathematics | Code | 1
Framework GNN-AID: Graph Neural Network Analysis Interpretation and Defense | Code | 1
Multi-View Learning with Context-Guided Receptance for Image Denoising | Code | 1
fastabx: A library for efficient computation of ABX discriminability | Code | 1
Token Coordinated Prompt Attention is Needed for Visual Prompting | Code | 1
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale | Code | 1
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL | Code | 1
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code | Code | 1
SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning | Code | 1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | Code | 1
Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing | Code | 1
Towards Quantifying the Hessian Structure of Neural Networks | Code | 1
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations | Code | 1
AutoLibra: Agent Metric Induction from Open-Ended Feedback | Code | 1
Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models | Code | 1
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering | Code | 1
Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection | Code | 1
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing | Code | 1
NTIRE 2025 Challenge on UGC Video Enhancement: Methods and Results | Code | 1
CASA: CNN Autoencoder-based Score Attention for Efficient Multivariate Long-term Time-series Forecasting | Code | 1
Adaptive Thinking via Mode Policy Optimization for Social Language Agents | Code | 1
RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video | Code | 1
Cricket: A Self-Powered Chirping Pixel | Code | 1
Small Clips, Big Gains: Learning Long-Range Refocused Temporal Information for Video Super-Resolution | Code | 1
ProDisc-VAD: An Efficient System for Weakly-Supervised Anomaly Detection in Video Surveillance Applications | Code | 1
Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data | Code | 1
HybridGS: High-Efficiency Gaussian Splatting Data Compression using Dual-Channel Sparse Representation and Point Cloud Encoder | Code | 1
Accelerating Volumetric Medical Image Annotation via Short-Long Memory SAM 2 | Code | 1
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Code | 1
An LSTM-PINN Hybrid Method to the specific problem of population forecasting | Code | 1
Morello: Compiling Fast Neural Networks with Dynamic Programming and Spatial Compression | Code | 1
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis | Code | 1
2DXformer: Dual Transformers for Wind Power Forecasting with Dual Exogenous Variables | Code | 1
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding | Code | 1
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment | Code | 1
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | Code | 1
Autonomous Embodied Agents: When Robotics Meets Deep Learning Reasoning | Code | 1
Carbon Aware Transformers Through Joint Model-Hardware Optimization | Code | 1
SpectrumFM: A Foundation Model for Intelligent Spectrum Management | Code | 1
Page 324 of 9486