SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1500115050 of 474278 papers

TitleStatusHype
Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual TrackingCode1
Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorchCode1
A foundation model with multi-variate parallel attention to generate neuronal activityCode1
Q-resafe: Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language ModelsCode1
WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI WorkloadsCode1
DPLib: A Standard Benchmark Library for Distributed Power System Analysis and OptimizationCode1
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized CollaborationCode1
Augmenting Multi-Agent Communication with State Delta TrajectoryCode1
Self-Supervised Multimodal NeRF for Autonomous DrivingCode1
HERCULES: Hierarchical Embedding-based Recursive Clustering Using LLMs for Efficient SummarizationCode1
SMARTIES: Spectrum-Aware Multi-Sensor Auto-Encoder for Remote Sensing ImagesCode1
Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language ModelsCode1
ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World ModelCode1
Elucidated Rolling Diffusion Models for Probabilistic Weather ForecastingCode1
MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility ApplicationsCode1
KnowRL: Exploring Knowledgeable Reinforcement Learning for FactualityCode1
EvDetMAV: Generalized MAV Detection from Moving Event CamerasCode1
Fast and Distributed Equivariant Graph Neural Networks by Virtual Node LearningCode1
Introducing EG-IPT and ipt~: a novel electric guitar dataset and a new Max/MSP object for real-time classification of instrumental playing techniquesCode1
Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and ArchitectureCode1
EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson RegressionCode1
LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMRCode1
NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness AnalysisCode1
The Within-Orbit Adaptive Leapfrog No-U-Turn SamplerCode1
Riemannian generative decoderCode1
Morse: Dual-Sampling for Lossless Acceleration of Diffusion ModelsCode1
MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene RepresentationCode1
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and DiagnosisCode1
Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detectionCode1
Parallel Continuous Chain-of-Thought with Jacobi IterationCode1
Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss FunctionsCode1
What You Think Is What You Get: Bridge User Intent and Transfer Function Design through Multimodal Large Language ModelsCode1
Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear AttentionCode1
CommVQ: Commutative Vector Quantization for KV Cache CompressionCode1
Taming Vision-Language Models for Medical Image Analysis: A Comprehensive ReviewCode1
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked ModelingCode1
DIP: Unsupervised Dense In-Context Post-training of Visual RepresentationsCode1
LIGHTHOUSE: Fast and precise distance to shoreline calculations from anywhere on earthCode1
Evolving Prompts In-Context: An Open-ended, Self-replicating PerspectiveCode1
h-calibration: Rethinking Classifier Recalibration with Probabilistic Error-Bounded ObjectiveCode1
MiCo: Multiple Instance Learning with Context-Aware Clustering for Whole Slide Image AnalysisCode1
OmniESI: A unified framework for enzyme-substrate interaction prediction with progressive conditional deep learningCode1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For DrivingCode1
AbRank: A Benchmark Dataset and Metric-Learning Framework for Antibody-Antigen Affinity RankingCode1
ConsumerBench: Benchmarking Generative AI Applications on End-User DevicesCode1
TeXpert: A Multi-Level Benchmark for Evaluating LaTeX Code Generation by LLMsCode1
TextBraTS: Text-Guided Volumetric Brain Tumor Segmentation with Innovative Dataset Development and Fusion Module ExplorationCode1
Visual-Instructed Degradation Diffusion for All-in-One Image RestorationCode1
Mesh-Informed Neural Operator : A Transformer Generative ApproachCode1
A Minimalist Optimizer Design for LLM PretrainingCode1
Show:102550
← PrevPage 301 of 9486Next →