SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1845118500 of 474278 papers

TitleStatusHype
Dynamics-incorporated Modeling Framework for Stability Constrained Scheduling Under High-penetration of Renewable EnergyCode1
MS-Temba : Multi-Scale Temporal Mamba for Efficient Temporal Action DetectionCode1
DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific InformationCode1
From Mesh Completion to AI Designed CrownCode1
Improving Zero-Shot Object-Level Change Detection by Incorporating Visual CorrespondenceCode1
Uncertainty-aware Knowledge TracingCode1
Battling the Non-stationarity in Time Series Forecasting via Test-time AdaptationCode1
AnCoGen: Analysis, Control and Generation of Speech with a Masked AutoencoderCode1
AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR DataCode1
IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression SegmentationCode1
VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language ModelsCode1
A Flexible and Scalable Framework for Video Moment SearchCode1
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue ResolutionCode1
Continuous Knowledge-Preserving Decomposition for Few-Shot Continual LearningCode1
Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant LearningCode1
Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and BenchmarksCode1
Load Forecasting for Households and Energy Communities: Are Deep Learning Models Worth the Effort?Code1
SensorQA: A Question Answering Benchmark for Daily-Life MonitoringCode1
Compression with Global Guidance: Towards Training-free High-Resolution MLLMs AccelerationCode1
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano TranscriptionCode1
Solving the Catastrophic Forgetting Problem in Generalized Category DiscoveryCode1
Demystifying Domain-adaptive Post-training for Financial LLMsCode1
Progressive Supervision via Label Decomposition: An Long-Term and Large-Scale Wireless Traffic Forecasting MethodCode1
Plug-and-Play DISep: Separating Dense Instances for Scene-to-Pixel Weakly-Supervised Change Detection in High-Resolution Remote Sensing ImagesCode1
Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D VolumesCode1
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition BenchmarkCode1
EDMB: Edge Detector with MambaCode1
S2 Chunking: A Hybrid Framework for Document Segmentation Through Integrated Spatial and Semantic AnalysisCode1
ContextMRI: Enhancing Compressed Sensing MRI through Metadata ConditioningCode1
Rethinking High-speed Image Reconstruction Framework with Spike CameraCode1
Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-PromptingCode1
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMsCode1
Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP SupervisionCode1
DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion ModelsCode1
Neural Parameter Estimation with Incomplete DataCode1
Eve: Efficient Multimodal Vision Language Models with Elastic Visual ExpertsCode1
Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware InversionCode1
Histologic Dataset of Normal and Atypical Mitotic Figures on Human Breast Cancer (AMi-Br)Code1
Online Gaussian Test-Time Adaptation of Vision-Language ModelsCode1
DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional ApplicationsCode1
Implicit Guidance and Explicit Representation of Semantic Information in Points Cloud: A SurveyCode1
Can LLMs Design Good Questions Based on Context?Code1
RecKG: Knowledge Graph for Recommender SystemsCode1
Entropy-Guided Attention for Private LLMsCode1
Stochastic Process Learning via Operator Flow MatchingCode1
LM-Net: A Light-weight and Multi-scale Network for Medical Image SegmentationCode1
Dual-level Adaptive Incongruity-enhanced Model for Multimodal Sarcasm DetectionCode1
FgC2F-UDiff: Frequency-guided and Coarse-to-fine Unified Diffusion Model for Multi-modality Missing MRI SynthesisCode1
Unsupervised Speech Segmentation: A General Approach Using Speech Language ModelsCode1
VLM-driven Behavior Tree for Context-aware Task PlanningCode1
Show:102550
← PrevPage 370 of 9486Next →