SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1865118700 of 474278 papers

TitleStatusHype
OmniStereo: Real-time Omnidireactional Depth Estimation with Multiview Fisheye CamerasCode1
Revisiting Generative Replay for Class Incremental Object DetectionCode1
FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few ImagesCode1
Octopus: Alleviating Hallucination via Dynamic Contrastive DecodingCode1
Blood Flow Speed Estimation with Optical Coherence Tomography Angiography ImagesCode1
Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM GreaterCode1
Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question AnsweringCode1
DV-Matcher: Deformation-based Non-rigid Point Cloud Matching Guided by Pre-trained Visual FeaturesCode1
Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image CaptioningCode1
Point Cloud Upsampling Using Conditional Diffusion Module with Adaptive Noise SuppressionCode1
SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot SegmentationCode1
TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language EnvironmentCode1
KAE: Kolmogorov-Arnold Auto-Encoder for Representation LearningCode1
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsCode1
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothingCode1
Lightweight G-YOLOv11: Advancing Efficient Fracture Detection in Pediatric Wrist X-raysCode1
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code GenerationCode1
A novel deep learning approach for facial emotion recognition: application to detecting emotional responses in elderly individuals with Alzheimer’s diseaseCode1
Insights on Galaxy Evolution from Interpretable Sparse Feature NetworksCode1
Low-Light Image Enhancement via Generative Perceptual PriorsCode1
Plancraft: an evaluation dataset for planning with LLM agentsCode1
A Large-Scale Study on Video Action Dataset CondensationCode1
Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain ReasonerCode1
Length-Aware DETR for Robust Moment RetrievalCode1
Facilitating large language model Russian adaptation with Learned Embedding PropagationCode1
PyG-SSL: A Graph Self-Supervised Learning ToolkitCode1
Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive DefenseCode1
TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning DistillationCode1
Frequency-Masked Embedding Inference: A Non-Contrastive Approach for Time Series Representation LearningCode1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based SearchCode1
TrajLearn: Trajectory Prediction Learning using Deep Generative ModelsCode1
Visual Style Prompt Learning Using Diffusion Models for Blind Face RestorationCode1
DDIM sampling for Generative AIBIM, a faster intelligent structural design frameworkCode1
Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)Code1
Training-free Heterogeneous Model MergingCode1
ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video UnderstandingCode1
EraseAnything: Enabling Concept Erasure in Rectified Flow TransformersCode1
FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian PerturbationCode1
Diminishing Return of Value Expansion MethodsCode1
FreqMixFormerV2: Lightweight Frequency-aware Mixed Transformer for Human Skeleton Action RecognitionCode1
Stochastic gradient descent estimation of generalized matrix factorization models with application to single-cell RNA sequencing dataCode1
PTQ4VM: Post-Training Quantization for Visual MambaCode1
Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic ManipulationCode1
The Fifth International Verification of Neural Networks Competition (VNN-COMP 2024): Summary and ResultsCode1
TeLU Activation Function for Fast and Stable Deep LearningCode1
BaiJia: A Large-Scale Role-Playing Agent Corpus of Chinese Historical CharactersCode1
M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation EvaluationCode1
SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object DetectionCode1
On the Compositional Generalization of Multimodal LLMs for Medical ImagingCode1
Federated Unlearning with Gradient Descent and Conflict MitigationCode1
Show:102550
← PrevPage 374 of 9486Next →