SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2180121850 of 474278 papers

TitleStatusHype
A toolbox for calculating objective image properties in aesthetics researchCode1
V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?Code1
TDS-CLIP: Temporal Difference Side Network for Image-to-Video Transfer LearningCode1
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
AIR: Analytic Imbalance Rectifier for Continual LearningCode1
Structure-preserving Image Translation for Depth Estimation in Colonoscopy VideoCode1
SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action RecognitionCode1
PolypDB: A Curated Multi-Center Dataset for Development of AI Algorithms in ColonoscopyCode1
Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and AlgorithmsCode1
CLIPCleaner: Cleaning Noisy Labels with CLIPCode1
TDNetGen: Empowering Complex Network Resilience Prediction with Generative Augmentation of Topology and DynamicsCode1
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language ModelsCode1
Contextual Importance and Utility in Python: New Functionality and Insights with the py-ciu PackageCode1
FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis AssistantCode1
ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image EnhancementCode1
Unsupervised Composable Representations for AudioCode1
A Dataset for Mechanical MechanismsCode1
Deep-MacroFin: Informed Equilibrium Neural Network for Continuous Time Economic ModelsCode1
PinnDE: Physics-Informed Neural Networks for Solving Differential EquationsCode1
Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment EnhancementCode1
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic ModelsCode1
OccMamba: Semantic Occupancy Prediction with State Space ModelsCode1
TaSL: Continual Dialog State Tracking via Task Skill Localization and ConsolidationCode1
"Image, Tell me your story!" Predicting the original meta-context of visual misinformationCode1
SAM-UNet:Enhancing Zero-Shot Segmentation of SAM for Universal Medical ImagesCode1
Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit AdjustmentCode1
Customizing Language Models with Instance-wise LoRA for Sequential RecommendationCode1
Goldfish: Monolingual Language Models for 350 LanguagesCode1
Implicit Grid Convolution for Multi-Scale Image Super-ResolutionCode1
Facial Wrinkle Segmentation for Cosmetic Dermatology: Pretraining with Texture Map-Based Weak SupervisionCode1
Uncertainty Quantification of Surrogate Models using Conformal PredictionCode1
Parkinson's Disease Classification via EEG: All You Need is a Single Convolutional LayerCode1
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE InferenceCode1
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEditCode1
Harnessing Multimodal Large Language Models for Multimodal Sequential RecommendationCode1
Dynamic Label Injection for Imbalanced Industrial Defect SegmentationCode1
GNN-Empowered Effective Partial Observation MARL Method for AoI Management in Multi-UAV NetworkCode1
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language ModelCode1
G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric PriorsCode1
Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language ModelsCode1
Distinguish Confusion in Legal Judgment Prediction via Revised Relation KnowledgeCode1
VrdONE: One-stage Video Visual Relation DetectionCode1
Unsupervised Change Detection Based on Image Reconstruction Loss with Segment AnythingCode1
Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion RecognitionCode1
GitHub is an effective platform for collaborative and reproducible laboratory researchCode1
Flemme: A Flexible and Modular Learning Platform for Medical ImagesCode1
Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuningCode1
Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image RestorationCode1
PADetBench: Towards Benchmarking Physical Attacks against Object DetectionCode1
Are CLIP features all you need for Universal Synthetic Image Origin Attribution?Code1
Show:102550
← PrevPage 437 of 9486Next →