SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2090120950 of 474278 papers

TitleStatusHype
Probabilistic Answer Set Programming with Discrete and Continuous Random VariablesCode1
Law of the Weakest Link: Cross Capabilities of Large Language ModelsCode1
Text Clustering as Classification with LLMsCode1
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and GeneralizabilityCode1
Volumetric Conditional Score-based Residual Diffusion Model for PET/MR DenoisingCode1
Camera Calibration using a Collimator SystemCode1
EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth PredictionCode1
PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological CounselingCode1
Delving Deep into Engagement Prediction of Short VideosCode1
ASQuery: A Query-based Model for Action SegmentationCode1
Physics-Regularized Multi-Modal Image Assimilation for Brain Tumor LocalizationCode1
ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-IdentificationCode1
SWIM: Short-Window CNN Integrated with Mamba for EEG-Based Auditory Spatial Attention DecodingCode1
IRFusionFormer: Enhancing Pavement Crack Segmentation with RGB-T Fusion and Topological-Based LossCode1
Towards Unified Multimodal Editing with Enhanced Knowledge CollaborationCode1
Enhancing High-order Interaction Awareness in LLM-based Recommender ModelCode1
Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models FunctionCode1
Basis-to-Basis Operator Learning Using Function EncodersCode1
Unified Gradient-Based Machine Unlearning with Remain Geometry EnhancementCode1
T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness RecognitionCode1
MASKDROID: Robust Android Malware Detection with Masked Graph RepresentationsCode1
BuildingView: Constructing Urban Building Exteriors Databases with Street View Imagery and Multimodal Large Language ModeCode1
Crafting Distribution Shifts for Validation and Training in Single Source Domain GeneralizationCode1
Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on DialoguesCode1
Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and ModelsCode1
Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and MethodCode1
DATransNet: Dynamic Attention Transformer Network for Infrared Small Target DetectionCode1
LoRKD: Low-Rank Knowledge Decomposition for Medical Foundation ModelsCode1
Evolving Multi-Scale Normalization for Time Series Forecasting under Distribution ShiftsCode1
2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language ModelsCode1
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document UnderstandingCode1
Hybrid Mamba for Few-Shot SegmentationCode1
All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path AggregationCode1
Gradient descent with adaptive stepsize converges (nearly) linearly under fourth-order growthCode1
Vision-Language Models are Strong Noisy Label DetectorsCode1
MCDDPM: Multichannel Conditional Denoising Diffusion Model for Unsupervised Anomaly Detection in Brain MRICode1
OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing ImagesCode1
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question AnsweringCode1
VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place RecognitionCode1
GS-EVT: Cross-Modal Event Camera Tracking based on Gaussian SplattingCode1
X-Prompt: Multi-modal Visual Prompt for Video Object SegmentationCode1
Summit Vitals: Multi-Camera and Multi-Signal Biosensing at High AltitudesCode1
SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language ModelsCode1
Analog In-Memory Computing Attention Mechanism for Fast and Energy-Efficient Large Language ModelsCode1
RMLR: Extending Multinomial Logistic Regression into General GeometriesCode1
A Confidence-Aware Matching Strategy For Generalized Multi-Object TrackingCode1
MECG-E: Mamba-based ECG Enhancer for Baseline Wander RemovalCode1
Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation ForecastingCode1
Mixture of Multicenter Experts in Multimodal Generative AI for Advanced Radiotherapy Target DelineationCode1
Relighting from a Single Image: Datasets and Deep Intrinsic-based ArchitectureCode1
Show:102550
← PrevPage 419 of 9486Next →