SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1030110350 of 661570 papers

TitleStatusHype
Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-IdentificationCode2
Health-LLM: Large Language Models for Health Prediction via Wearable Sensor DataCode2
Seg-metrics: a Python package to compute segmentation metricsCode2
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase PredictionCode2
Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imageryCode2
Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic SurgeryCode2
On the representation and methodology for wide and short range head pose estimationCode2
PartSTAD: 2D-to-3D Part Segmentation Task AdaptationCode2
Learn From Zoom: Decoupled Supervised Contrastive Learning For WCE Image ClassificationCode2
Cheetah: Bridging the Gap Between Machine Learning and Particle Accelerator Physics with High-Speed, Differentiable SimulationsCode2
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?Code2
Transformers are Multi-State RNNsCode2
UAVD4L: A Large-Scale Dataset for UAV 6-DoF LocalizationCode2
Transforming Image Super-Resolution: A ConvFormer-based Efficient ApproachCode2
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion RecognitionCode2
End-to-end Learnable Clustering for Intent Learning in RecommendationCode2
ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of VideoCode2
Graph-of-Thought: Utilizing Large Language Models to Solve Complex and Dynamic Business ProblemsCode2
Rethinking Test-time Likelihood: The Likelihood Path Principle and Its Application to OOD DetectionCode2
Singer Identity Representation Learning using Self-Supervised TechniquesCode2
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety TrainingCode2
InfiAgent-DABench: Evaluating Agents on Data Analysis TasksCode2
Real-time and Continuous Turn-taking Prediction Using Voice Activity ProjectionCode2
MTAD: Tools and Benchmarks for Multivariate Time Series Anomaly DetectionCode2
DebugBench: Evaluating Debugging Capability of Large Language ModelsCode2
RadarCam-Depth: Radar-Camera Fusion for Depth Estimation with Learned Metric ScaleCode2
PhilEO Bench: Evaluating Geo-Spatial Foundation ModelsCode2
Low-resource finetuning of foundation models beats state-of-the-art in histopathologyCode2
TechGPT-2.0: A large language model project to solve the task of knowledge graph constructionCode2
Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar CreationCode2
U-Mamba: Enhancing Long-range Dependency for Biomedical Image SegmentationCode2
Deep Covariance Alignment for Domain Adaptive Remote Sensing Image SegmentationCode2
LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly DetectionCode2
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table UnderstandingCode2
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent SystemsCode2
Low-light Image Enhancement via CLIP-Fourier Guided Wavelet DiffusionCode2
WidthFormer: Toward Efficient Transformer-based BEV View TransformationCode2
LLM4PLC: Harnessing Large Language Models for Verifiable Programming of PLCs in Industrial Control SystemsCode2
scDiffusion: conditional generation of high-quality single-cell data using diffusion modelCode2
Attack-Resilient Image Watermarking Using Stable DiffusionCode2
A Survey on 3D Gaussian SplattingCode2
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAMCode2
MARG: Multi-Agent Review Generation for Scientific PapersCode2
MS-DETR: Efficient DETR Training with Mixed SupervisionCode2
Multi-Modal Representation Learning for Molecular Property Prediction: Sequence, Graph, GeometryCode2
Grimoire is All You Need for Enhancing Large Language ModelsCode2
Agent AI: Surveying the Horizons of Multimodal InteractionCode2
Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning StrategyCode2
InFoBench: Evaluating Instruction Following Ability in Large Language ModelsCode2
Malla: Demystifying Real-world Large Language Model Integrated Malicious ServicesCode2
Show:102550
← PrevPage 207 of 13232Next →