SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 18451-18500 of 474278 papers

Title | Status | Hype
Unsupervised Evaluation of Interactive Dialog with DialoGPT | Code | 1
Joint Trajectory and Passive Beamforming Design for Intelligent Reflecting Surface-Aided UAV Communications: A Deep Reinforcement Learning Approach | Code | 1
Re-ReND: Real-time Rendering of NeRFs across Devices | Code | 1
Learning Gradient Fields for Shape Generation | Code | 1
Twitter Corpus of the #BlackLivesMatter Movement And Counter Protests: 2013 to 2021 | Code | 1
"Other-Play" for Zero-Shot Coordination | Code | 1
Unmasking the Mask -- Evaluating Social Biases in Masked Language Models | Code | 1
HINT3: Raising the bar for Intent Detection in the Wild | Code | 1
Evolutionary Generation of Visual Motion Illusions | Code | 1
Nearly Optimal Robust Subspace Tracking | Code | 1
Pareto Set Learning for Neural Multi-objective Combinatorial Optimization | Code | 1
BRepNet: A topological message passing system for solid models | Code | 1
"Why Should I Trust You?": Explaining the Predictions of Any Classifier | Code | 1
A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation | Code | 1
VideoCon: Robust Video-Language Alignment via Contrast Captions | Code | 1
Agent with Warm Start and Active Termination for Plane Localization in 3D Ultrasound | Code | 1
Sparse Adversarial Attack to Object Detection | Code | 1
PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference | Code | 1
Contrastive Code Representation Learning | Code | 1
Regularizing Meta-Learning via Gradient Dropout | Code | 1
PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers | Code | 1
Surrogate Neural Networks Local Stability for Aircraft Predictive Maintenance | Code | 1
Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism | Code | 1
A Comparison of 1-D and 2-D Deep Convolutional Neural Networks in ECG Classification | Code | 1
Accelerating Quadratic Optimization with Reinforcement Learning | Code | 1
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding | Code | 1
A Bayesian algorithm for retrosynthesis | Code | 1
Score-based diffusion models for accelerated MRI | Code | 1
Millimeter-wave Mobile Sensing and Environment Mapping: Models, Algorithms and Validation | Code | 1
Is Image-to-Image Translation the Panacea for Multimodal Image Registration? A Comparative Study | Code | 1
Bad Characters: Imperceptible NLP Attacks | Code | 1
Template Filling with Generative Transformers | Code | 1
Contrastive Neural Architecture Search with Neural Architecture Comparators | Code | 1
TabularBench: Benchmarking Adversarial Robustness for Tabular Deep Learning in Real-world Use-cases | Code | 1
MOLUCINATE: A Generative Model for Molecules in 3D Space | Code | 1
EPIC30M: An Epidemics Corpus Of Over 30 Million Relevant Tweets | Code | 1
Convolutional 2D Knowledge Graph Embeddings | Code | 1
CRFL: Certifiably Robust Federated Learning against Backdoor Attacks | Code | 1
DiRA: Discriminative, Restorative, and Adversarial Learning for Self-supervised Medical Image Analysis | Code | 1
Estimating leverage scores via rank revealing methods and randomization | Code | 1
RDA: Robust Domain Adaptation via Fourier Adversarial Attacking | Code | 1
A Structural Model for Contextual Code Changes | Code | 1
CLICKER: A Computational LInguistics Classification Scheme for Educational Resources | Code | 1
Data Augmentation for Scene Text Recognition | Code | 1
Selective Differential Privacy for Language Modeling | Code | 1
Self-supervised Pseudo Multi-class Pre-training for Unsupervised Anomaly Detection and Segmentation in Medical Images | Code | 1
Exposure Trajectory Recovery from Motion Blur | Code | 1
A Channel Coding Benchmark for Meta-Learning | Code | 1
Federated Mutual Learning | Code | 1
SD-DefSLAM: Semi-Direct Monocular SLAM for Deformable and Intracorporeal Scenes | Code | 1