SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 18801–18850 of 474278 papers

Title | Status | Hype
Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models | Code | 1
3D Detection and Characterisation of ALMA Sources through Deep Learning | Code | 1
Mutual Distillation Learning For Person Re-Identification | Code | 1
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey | Code | 1
Introducing the VoicePrivacy Initiative | Code | 1
Self-supervised Monocular Underwater Depth Recovery, Image Restoration, and a Real-sea Video Dataset | Code | 1
Referring Multi-Object Tracking | Code | 1
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning | Code | 1
Next Generation Loss Function for Image Classification | Code | 1
Steward: Natural Language Web Automation | Code | 1
CLIP-Adapter: Better Vision-Language Models with Feature Adapters | Code | 1
3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration | Code | 1
Invariant Collaborative Filtering to Popularity Distribution Shift | Code | 1
Evaluating Unsupervised Text Classification: Zero-shot and Similarity-based Approaches | Code | 1
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs | Code | 1
Single-Domain Generalized Object Detection in Urban Scene via Cyclic-Disentangled Self-Distillation | Code | 1
E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation | Code | 1
Higher-order Coreference Resolution with Coarse-to-fine Inference | Code | 1
Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset | Code | 1
Graph Matching with Bi-level Noisy Correspondence | Code | 1
DagSim: Combining DAG-based model structure with unconstrained data types and relations for flexible, transparent, and modularized data simulation | Code | 1
Prototype-Driven and Multi-Expert Integrated Multi-Modal MR Brain Tumor Image Segmentation | Code | 1
Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models | Code | 1
Tisane: Authoring Statistical Models via Formal Reasoning from Conceptual and Data Relationships | Code | 1
MAD-AD: Masked Diffusion for Unsupervised Brain Anomaly Detection | Code | 1
Tackling Long-Tailed Category Distribution Under Domain Shifts | Code | 1
CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation | Code | 1
Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Code | 1
MetaMetrics-MT: Tuning Meta-Metrics for Machine Translation via Human Preference Calibration | Code | 1
ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Code | 1
Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity | Code | 1
Learning Accurate Dense Correspondences and When to Trust Them | Code | 1
Patent Image Retrieval Using Cross-entropy-based Metric Learning | Code | 1
LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering | Code | 1
fastMRI: An Open Dataset and Benchmarks for Accelerated MRI | Code | 1
CausPref: Causal Preference Learning for Out-of-Distribution Recommendation | Code | 1
Fine-Grained Semantically Aligned Vision-Language Pre-Training | Code | 1
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation | Code | 1
TSGBench: Time Series Generation Benchmark | Code | 1
Learnable Polyphase Sampling for Shift Invariant and Equivariant Convolutional Networks | Code | 1
Sampling-free Inference for Ab-Initio Potential Energy Surface Networks | Code | 1
Choices, Risks, and Reward Reports: Charting Public Policy for Reinforcement Learning Systems | Code | 1
Can Pre-trained Language Models Interpret Similes as Smart as Human? | Code | 1
Blind Motion Deblurring with Pixel-Wise Kernel Estimation via Kernel Prediction Networks | Code | 1
GPTutor: a ChatGPT-powered programming tool for code explanation | Code | 1
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models | Code | 1
Addressing Maximization Bias in Reinforcement Learning with Two-Sample Testing | Code | 1
A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER | Code | 1
Exploring the Individuality and Collectivity of Intents behind Interactions for Graph Collaborative Filtering | Code | 1
HumanGif: Single-View Human Diffusion with Generative Prior | Code | 1