SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 13801–13850 of 474278 papers

Title | Status | Hype
Few-Shot Bearing Fault Diagnosis Via Ensembling Transformer-Based Model With Mahalanobis Distance Metric Learning From Multiscale Features | Code | 2
DGFont++: Robust Deformable Generative Networks for Unsupervised Font Generation | Code | 2
YOLOv5-6D: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | Code | 2
Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs | Code | 2
Analysing the Residual Stream of Language Models Under Knowledge Conflicts | Code | 2
JAILJUDGE: A Comprehensive Jailbreak Judge Benchmark with Multi-Agent Enhanced Explanation Evaluation Framework | Code | 2
Hypergraph Neural Networks | Code | 2
Peeling Back the Layers: An In-Depth Evaluation of Encoder Architectures in Neural News Recommenders | Code | 2
Efficient Non-stationary Online Learning by Wavelets with Applications to Online Distribution Shift Adaptation | Code | 2
ViSpeak: Visual Instruction Feedback in Streaming Videos | Code | 2
SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery | Code | 2
Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 Model | Code | 2
Detection Transformer with Stable Matching | Code | 2
Chain-of-Thought Reasoning Without Prompting | Code | 2
Domain Adaptation with a Single Vision-Language Embedding | Code | 2
An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval | Code | 2
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation | Code | 2
Prototype-based Cross-Modal Object Tracking | Code | 2
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer | Code | 2
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models | Code | 2
1st Place Solution of Multiview Egocentric Hand Tracking Challenge ECCV2024 | Code | 2
C^2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation | Code | 2
Region Rebalance for Long-Tailed Semantic Segmentation | Code | 2
NLLB-CLIP -- train performant multilingual image retrieval model on a budget | Code | 2
TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis | Code | 2
Gaussian Processes for Big Data | Code | 2
DetGPT: Detect What You Need via Reasoning | Code | 2
HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling | Code | 2
GAIA: a benchmark for General AI Assistants | Code | 2
WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects | Code | 2
Seeing through Satellite Images at Street Views | Code | 2
Large Language Models are In-Context Molecule Learners | Code | 2
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models | Code | 2
Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent | Code | 2
Deduplicating Training Data Mitigates Privacy Risks in Language Models | Code | 2
RandAugment: Practical automated data augmentation with a reduced search space | Code | 2
Mamba-R: Vision Mamba ALSO Needs Registers | Code | 2
The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs) | Code | 2
Structured Denoising Diffusion Models in Discrete State-Spaces | Code | 2
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models | Code | 2
Neural Responding Machine for Short-Text Conversation | Code | 2
Neural Lander: Stable Drone Landing Control using Learned Dynamics | Code | 2
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate | Code | 2
Scaling up Differentially Private Deep Learning with Fast Per-Example Gradient Clipping | Code | 2
Interpreting the Latent Space of GANs for Semantic Face Editing | Code | 2
Improving RetinaNet for CT Lesion Detection with Dense Masks from Weak RECIST Labels | Code | 2
NeuralUQ: A comprehensive library for uncertainty quantification in neural differential equations and operators | Code | 2
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Code | 2
Double Difference Earthquake Location with Graph Neural Networks | Code | 2
A Library for Representing Python Programs as Graphs for Machine Learning | Code | 2