SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 13351-13400 of 177340 papers

Title | Status | Hype
PatternRank: Leveraging Pretrained Language Models and Part of Speech for Unsupervised Keyphrase Extraction | Code | 2
Snuffy: Efficient Whole Slide Image Classifier | Code | 2
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models | Code | 2
Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning | Code | 2
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search | Code | 2
CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels | Code | 2
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders | Code | 2
A User's Guide to KSig: GPU-Accelerated Computation of the Signature Kernel | Code | 2
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering | Code | 2
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents | Code | 2
Towards Localized Fine-Grained Control for Facial Expression Generation | Code | 2
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? | Code | 2
HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields | Code | 2
Efficient Per-Example Gradient Computations | Code | 2
LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection | Code | 2
GAUDI: A Neural Architect for Immersive 3D Scene Generation | Code | 2
MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation | Code | 2
Off-the-Grid MARL: Datasets with Baselines for Offline Multi-Agent Reinforcement Learning | Code | 2
JoLT: Joint Probabilistic Predictions on Tabular Data Using LLMs | Code | 2
NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes | Code | 2
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | Code | 2
Unsupervised Medical Image Translation with Adversarial Diffusion Models | Code | 2
Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Code | 2
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Code | 2
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models | Code | 2
GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models | Code | 2
Dynamic GNNs for Precise Seizure Detection and Classification from EEG Data | Code | 2
Wav-KAN: Wavelet Kolmogorov-Arnold Networks | Code | 2
SirLLM: Streaming Infinite Retentive LLM | Code | 2
LAR-IQA: A Lightweight, Accurate, and Robust No-Reference Image Quality Assessment Model | Code | 2
BERTrend: Neural Topic Modeling for Emerging Trends Detection | Code | 2
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Code | 2
GTA1: GUI Test-time Scaling Agent | Code | 2
Scaling Rich Style-Prompted Text-to-Speech Datasets | Code | 2
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Code | 2
MAGO-SP: Detection and Correction of Water-Fat Swaps in Magnitude-Only VIBE MRI | Code | 2
Datasets and Benchmarks for Offline Safe Reinforcement Learning | Code | 2
DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation | Code | 2
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization | Code | 2
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising | Code | 2
Recent Advances in OOD Detection: Problems and Approaches | Code | 2
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study | Code | 2
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models | Code | 2
Distributed Global Structure-from-Motion with a Deep Front-End | Code | 2
Solver-in-the-Loop: Learning from Differentiable Physics to Interact with Iterative PDE-Solvers | Code | 2
FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | Code | 2
Remote Bio-Sensing: Open Source Benchmark Framework for Fair Evaluation of rPPG | Code | 2
Stream of Search (SoS): Learning to Search in Language | Code | 2
TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank | Code | 2
Tracking Anything in High Quality | Code | 2