SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 67016750 of 177340 papers

TitleStatusHype
Masked Face Recognition Dataset and ApplicationCode2
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language ModelsCode2
Semantic Image Synthesis via Diffusion ModelsCode2
Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image EditingCode2
Generating 3D Molecules for Target Protein BindingCode2
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex ManipulationCode2
Isotropic Correlation Models for the Cross-Section of Equity ReturnsCode2
Large Language Model with Region-guided Referring and Grounding for CT Report GenerationCode2
QAEncoder: Towards Aligned Representation Learning in Question Answering SystemCode2
Neural-Driven Image EditingCode2
Rethinking Negative Instances for Generative Named Entity RecognitionCode2
Act3D: 3D Feature Field Transformers for Multi-Task Robotic ManipulationCode2
Space Group Informed Transformer for Crystalline Materials GenerationCode2
SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing SegmentationCode2
Fourier Neural Operator with Learned Deformations for PDEs on General GeometriesCode2
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud ProviderCode2
Deep Video Prior for Video Consistency and PropagationCode2
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient SparsityCode2
Towards Large-Scale Training of Pathology Foundation ModelsCode2
Explicit Differentiable Slicing and Global Deformation for Cardiac Mesh ReconstructionCode2
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based TrainingCode2
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse RewardsCode2
Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion ModelsCode2
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-ExpertsCode2
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation ModelsCode2
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with InstructionsCode2
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent SystemCode2
MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow EstimationCode2
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling RatesCode2
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding ModelCode2
RecDiffusion: Rectangling for Image Stitching with Diffusion ModelsCode2
APEBench: A Benchmark for Autoregressive Neural Emulators of PDEsCode2
PALO: A Polyglot Large Multimodal Model for 5B PeopleCode2
Continual Test-Time Domain AdaptationCode2
Taming Data and Transformers for Audio GenerationCode2
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary DetectionCode2
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AICode2
Sample-Efficient Diffusion for Text-To-Speech SynthesisCode2
ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context ModelCode2
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object DetectionCode2
PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery GamesCode2
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian ProcessCode2
FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic PredictionCode2
AnySat: One Earth Observation Model for Many Resolutions, Scales, and ModalitiesCode2
LLM Processes: Numerical Predictive Distributions Conditioned on Natural LanguageCode2
BYOL for Audio: Exploring Pre-trained General-purpose Audio RepresentationsCode2
Learning local equivariant representations for quantum operatorsCode2
Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action SegmentationCode2
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and contextCode2
ByT5 model for massively multilingual grapheme-to-phoneme conversionCode2
Show:102550
← PrevPage 135 of 3547Next →