SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers; 248,326 code links; 4,818 tasks

Papers

Showing 11901–11950 of 661570 papers

Title | Status | Hype
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications | Code | 2
Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective | Code | 2
Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching | Code | 2
Learned Image Compression with Mixed Transformer-CNN Architectures | Code | 2
Label-Free Liver Tumor Segmentation | Code | 2
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis | Code | 2
SimpleNet: A Simple Network for Image Anomaly Detection and Localization | Code | 2
High-fidelity 3D Human Digitization from Single 2K Resolution Images | Code | 2
CelebV-Text: A Large-Scale Facial Text-Video Dataset | Code | 2
Learning Generative Structure Prior for Blind Text Image Super-resolution | Code | 2
WinCLIP: Zero-/Few-Shot Anomaly Classification and Segmentation | Code | 2
GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents | Code | 2
OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering | Code | 2
You Only Segment Once: Towards Real-Time Panoptic Segmentation | Code | 2
Human Preference Score: Better Aligning Text-to-Image Models with Human Preference | Code | 2
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels | Code | 2
PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters | Code | 2
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies | Code | 2
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer | Code | 2
Conditional Image-to-Video Generation with Latent Flow Diffusion Models | Code | 2
TRAK: Attributing Model Behavior at Scale | Code | 2
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation | Code | 2
Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | Code | 2
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale | Code | 2
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning | Code | 2
FedGH: Heterogeneous Federated Learning with Generalized Global Header | Code | 2
Towards Better Dynamic Graph Learning: New Architecture and Unified Library | Code | 2
Masked Image Training for Generalizable Deep Image Denoising | Code | 2
NOPE: Novel Object Pose Estimation from a Single Image | Code | 2
ReVersion: Diffusion-Based Relation Inversion from Images | Code | 2
Neural Preset for Color Style Transfer | Code | 2
Learning Human-Inspired Force Strategies for Robotic Assembly | Code | 2
Dense Distinct Query for End-to-End Object Detection | Code | 2
SHERF: Generalizable Human NeRF from a Single Image | Code | 2
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval | Code | 2
The Shaky Foundations of Clinical Foundation Models: A Survey of Large Language Models and Foundation Models for EMRs | Code | 2
ExBEHRT: Extended Transformer for Electronic Health Records to Predict Disease Subtypes & Progressions | Code | 2
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions | Code | 2
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation | Code | 2
Spherical Transformer for LiDAR-based 3D Recognition | Code | 2
Emotionally Enhanced Talking Face Generation | Code | 2
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Code | 2
Detecting Everything in the Open World: Towards Universal Object Detection | Code | 2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Code | 2
An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Code | 2
Learning A Sparse Transformer Network for Effective Image Deraining | Code | 2
3D Human Mesh Estimation from Virtual Markers | Code | 2
BigSmall: Efficient Multi-Task Learning for Disparate Spatial and Temporal Physiological Measurements | Code | 2
Large AI Models in Health Informatics: Applications, Challenges, and the Future | Code | 2
Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion | Code | 2