SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 95019550 of 661570 papers

TitleStatusHype
NetTrack: Tracking Highly Dynamic Objects with a NetCode2
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion ModelCode2
Unified Generative Modeling of 3D Molecules via Bayesian Flow NetworksCode2
BrightDreamer: Generic 3D Gaussian Generative Framework for Fast Text-to-3D SynthesisCode2
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of DataCode2
Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation frameworkCode2
Neural Markov Random Field for Stereo MatchingCode2
CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown DegradationsCode2
SelfIE: Self-Interpretation of Large Language Model EmbeddingsCode2
HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object DetectionCode2
DarkGS: Learning Neural Illumination and 3D Gaussians Relighting for Robotic Exploration in the DarkCode2
MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in ConversationsCode2
Boosting Flow-based Generative Super-Resolution Models via Learned PriorCode2
A Comprehensive Study of Multimodal Large Language Models for Image Quality AssessmentCode2
Fast Sparse View Guided NeRF Update for Object ReconfigurationsCode2
ScanTalk: 3D Talking Heads from Unregistered ScansCode2
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy ProjectionsCode2
NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge DevicesCode2
Revisiting Adversarial Training under Long-Tailed DistributionsCode2
Hybrid Convolutional and Attention Network for Hyperspectral Image DenoisingCode2
Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance PrimitivesCode2
Uni-SMART: Universal Science Multimodal Analysis and Research TransformerCode2
Generative Region-Language Pretraining for Open-Ended Object DetectionCode2
Isotropic3D: Image-to-3D Generation Based on a Single CLIP EmbeddingCode2
A Survey on Game Playing Agents and Large Models: Methods, Applications, and ChallengesCode2
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language ModelsCode2
MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument LeakageCode2
Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-IdentificationCode2
VideoAgent: Long-form Video Understanding with Large Language Model as AgentCode2
BirdSet: A Large-Scale Dataset for Audio Classification in Avian BioacousticsCode2
Learning Spatiotemporal Inconsistency via Thumbnail Layout for Face Deepfake DetectionCode2
Robust Shape Fitting for 3D Scene AbstractionCode2
RCooper: A Real-world Large-scale Dataset for Roadside Cooperative PerceptionCode2
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic GraspingCode2
PosSAM: Panoptic Open-vocabulary Segment AnythingCode2
What Was Your Prompt? A Remote Keylogging Attack on AI AssistantsCode2
Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data ScarcityCode2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Easy-to-Hard Generalization: Scalable Alignment Beyond Human SupervisionCode2
An Image Is Worth 1000 Lies: Adversarial Transferability across Prompts on Vision-Language ModelsCode2
OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor EnvironmentsCode2
VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image SegmentationCode2
Faceptor: A Generalist Model for Face PerceptionCode2
Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative InferenceCode2
AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield PromptingCode2
Hyper-3DG: Text-to-3D Gaussian Generation via HypergraphCode2
RAGGED: Towards Informed Design of Retrieval Augmented Generation SystemsCode2
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space ModelsCode2
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-ExpertsCode2
CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise ClassificationCode2
Show:102550
← PrevPage 191 of 13232Next →