SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 68016850 of 661570 papers

TitleStatusHype
HairCLIPv2: Unifying Hair Editing via Proxy Feature BlendingCode2
Turning a CLIP Model into a Scene Text DetectorCode2
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text TranslationCode2
Moving Object Segmentation in Point Cloud Data using Hidden Markov ModelsCode2
ChangeViT: Unleashing Plain Vision Transformers for Change DetectionCode2
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image SegmentationCode2
TroL: Traversal of Layers for Large Language and Vision ModelsCode2
One-for-More: Continual Diffusion Model for Anomaly DetectionCode2
GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising DiffusionCode2
EmoBench: Evaluating the Emotional Intelligence of Large Language ModelsCode2
Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI ImagesCode2
Text2HOI: Text-guided 3D Motion Generation for Hand-Object InteractionCode2
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human ReferencesCode2
ICDAR 2021 Competition on Scientific Literature ParsingCode2
Scale-invariant Learning by Physics InversionCode2
Probabilistic Warp Consistency for Weakly-Supervised Semantic CorrespondencesCode2
Differentiable Voxelization and Mesh MorphingCode2
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency TradingCode2
Visual Text Processing: A Comprehensive Review and Unified EvaluationCode2
QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video ComprehensionCode2
Transformers learn in-context by gradient descentCode2
Meent: Differentiable Electromagnetic Simulator for Machine LearningCode2
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video UnderstandingCode2
Video Super-Resolution Transformer with Masked Inter&Intra-Frame AttentionCode2
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly DetectionCode2
4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural NetworksCode2
Plug and Play Language Models: A Simple Approach to Controlled Text GenerationCode2
Third Time's the Charm? Image and Video Editing with StyleGAN3Code2
PPI++: Efficient Prediction-Powered InferenceCode2
Adapting Segment Anything Model for Change Detection in HR Remote Sensing ImagesCode2
Black-Box Tuning for Language-Model-as-a-ServiceCode2
Managing FAIR Knowledge Graphs as Polyglot Data End Points: A Benchmark based on the rdf2pg Framework and Plant Biology DataCode2
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less HallucinationCode2
Spatio-Temporal Self-Supervised Learning for Traffic Flow PredictionCode2
Contrastive Decoding: Open-ended Text Generation as OptimizationCode2
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text SpottingCode2
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised LearningCode2
DreamBench++: A Human-Aligned Benchmark for Personalized Image GenerationCode2
FreshLLMs: Refreshing Large Language Models with Search Engine AugmentationCode2
EpiLearn: A Python Library for Machine Learning in Epidemic ModelingCode2
3D Point Cloud Compression with Recurrent Neural Network and Image Compression MethodsCode2
It Takes Two to Tango: Directly Optimizing for Constrained Synthesizability in Generative Molecular DesignCode2
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource AllocationCode2
NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape EstimationCode2
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image UnderstandingCode2
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion ModelsCode2
DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view RepresentationCode2
MG-LLaVA: Towards Multi-Granularity Visual Instruction TuningCode2
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language ModelsCode2
Scattered Mixture-of-Experts ImplementationCode2
Show:102550
← PrevPage 137 of 13232Next →