SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1340113450 of 474278 papers

TitleStatusHype
TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer TokensCode2
BUMBLE: Unifying Reasoning and Acting with Vision-Language Models for Building-wide Mobile ManipulationCode2
Phenaki: Variable Length Video Generation From Open Domain Textual DescriptionCode2
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)Code2
Training-Free Consistent Text-to-Image GenerationCode2
Efficient Differentiable Simulation of Articulated BodiesCode2
DiffBP: Generative Diffusion of 3D Molecules for Target Protein BindingCode2
Multi-Interest Network with Dynamic Routing for Recommendation at TmallCode2
Deconstructing Denoising Diffusion Models for Self-Supervised LearningCode2
UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned PolicyCode2
Fine-Grained Stochastic Architecture SearchCode2
ScanNet++: A High-Fidelity Dataset of 3D Indoor ScenesCode2
Foundational Challenges in Assuring Alignment and Safety of Large Language ModelsCode2
Is Weakly-supervised Action Segmentation Ready For Human-Robot Interaction? No, Let's Improve It With Action-union LearningCode2
Understanding and Mitigating Toxicity in Image-Text Pretraining Datasets: A Case Study on LLaVACode2
Parameter is Not All You Need: Starting from Non-Parametric Networks for 3D Point Cloud AnalysisCode2
Inference-Time Scaling for Complex Tasks: Where We Stand and What Lies AheadCode2
VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?Code2
Reducing Transformer Key-Value Cache Size with Cross-Layer AttentionCode2
InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANsCode2
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite ImageryCode2
Flexible Isosurface Extraction for Gradient-Based Mesh OptimizationCode2
Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust Depth PredictionCode2
TongUI: Building Generalized GUI Agents by Learning from Multimodal Web TutorialsCode2
Unified Structure Generation for Universal Information ExtractionCode2
Customization Assistant for Text-to-image GenerationCode2
IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait SynthesisCode2
PiEEG-16 to Measure 16 EEG Channels with Raspberry Pi for Brain-Computer Interfaces and EEG devicesCode2
GotenNet: Rethinking Efficient 3D Equivariant Graph Neural NetworksCode2
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts LayerCode2
Voice Separation with an Unknown Number of Multiple SpeakersCode2
Differentiable Augmentation for Data-Efficient GAN TrainingCode2
MaskLLM: Learnable Semi-Structured Sparsity for Large Language ModelsCode2
ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive FeedbackCode2
MPNet: Masked and Permuted Pre-training for Language UnderstandingCode2
Task-Customized Mixture of Adapters for General Image FusionCode2
Adaptive Multi-Scale Decomposition Framework for Time Series ForecastingCode2
PyGAD: An Intuitive Genetic Algorithm Python LibraryCode2
Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback LearningCode2
Modular Primitives for High-Performance Differentiable RenderingCode2
Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive ReviewCode2
LidarDM: Generative LiDAR Simulation in a Generated WorldCode2
ClipCap: CLIP Prefix for Image CaptioningCode2
End to End Learning for Self-Driving CarsCode2
Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and TrackingCode2
L4acados: Learning-based models for acados, applied to Gaussian process-based predictive controlCode2
MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian SplattingCode2
Virgo: A Preliminary Exploration on Reproducing o1-like MLLMCode2
Neural Speech Synthesis with Transformer NetworkCode2
End-To-End Memory NetworksCode2
Show:102550
← PrevPage 269 of 9486Next →