SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 25512600 of 177339 papers

TitleStatusHype
Stereo Anywhere: Robust Zero-Shot Deep Stereo Matching Even Where Either Stereo or Mono FailCode3
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture ModelingCode3
Tensorized NeuroEvolution of Augmenting Topologies for GPU AccelerationCode3
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference ChecklistCode3
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose RepresentationCode3
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation ModelsCode3
Poseidon: Efficient Foundation Models for PDEsCode3
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language ModelCode3
Quantifying the robustness of deep multispectral segmentation models against natural perturbations and data poisoningCode3
pix2gestalt: Amodal Segmentation by Synthesizing WholesCode3
Paint Bucket Colorization Using Anime Character Color Design SheetsCode3
Vary: Scaling up the Vision Vocabulary for Large Vision-Language ModelsCode3
SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous DrivingCode3
EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose EstimationCode3
NGD-SLAM: Towards Real-Time Dynamic SLAM without GPUCode3
Highly Accurate Quantum Chemical Property Prediction with Uni-Mol+Code3
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingCode3
ZigMa: A DiT-style Zigzag Mamba Diffusion ModelCode3
Searching for Best Practices in Retrieval-Augmented GenerationCode3
Evaluating representation learning on the protein structure universeCode3
Proxy Denoising for Source-Free Domain AdaptationCode3
FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with MambaCode3
OceanGPT: A Large Language Model for Ocean Science TasksCode3
3D-LLM: Injecting the 3D World into Large Language ModelsCode3
Fast Feedforward 3D Gaussian Splatting CompressionCode3
Scaling Laws for Fine-Grained Mixture of ExpertsCode3
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition ToolkitCode3
A Review of Large Language Models and Autonomous Agents in ChemistryCode3
Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous CircleCode3
Accelerating Diffusion Transformers with Token-wise Feature CachingCode3
One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment LocomotionCode3
skscope: Fast Sparsity-Constrained Optimization in PythonCode3
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi DecodingCode3
Repeat After Me: Transformers are Better than State Space Models at CopyingCode3
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place RecognitionCode3
Towards Universal Soccer Video UnderstandingCode3
Self-Rectifying Diffusion Sampling with Perturbed-Attention GuidanceCode3
Temporal Graph Analysis with TGXCode3
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based AgentsCode3
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement LearningCode3
Halton Scheduler For Masked Generative Image TransformerCode3
Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio DistanceCode3
iNatAg: Multi-Class Classification Models Enabled by a Large-Scale Benchmark Dataset with 4.7M Images of 2,959 Crop and Weed SpeciesCode3
Q-Bench+: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to PairsCode3
SemDeDup: Data-efficient learning at web-scale through semantic deduplicationCode3
PlainMamba: Improving Non-Hierarchical Mamba in Visual RecognitionCode3
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language ModelsCode3
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image GenerationCode3
DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile ManipulationCode3
Universal Language Model Fine-tuning for Text ClassificationCode3
Show:102550
← PrevPage 52 of 3547Next →