SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3451-3500 of 177,340 papers

Title | Status | Hype
Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality | Code | 3
Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers | Code | 3
Spurious Rewards: Rethinking Training Signals in RLVR | Code | 3
MotionDirector: Motion Customization of Text-to-Video Diffusion Models | Code | 3
River: machine learning for streaming data in Python | Code | 3
Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design | Code | 3
Personalized Benchmarking with the Ludwig Benchmarking Toolkit | Code | 3
Deep symbolic regression for physics guided by units constraints: toward the automated discovery of physical laws | Code | 3
Large Language Models for Generative Information Extraction: A Survey | Code | 3
The Rise of Diffusion Models in Time-Series Forecasting | Code | 3
Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot | Code | 3
Segment Anything Model for Road Network Graph Extraction | Code | 3
RS-Mamba for Large Remote Sensing Image Dense Prediction | Code | 3
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions | Code | 3
Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer | Code | 3
SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction | Code | 3
MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds | Code | 3
Generative AI for Autonomous Driving: Frontiers and Opportunities | Code | 3
Understanding and Minimising Outlier Features in Neural Network Training | Code | 3
GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation | Code | 3
LoRA-GA: Low-Rank Adaptation with Gradient Approximation | Code | 3
Fast Matrix Multiplications for Lookup Table-Quantized LLMs | Code | 3
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization | Code | 3
OpenResearcher: Unleashing AI for Accelerated Scientific Research | Code | 3
REDUCIO! Generating 1024×1024 Video within 16 Seconds using Extremely Compressed Motion Latents | Code | 3
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer | Code | 3
Automating the Search for Artificial Life with Foundation Models | Code | 3
RadGPT: Constructing 3D Image-Text Tumor Datasets | Code | 3
Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval | Code | 3
Generalized Trajectory Scoring for End-to-end Multimodal Planning | Code | 3
General-Reasoner: Advancing LLM Reasoning Across All Domains | Code | 3
A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models | Code | 3
SNR-Aware Low-Light Image Enhancement | Code | 3
Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models | Code | 3
Scaling up Masked Diffusion Models on Text | Code | 3
A Phylogenetic Approach to Genomic Language Modeling | Code | 3
SplatFormer: Point Transformer for Robust 3D Gaussian Splatting | Code | 3
Automated Hypothesis Validation with Agentic Sequential Falsifications | Code | 3
BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes | Code | 3
The T05 System for The VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic Speech | Code | 3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Code | 3
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding | Code | 3
AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models | Code | 3
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark | Code | 3
MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion | Code | 3
Recurrent Drafter for Fast Speculative Decoding in Large Language Models | Code | 3
MAD-ICP: It Is All About Matching Data -- Robust and Informed LiDAR Odometry | Code | 3
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot | Code | 3
AER: Auto-Encoder with Regression for Time Series Anomaly Detection | Code | 3
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | Code | 3
Page 70 of 3547