SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 26512700 of 659983 papers

TitleStatusHype
A Survey on Human Interaction Motion GenerationCode3
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model CompressionCode3
A Survey on the Optimization of Large Language Model-based AgentsCode3
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU MemoryCode3
Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question AnsweringCode3
Falcon: A Remote Sensing Vision-Language Foundation ModelCode3
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation ModelCode3
GS-SDF: LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and ReconstructionCode3
PyGDA: A Python Library for Graph Domain AdaptationCode3
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and EditingCode3
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation SystemCode3
RFUAV: A Benchmark Dataset for Unmanned Aerial Vehicle Detection and IdentificationCode3
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action AlignmentCode3
BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse ScenesCode3
nnInteractive: Redefining 3D Promptable SegmentationCode3
Robust Latent Matters: Boosting Image Generation with Sampling ErrorCode3
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics AwarenessCode3
Motion Anything: Any to Motion GenerationCode3
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and ReasoningCode3
PE3R: Perception-Efficient 3D ReconstructionCode3
Automated Movie Generation via Multi-Agent CoT PlanningCode3
From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeersCode3
CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-ResolutionCode3
AA-CLIP: Enhancing Zero-shot Anomaly Detection via Anomaly-Aware CLIPCode3
Learning and discovering multiple solutions using physics-informed neural networks with random initialization and deep ensembleCode3
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and ImagesCode3
Rank-R1: Enhancing Reasoning in LLM-based Document Rerankers via Reinforcement LearningCode3
Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACsCode3
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous DrivingCode3
MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and AudioCode3
Simulating the Real World: A Unified Survey of Multimodal Generative ModelsCode3
SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey WritingCode3
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement LearningCode3
EgoLife: Towards Egocentric Life AssistantCode3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent SystemsCode3
All-atom Diffusion Transformers: Unified generative modelling of molecules and materialsCode3
OmniSQL: Synthesizing High-quality Text-to-SQL Data at ScaleCode3
Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly DetectionCode3
Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich ManipulationCode3
A Phylogenetic Approach to Genomic Language ModelingCode3
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for CodingCode3
Audio-Reasoner: Improving Reasoning Capability in Large Audio Language ModelsCode3
SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in StructuresCode3
MUSt3R: Multi-view Network for Stereo 3D ReconstructionCode3
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language InterfaceCode3
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRsCode3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting TrainingCode3
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset GenerationCode3
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory OptimizationCode3
Proteina: Scaling Flow-based Protein Structure Generative ModelsCode3
Show:102550
← PrevPage 54 of 13200Next →