SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 81018125 of 474278 papers

TitleStatusHype
Diffusion Models and Representation Learning: A SurveyCode2
Learning Formal Mathematics From Intrinsic MotivationCode2
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image GenerationCode2
Hyperparameter Optimization for Randomized Algorithms: A Case Study on Random FeaturesCode2
Diving Deeper Into Pedestrian Behavior Understanding: Intention Estimation, Action Prediction, and Event Risk AssessmentCode2
PerAct2: Benchmarking and Learning for Robotic Bimanual Manipulation TasksCode2
UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization ProblemsCode2
Teola: Towards End-to-End Optimization of LLM-based ApplicationsCode2
Multimodal Prototyping for cancer survival predictionCode2
Odd-One-Out: Anomaly Detection by Comparing with NeighborsCode2
Text2Robot: Evolutionary Robot Design from Text DescriptionsCode2
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent CollaborationCode2
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based AgentsCode2
Efficient Large Multi-modal Models via Visual Context CompressionCode2
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMsCode2
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache ManagementCode2
Chat AI: A Seamless Slurm-Native Solution for HPC-Based ServicesCode2
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangementCode2
Computational Life: How Well-formed, Self-replicating Programs Emerge from Simple InteractionCode2
T-FREE: Subword Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient EmbeddingsCode2
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language ModelsCode2
DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time VariabilityCode2
RoboUniView: Visual-Language Model with Unified View Representation for Robotic ManipulationCode2
AnyControl: Create Your Artwork with Versatile Control on Text-to-Image GenerationCode2
Efficient World Models with Context-Aware TokenizationCode2
Show:102550
← PrevPage 325 of 18972Next →