The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 6,926–6,950 of 474,278 papers

Title	Date	Tasks	Status	Hype
Adaptive Length Image Tokenization via Recurrent Allocation	Nov 4, 2024	Decoder	Code Available	2
Combining Induction and Transduction for Abstract Reasoning	Nov 4, 2024	ARC, Program Synthesis	Code Available	2
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector	Nov 4, 2024	Decoder, Emotional Speech Synthesis	Code Available	2
Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing	Nov 4, 2024	Computational Efficiency, GPU	Code Available	2
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments	Nov 4, 2024		Code Available	2
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance	Nov 4, 2024	Caption Generation, Multiple-choice	Code Available	2
Attacking Vision-Language Computer Agents via Pop-ups	Nov 4, 2024		Code Available	2
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation	Nov 4, 2024	Earth Observation, Object	Code Available	2
Learning General-Purpose Biomedical Volume Representations using Randomized Synthesis	Nov 4, 2024	Contrastive Learning, Diversity	Code Available	2
Training on test proteins improves fitness, structure, and function prediction	Nov 4, 2024	Prediction, Protein Structure Prediction	Code Available	2
INQUIRE: A Natural World Text-to-Image Retrieval Benchmark	Nov 4, 2024	Image Retrieval, Reranking	Code Available	2
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution	Nov 4, 2024	GPU, Robot Manipulation	Code Available	2
RAGViz: Diagnose and Visualize Retrieval-Augmented Generation	Nov 4, 2024	Answer Generation, GPU	Code Available	2
Mapping Global Floods with 10 Years of Satellite Radar Data	Nov 3, 2024	Disaster Response	Code Available	2
GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation	Nov 2, 2024	Imitation Learning	Code Available	2
Unlocking the Archives: Using Large Language Models to Transcribe Handwritten Historical Documents	Nov 2, 2024	Handwritten Text Recognition, HTR	Code Available	2
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios	Nov 2, 2024	Denoising	Code Available	2
On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR	Nov 1, 2024	3D Semantic Segmentation, Autonomous Driving	Code Available	2
A Survey of Financial AI: Architectures, Advances and Open Challenges	Nov 1, 2024	Decision Making, Portfolio Optimization	Code Available	2
Communication Learning in Multi-Agent Systems from Graph Modeling Perspective	Nov 1, 2024		Code Available	2
Toward Automated Algorithm Design: A Survey and Practical Guide to Meta-Black-Box-Optimization	Nov 1, 2024	Computational Efficiency, In-Context Learning	Code Available	2
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models	Nov 1, 2024	Mixture-of-Experts	Code Available	2
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs	Oct 31, 2024	Knowledge Graphs, Language Modeling	Code Available	2
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators	Oct 31, 2024	Benchmarking, Text Generation	Code Available	2
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Oct 31, 2024	3D Object Detection, Depth Estimation	Code Available	2