The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers


Showing 3,876–3,900 of 661,570 papers

Title	Date	Tasks	Status	Hype
IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus	Feb 22, 2024	Zero-shot Generalization	Code Available	3
Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot	Feb 22, 2024	3D Human Pose Estimation, 3D Human Reconstruction	Code Available	3
OmniPred: Language Models as Universal Regressors	Feb 22, 2024	Experimental Design, regression	Code Available	3
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding	Feb 22, 2024	Computational Efficiency, Prediction	Code Available	3
Cleaner Pretraining Corpus Curation with Neural Web Scraping	Feb 22, 2024	Language Modeling, Language Modelling	Code Available	3
Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition	Feb 22, 2024	Re-Ranking, Visual Place Recognition	Code Available	3
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping	Feb 21, 2024	Decision Making, Decoder	Code Available	3
Towards Building Multilingual Language Model for Medicine	Feb 21, 2024	Domain Adaptation, Language Modeling	Code Available	3
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens	Feb 21, 2024	8k	Code Available	3
∞Bench: Extending Long Context Evaluation Beyond 100K Tokens	Feb 21, 2024		Code Available	3
Visual Style Prompting with Swapping Self-Attention	Feb 20, 2024	Denoising, Image Generation	Code Available	3
Video ReCap: Recursive Captioning of Hour-Long Videos	Feb 20, 2024	EgoSchema, Video Captioning	Code Available	3
TorchCP: A Python Library for Conformal Prediction	Feb 20, 2024	Conformal Prediction, Deep Learning	Code Available	3
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive	Feb 20, 2024		Code Available	3
Codec-SUPERB: An In-Depth Analysis of Sound Codec Models	Feb 20, 2024		Code Available	3
FiT: Flexible Vision Transformer for Diffusion Model	Feb 19, 2024	Computational Efficiency, Image Cropping	Code Available	3
A Chinese Dataset for Evaluating the Safeguards in Large Language Models	Feb 19, 2024		Code Available	3
UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction	Feb 19, 2024	Decision Making, Management	Code Available	3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation	Feb 19, 2024	Image Generation	Code Available	3
Language-Codec: Bridging Discrete Codec Representations and Speech Language Models	Feb 19, 2024	Audio Compression, Audio Generation	Code Available	3
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding	Feb 19, 2024		Code Available	3
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning	Feb 19, 2024		Code Available	3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations	Feb 19, 2024	Card Games, Logical Reasoning	Code Available	3
Major TOM: Expandable Datasets for Earth Observation	Feb 19, 2024	Earth Observation	Code Available	3
Query-Based Adversarial Prompt Generation	Feb 19, 2024	Language Modeling, Language Modelling	Code Available	3