The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10676–10700 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs	Feb 16, 2024	Quantization	CodeCode Available	2	5
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding	Feb 21, 2024	Text Generation	CodeCode Available	2	5
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap	Feb 29, 2024	Math	CodeCode Available	2	5
DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly	Feb 29, 2024	DenoisingGraph Neural Network	CodeCode Available	2	5
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques	Mar 9, 2024	Knowledge GraphsLong Form Question Answering	CodeCode Available	2	5
Scalable Spatiotemporal Prediction with Bayesian Neural Fields	Mar 12, 2024	Bayesian InferenceDemand Forecasting	CodeCode Available	2	5
BirdSet: A Large-Scale Dataset for Audio Classification in Avian Bioacoustics	Mar 15, 2024	Audio ClassificationClassification	CodeCode Available	2	5
View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network	Mar 21, 2024	Person Re-Identification	CodeCode Available	2	5
Volumetric Environment Representation for Vision-Language Navigation	Mar 21, 2024	3D geometryMulti-Task Learning	CodeCode Available	2	5
CoverUp: Effective High Coverage Test Generation for Python	Mar 24, 2024	software testing	CodeCode Available	2	5
MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion	Apr 12, 2024	Image ReconstructionMamba	CodeCode Available	2	5
FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining	Apr 15, 2024	MambaRain Removal	CodeCode Available	2	5
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond	May 23, 2024	3D Object Detectionobject-detection	CodeCode Available	2	5
Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image Restoration	May 5, 2024	Color Image DenoisingImage Restoration	CodeCode Available	2	5
Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?	Apr 10, 2024		CodeCode Available	2	5
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices	May 20, 2024	Image GenerationVideo Editing	CodeCode Available	2	5
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code	May 24, 2024		CodeCode Available	2	5
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation	May 28, 2024	Machine Translationspeech-recognition	CodeCode Available	2	5
UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training	Feb 23, 2022	Question Answering	CodeCode Available	2	5
Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model	Apr 23, 2024	3D Point Cloud ClassificationMamba	CodeCode Available	2	5
A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Speech Translation	Jun 11, 2024	DecoderSimultaneous Speech-to-Speech Translation	CodeCode Available	2	5
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models	Jun 17, 2024	Benchmarking	CodeCode Available	2	5
VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation	Jul 4, 2024		CodeCode Available	2	5
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance	Jul 9, 2024	BenchmarkingConditional Image Generation	CodeCode Available	2	5
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models	Aug 4, 2024		CodeCode Available	2	5