The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3976–4000 of 661570 papers

Title	Date	Tasks	Status	Hype
BlackMamba: Mixture of Experts for State-Space Models	Feb 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy	Feb 1, 2024	Neural Rendering	CodeCode Available	3
StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time Rendering	Feb 1, 2024	Novel View Synthesis	CodeCode Available	3
PirateNets: Physics-informed Deep Learning with Residual Adaptive Networks	Feb 1, 2024	Deep Learning	CodeCode Available	3
Repeat After Me: Transformers are Better than State Space Models at Copying	Feb 1, 2024	State Space Models	CodeCode Available	3
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization	Jan 31, 2024	GPUQuantization	CodeCode Available	3
LongAlign: A Recipe for Long Context Alignment of Large Language Models	Jan 31, 2024	DiversityInstruction Following	CodeCode Available	3
Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation	Jan 31, 2024	Hierarchical Text Segmentationparameter-efficient fine-tuning	CodeCode Available	3
Common Sense Reasoning for Deepfake Detection	Jan 31, 2024	Binary ClassificationCommon Sense Reasoning	CodeCode Available	3
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models	Jan 30, 2024		CodeCode Available	3
MuSc: Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled Images	Jan 30, 2024	Anomaly ClassificationAnomaly Detection	CodeCode Available	3
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models	Jan 30, 2024	Self-Supervised LearningSpeaker Recognition	CodeCode Available	3
When Large Language Models Meet Vector Databases: A Survey	Jan 30, 2024	HallucinationInformation Retrieval	CodeCode Available	3
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models	Jan 30, 2024	Knowledge Base ConstructionQuestion Answering	CodeCode Available	3
Corrective Retrieval Augmented Generation	Jan 29, 2024	RAGRetrieval	CodeCode Available	3
DeFlow: Decoder of Scene Flow Network in Autonomous Driving	Jan 29, 2024	Autonomous DrivingDecoder	CodeCode Available	3
StableIdentity: Inserting Anybody into Anywhere at First Sight	Jan 29, 2024	3D Generation	CodeCode Available	3
FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather Forecasting	Jan 28, 2024	Weather Forecasting	CodeCode Available	3
BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry	Jan 28, 2024		CodeCode Available	3
A Practical Probabilistic Benchmark for AI Weather Models	Jan 27, 2024	DiagnosticWeather Forecasting	CodeCode Available	3
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries	Jan 27, 2024	BenchmarkingRAG	CodeCode Available	3
Scientific Large Language Models: A Survey on Biological & Chemical Domains	Jan 26, 2024	scientific discoverySurvey	CodeCode Available	3
SliceGPT: Compress Large Language Models by Deleting Rows and Columns	Jan 26, 2024		CodeCode Available	3
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design	Jan 25, 2024	GPUQuantization	CodeCode Available	3
pix2gestalt: Amodal Segmentation by Synthesizing Wholes	Jan 25, 2024	3D ReconstructionObject Recognition	CodeCode Available	3