The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10951–11000 of 661570 papers

Title	Date	Tasks	Status	Hype
GRID: A Platform for General Robot Intelligence Development	Oct 2, 2023		CodeCode Available	2
PatchMixer: A Patch-Mixing Architecture for Long-Term Time Series Forecasting	Oct 1, 2023	Time SeriesTime Series Forecasting	CodeCode Available	2
RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models	Oct 1, 2023	Benchmarking	CodeCode Available	2
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion	Oct 1, 2023	DenoisingImage Generation	CodeCode Available	2
Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants	Oct 1, 2023	Instruction Following	CodeCode Available	2
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists	Sep 30, 2023	Depth EstimationImage Generation	CodeCode Available	2
Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process	Sep 29, 2023	Change Data GenerationChange Detection	CodeCode Available	2
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training	Sep 29, 2023	Decision MakingLanguage Modeling	CodeCode Available	2
GAIA-1: A Generative World Model for Autonomous Driving	Sep 29, 2023	Autonomous Driving	CodeCode Available	2
Graph-based Neural Weather Prediction for Limited Area Modeling	Sep 29, 2023	Weather Forecasting	CodeCode Available	2
nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance	Sep 29, 2023	Few-Shot LearningHeart Segmentation	CodeCode Available	2
Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering	Sep 29, 2023	Image to textPassage Retrieval	CodeCode Available	2
Directly Fine-Tuning Diffusion Models on Differentiable Rewards	Sep 29, 2023		CodeCode Available	2
One for All: Towards Training One Graph Model for All Classification Tasks	Sep 29, 2023	AllGraph Classification	CodeCode Available	2
UXsim: An open source macroscopic and mesoscopic traffic simulator in Python -- a technical overview	Sep 29, 2023		CodeCode Available	2
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets	Sep 29, 2023	Language ModellingMathematical Reasoning	CodeCode Available	2
Denoising Diffusion Bridge Models	Sep 29, 2023	DenoisingImage Generation	CodeCode Available	2
Transformer-VQ: Linear-Time Transformers via Vector Quantization	Sep 28, 2023	8kDecoder	CodeCode Available	2
LawBench: Benchmarking Legal Knowledge of Large Language Models	Sep 28, 2023	ArticlesBenchmarking	CodeCode Available	2
ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers	Sep 28, 2023	GPUInstruction Following	CodeCode Available	2
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models	Sep 28, 2023	10-shot image generation1 Image, 2*2 Stitchi	CodeCode Available	2
MEM: Multi-Modal Elevation Mapping for Robotics and Learning	Sep 28, 2023	ColorizationGPU	CodeCode Available	2
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond	Sep 28, 2023	Benchmarking	CodeCode Available	2
Text-to-3D using Gaussian Splatting	Sep 28, 2023	3D GenerationText to 3D	CodeCode Available	2
RLLTE: Long-Term Evolution Project of Reinforcement Learning	Sep 28, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Cross-Prediction-Powered Inference	Sep 28, 2023	Decision MakingMissing Labels	CodeCode Available	2
MHG-GNN: Combination of Molecular Hypergraph Grammar with Graph Neural Network	Sep 28, 2023	Graph Neural NetworkPrediction	CodeCode Available	2
Deep Geometrized Cartoon Line Inbetweening	Sep 28, 2023		CodeCode Available	2
OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs	Sep 27, 2023		CodeCode Available	2
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization	Sep 27, 2023	Contrastive Learninggeo-localization	CodeCode Available	2
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future	Sep 27, 2023	Navigate	CodeCode Available	2
NeuRBF: A Neural Fields Representation with Adaptive Radial Basis Functions	Sep 27, 2023		CodeCode Available	2
Effective Long-Context Scaling of Foundation Models	Sep 27, 2023	Continual PretrainingLanguage Modeling	CodeCode Available	2
A Content-Driven Micro-Video Recommendation Dataset at Scale	Sep 27, 2023	BenchmarkingRecommendation Systems	CodeCode Available	2
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning	Sep 26, 2023	BenchmarkingMulti-Objective Reinforcement Learning	CodeCode Available	2
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models	Sep 26, 2023	Information RetrievalReranking	CodeCode Available	2
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline	Sep 26, 2023	Knowledge DistillationObject Tracking	CodeCode Available	2
ProteinInvBench: Benchmarking Protein Inverse Folding on Diverse Tasks, Models, and Metrics	Sep 26, 2023		CodeCode Available	2
M^4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models	Sep 26, 2023		CodeCode Available	2
PIXIU: A Comprehensive Benchmark, Instruction Dataset and Large Language Model for Finance	Sep 26, 2023		CodeCode Available	2
ICML 2023 Topological Deep Learning Challenge : Design and Results	Sep 26, 2023	Deep Learning	CodeCode Available	2
ProteinGym: Large-Scale Benchmarks for Protein Fitness Prediction and Design	Sep 26, 2023	Mutational/Variant Effect Prediction	CodeCode Available	2
Joint Audio and Speech Understanding	Sep 25, 2023		CodeCode Available	2
Detecting and Grounding Multi-Modal Media Manipulation and Beyond	Sep 25, 2023	Binary ClassificationContrastive Learning	CodeCode Available	2
OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding	Sep 25, 2023	Event Argument ExtractionEvent Detection	CodeCode Available	2
Traj-LO: In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time Trajectory	Sep 25, 2023		CodeCode Available	2
Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision	Sep 25, 2023	Image Quality Assessment	CodeCode Available	2
VidChapters-7M: Video Chapters at Scale	Sep 25, 2023	Dense Video CaptioningNavigate	CodeCode Available	2
MentaLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models	Sep 24, 2023	Instruction Following	CodeCode Available	2
P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting	Sep 22, 2023	DecoderSpeech Synthesis	CodeCode Available	2