The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 9901–9925 of 474278 papers

Title	Date	Tasks	Status	Hype
Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark	Feb 18, 2024	Benchmarking	CodeCode Available	2
Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing	Feb 18, 2024	Deep Reinforcement LearningEdge-computing	CodeCode Available	2
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection	Feb 18, 2024	3D Object DetectionDataset Generation	CodeCode Available	2
Aligning Modalities in Vision Large Language Models via Preference Fine-tuning	Feb 18, 2024	HallucinationInstruction Following	CodeCode Available	2
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization	Feb 18, 2024	Code GenerationData Visualization	CodeCode Available	2
Neighborhood-Enhanced Supervised Contrastive Learning for Collaborative Filtering	Feb 18, 2024	Collaborative FilteringContrastive Learning	CodeCode Available	2
3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods	Feb 18, 2024	Data CompressionImage Compression	CodeCode Available	2
Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning	Feb 18, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Continual Learning on Graphs: Challenges, Solutions, and Opportunities	Feb 18, 2024	Continual LearningGraph Learning	CodeCode Available	2
Centroid-Based Efficient Minimum Bayes Risk Decoding	Feb 17, 2024	de-enTranslation	CodeCode Available	2
Optimizing tiny colorless feedback delay networks	Feb 17, 2024		CodeCode Available	2
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents	Feb 17, 2024	Backdoor Attackbackdoor defense	CodeCode Available	2
Beyond Generalization: A Survey of Out-Of-Distribution Adaptation on Graphs	Feb 17, 2024		CodeCode Available	2
PEDANTS: Cheap but Effective and Interpretable Answer Equivalence	Feb 17, 2024	BenchmarkingForm	CodeCode Available	2
CoLLaVO: Crayon Large Language and Vision mOdel	Feb 17, 2024	Large Language Modelmodel	CodeCode Available	2
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked Inputs	Feb 17, 2024	EEGEEG Signal Classification	CodeCode Available	2
Do Llamas Work in English? On the Latent Language of Multilingual Transformers	Feb 16, 2024		CodeCode Available	2
OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models	Feb 16, 2024	Common Sense ReasoningNavigate	CodeCode Available	2
Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs	Feb 16, 2024	Quantization	CodeCode Available	2
Incremental Sequence Labeling: A Tale of Two Shifts	Feb 16, 2024	Incremental LearningKnowledge Distillation	CodeCode Available	2
ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment	Feb 16, 2024	Entity AlignmentGraph Neural Network	CodeCode Available	2
Distillation Enhanced Generative Retrieval	Feb 16, 2024	RetrievalText Retrieval	CodeCode Available	2
An end-to-end attention-based approach for learning on graphs	Feb 16, 2024	Graph ClassificationGraph Regression	CodeCode Available	2
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator	Feb 16, 2024	Mathematical ReasoningRe-Ranking	CodeCode Available	2
Large Language Models as Zero-shot Dialogue State Tracker through Function Calling	Feb 16, 2024	AvgDialogue State Tracking	CodeCode Available	2