The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7301–7350 of 661570 papers

Title	Date	Tasks	Status	Hype
Mixed-curvature decision trees and random forests	Oct 3, 2024	Link Predictionregression	CodeCode Available	2
Towards Comprehensive Detection of Chinese Harmful Memes	Oct 3, 2024		CodeCode Available	2
AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML	Oct 3, 2024	AutoMLCode Generation	CodeCode Available	2
PnP-Flow: Plug-and-Play Image Restoration with Flow Matching	Oct 3, 2024	DeblurringDenoising	CodeCode Available	2
Curvature Diversity-Driven Deformation and Domain Alignment for Point Cloud	Oct 3, 2024	DiversityDomain Adaptation	CodeCode Available	2
A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond	Oct 3, 2024	MambaMedical Image Analysis	CodeCode Available	2
CodeJudge: Evaluating Code Generation with Large Language Models	Oct 3, 2024	Code Generation	CodeCode Available	2
CAnDOIT: Causal Discovery with Observational and Interventional Data from Time-Series	Oct 3, 2024	Causal DiscoveryTime Series	CodeCode Available	2
Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations	Oct 3, 2024	Zero Shot Segmentation	CodeCode Available	2
NNetscape Navigator: Complex Demonstrations for Web Agents Without a Demonstrator	Oct 3, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations	Oct 3, 2024		CodeCode Available	2
MiraGe: Editable 2D Images using Gaussian Splatting	Oct 2, 2024		CodeCode Available	2
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection	Oct 2, 2024	3DGS3D Object Detection	CodeCode Available	2
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging	Oct 2, 2024	Auto DebuggingBug fixing	CodeCode Available	2
Interpretable Contrastive Monte Carlo Tree Search Reasoning	Oct 2, 2024		CodeCode Available	2
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?	Oct 2, 2024		CodeCode Available	2
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy	Oct 2, 2024	Motion PlanningRobot Manipulation	CodeCode Available	2
FlipAttack: Jailbreak LLMs via Flipping	Oct 2, 2024		CodeCode Available	2
Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks	Oct 2, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models	Oct 2, 2024	Mixture-of-ExpertsNavigate	CodeCode Available	2
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?	Oct 2, 2024	Code CompletionCode Generation	CodeCode Available	2
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation	Oct 2, 2024	Image GenerationQuantization	CodeCode Available	2
Selective Aggregation for Low-Rank Adaptation in Federated Learning	Oct 2, 2024	Federated LearningGeneral Knowledge	CodeCode Available	2
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment	Oct 2, 2024	GSM8KMath	CodeCode Available	2
Peeling Back the Layers: An In-Depth Evaluation of Encoder Architectures in Neural News Recommenders	Oct 2, 2024	Model SelectionNews Recommendation	CodeCode Available	2
EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics	Oct 1, 2024		CodeCode Available	2
Generative causal testing to bridge data-driven models and scientific theories in language neuroscience	Oct 1, 2024		CodeCode Available	2
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection	Oct 1, 2024	3D Anomaly DetectionAnomaly Detection	CodeCode Available	2
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving	Oct 1, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	2
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control	Oct 1, 2024	Emotional Speech SynthesisSpeech Synthesis	CodeCode Available	2
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages	Oct 1, 2024	Automatic Speech Recognitionspeech-recognition	CodeCode Available	2
CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM	Oct 1, 2024	3DGSSimultaneous Localization and Mapping	CodeCode Available	2
Uncertainty Modelling and Robust Observer Synthesis using the Koopman Operator	Oct 1, 2024		CodeCode Available	2
Recent Advances in Speech Language Models: A Survey	Oct 1, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	2
Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models	Sep 30, 2024	BenchmarkingContinual Learning	CodeCode Available	2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation	Sep 30, 2024	AttributeCollaborative Filtering	CodeCode Available	2
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"	Sep 30, 2024	counterfactualHallucination	CodeCode Available	2
PerCo (SD): Open Perceptual Compression	Sep 30, 2024	AttributeImage Compression	CodeCode Available	2
Frequency Adaptive Normalization For Non-stationary Time Series Forecasting	Sep 30, 2024	Time SeriesTime Series Forecasting	CodeCode Available	2
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning	Sep 30, 2024	Instruction FollowingLanguage Modeling	CodeCode Available	2
HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes	Sep 30, 2024	Objectobject-detection	CodeCode Available	2
DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction	Sep 30, 2024	3D Object Detection3D Semantic Occupancy Prediction	CodeCode Available	2
ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities	Sep 30, 2024	Decision Making	CodeCode Available	2
KV-Compress: Paged KV-Cache Compression with Variable Compression Rates per Attention Head	Sep 30, 2024		CodeCode Available	2
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models	Sep 30, 2024	Contrastive Learning	CodeCode Available	2
DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data	Sep 30, 2024	Instruction FollowingLanguage Modeling	CodeCode Available	2
Melody-Guided Music Generation	Sep 30, 2024	cross-modal alignmentMusic Generation	CodeCode Available	2
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation	Sep 30, 2024	Cross-Modal RetrievalDynamic Time Warping	CodeCode Available	2
End-to-end Piano Performance-MIDI to Score Conversion with Transformers	Sep 30, 2024		CodeCode Available	2
Towards Robust Multimodal Sentiment Analysis with Incomplete Data	Sep 30, 2024	Multimodal Sentiment AnalysisSentiment Analysis	CodeCode Available	2