The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Recently Added | Most Hyped | Most Active | Needs Verification | Most Verified

Showing 9,676–9,700 of 474,278 papers

Title	Date	Tasks	Status	Hype
MeaCap: Memory-Augmented Zero-shot Image Captioning	Mar 6, 2024	Caption Generation, Image Captioning	Code Available	2
What do we learn from inverting CLIP models?	Mar 5, 2024		Code Available	2
FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model	Mar 5, 2024	Stock Market Prediction	Code Available	2
Semantic Human Mesh Reconstruction with Textures	Mar 5, 2024		Code Available	2
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents	Mar 5, 2024	Benchmarking, Language Modeling	Code Available	2
Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels	Mar 5, 2024	Pseudo Label, Semantic Segmentation	Code Available	2
Interactive Continual Learning: Fast and Slow Thinking	Mar 5, 2024	Continual Learning, Outlier Detection	Code Available	2
Android in the Zoo: Chain-of-Action-Thought for GUI Agents	Mar 5, 2024	Language Modeling, Language Modelling	Code Available	2
PPFlow: Target-aware Peptide Design with Torsional Flow Matching	Mar 5, 2024	Drug Design, Drug Discovery	Code Available	2
TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of Experts	Mar 5, 2024	Graph Attention, Graph Embedding	Code Available	2
MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer	Mar 5, 2024		Code Available	2
ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling	Mar 5, 2024	All, Language Modeling	Code Available	2
Towards Measuring and Modeling "Culture" in LLMs: A Survey	Mar 5, 2024	Survey	Code Available	2
Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models	Mar 4, 2024	Knowledge Graph Completion, Knowledge Graphs	Code Available	2
Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection	Mar 4, 2024	DeepFake Detection, Face Swapping	Code Available	2
Learning to Solve Job Shop Scheduling under Uncertainty	Mar 4, 2024	Combinatorial Optimization, Deep Reinforcement Learning	Code Available	2
Large language models surpass human experts in predicting neuroscience results	Mar 4, 2024		Code Available	2
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection	Mar 4, 2024	GPU, Mamba	Code Available	2
xT: Nested Tokenization for Larger Context in Large Images	Mar 4, 2024		Code Available	2
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT	Mar 4, 2024	Image Captioning, Zero-shot Moment Retrieval	Code Available	2
A Simple Baseline for Efficient Hand Mesh Reconstruction	Mar 4, 2024	3D Hand Pose Estimation, Computational Efficiency	Code Available	2
Applied Causal Inference Powered by ML and AI	Mar 4, 2024	Causal Inference	Code Available	2
REAL-Colon: A dataset for developing real-world AI applications in colonoscopy	Mar 4, 2024	Benchmarking	Code Available	2
One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models	Mar 4, 2024	Adversarial Attack, Adversarial Robustness	Code Available	2
Multi-Spectral Remote Sensing Image Retrieval Using Geospatial Foundation Models	Mar 4, 2024	Image Retrieval, Retrieval	Code Available	2