The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 19051–19100 of 474278 papers

Title	Date	Tasks	Status	Hype
ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL	Dec 13, 2024	In-Context LearningText to SQL	CodeCode Available	1
DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts	Dec 13, 2024	Claim VerificationFact Checking	CodeCode Available	1
CognitionCapturer: Decoding Visual Stimuli From Human EEG Signal With Multimodal Information	Dec 13, 2024	EEGElectroencephalogram (EEG)	CodeCode Available	1
From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection	Dec 13, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection	Dec 13, 2024	object-detectionObject Detection	CodeCode Available	1
Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics	Dec 13, 2024		CodeCode Available	1
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector	Dec 13, 2024	In-Context LearningQuestion Answering	CodeCode Available	1
FM2S: Towards Spatially-Correlated Noise Modeling in Zero-Shot Fluorescence Microscopy Image Denoising	Dec 13, 2024	Computational EfficiencyData Augmentation	CodeCode Available	1
Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection	Dec 13, 2024	Anomaly Detection	CodeCode Available	1
GraSP: Simple yet Effective Graph Similarity Predictions	Dec 13, 2024	Graph Similarity	CodeCode Available	1
Semi-IIN: Semi-supervised Intra-inter modal Interaction Learning Network for Multimodal Sentiment Analysis	Dec 13, 2024	Multimodal Sentiment AnalysisSentiment Analysis	CodeCode Available	1
ChainStream: An LLM-based Framework for Unified Synthetic Sensing	Dec 13, 2024	Code Generation	CodeCode Available	1
Multi-Head Encoding for Extreme Label Classification	Dec 13, 2024	Classification	CodeCode Available	1
The Complexity Dynamics of Grokking	Dec 13, 2024	Generalization BoundsMemorization	CodeCode Available	1
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data	Dec 13, 2024	named-entity-recognitionNamed Entity Recognition	CodeCode Available	1
Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation	Dec 13, 2024	Token Reduction	CodeCode Available	1
CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models	Dec 13, 2024	RAG	CodeCode Available	1
waveOrder: generalist framework for label-agnostic computational microscopy	Dec 13, 2024		CodeCode Available	1
GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers	Dec 12, 2024	GSM8KPrompt Engineering	CodeCode Available	1
Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion	Dec 12, 2024	HallucinationKnowledge Graph Completion	CodeCode Available	1
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM	Dec 12, 2024	Computational Efficiency	CodeCode Available	1
Towards Open-Vocabulary Video Semantic Segmentation	Dec 12, 2024	SegmentationSemantic Segmentation	CodeCode Available	1
Enhancing Implicit Neural Representations via Symmetric Power Transformation	Dec 12, 2024		CodeCode Available	1
Federated Foundation Models on Heterogeneous Time Series	Dec 12, 2024	Anomaly DetectionFederated Learning	CodeCode Available	1
Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person Re-Identification	Dec 12, 2024	Person Re-Identification	CodeCode Available	1
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation	Dec 12, 2024	cross-modal alignmentMultimodal Music Generation	CodeCode Available	1
GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression	Dec 12, 2024	DisentanglementPortrait Animation	CodeCode Available	1
A physics-informed transformer neural operator for learning generalized solutions of initial boundary value problems	Dec 12, 2024	Operator learning	CodeCode Available	1
Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression	Dec 12, 2024	4k8k	CodeCode Available	1
USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation	Dec 12, 2024	Action DetectionAction Recognition	CodeCode Available	1
A Flexible Plug-and-Play Module for Generating Variable-Length	Dec 12, 2024	Deep HashingImage Retrieval	CodeCode Available	1
RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios	Dec 12, 2024	Logical ReasoningLong-Context Understanding	CodeCode Available	1
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning	Dec 12, 2024	Offline RL	CodeCode Available	1
Selective Visual Prompting in Vision Mamba	Dec 12, 2024	MambaState Space Models	CodeCode Available	1
Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals	Dec 12, 2024	EEGTime Series	CodeCode Available	1
Weighted Poisson-disk Resampling on Large-Scale Point Clouds	Dec 12, 2024		CodeCode Available	1
MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images	Dec 12, 2024	DiagnosticTransfer Learning	CodeCode Available	1
SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations	Dec 12, 2024	FairnessLanguage Modeling	CodeCode Available	1
Can Modern LLMs Act as Agent Cores in Radiology Environments?	Dec 12, 2024		CodeCode Available	1
SMMF: Square-Matricized Momentum Factorization for Memory-Efficient Optimization	Dec 12, 2024		CodeCode Available	1
Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark	Dec 12, 2024	Highlight DetectionVideo Summarization	CodeCode Available	1
Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration	Dec 12, 2024	Contrastive LearningImage Restoration	CodeCode Available	1
Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries	Dec 12, 2024	4kGSM8K	CodeCode Available	1
OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs	Dec 12, 2024	Image RestorationImage Super-Resolution	CodeCode Available	1
CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning	Dec 12, 2024	class-incremental learningClass Incremental Learning	CodeCode Available	1
Temporal Action Localization with Cross Layer Task Decoupling and Refinement	Dec 12, 2024	Action ClassificationAction Localization	CodeCode Available	1
Physics-Driven Autoregressive State Space Models for Medical Image Reconstruction	Dec 12, 2024	Image ReconstructionSensitivity	CodeCode Available	1
Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model	Dec 12, 2024	Anomaly DetectionVideo Anomaly Detection	CodeCode Available	1
GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency	Dec 12, 2024	cross-modal alignmentTransfer Learning	CodeCode Available	1
PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields	Dec 12, 2024	3D ReconstructionInverse Rendering	CodeCode Available	1