The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 18301–18350 of 474278 papers

Title	Date	Tasks	Status	Hype
SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation	Jan 20, 2025	Speaker VerificationSpeech Enhancement	CodeCode Available	1
Technical Report for the Forgotten-by-Design Project: Targeted Obfuscation for Machine Learning	Jan 20, 2025	Inference AttackMachine Unlearning	CodeCode Available	1
A Survey of World Models for Autonomous Driving	Jan 20, 2025	Anomaly DetectionAutonomous Driving	CodeCode Available	1
UniTrans: A Unified Vertical Federated Knowledge Transfer Framework for Enhancing Cross-Hospital Collaboration	Jan 20, 2025	Federated LearningPrivacy Preserving	CodeCode Available	1
Communication-Efficient Federated Learning Based on Explanation-Guided Pruning for Remote Sensing Image Classification	Jan 20, 2025	Federated Learningimage-classification	CodeCode Available	1
Curiosity-Driven Reinforcement Learning from Human Feedback	Jan 20, 2025	DiversityInstruction Following	CodeCode Available	1
Automatic Labelling & Semantic Segmentation with 4D Radar Tensors	Jan 20, 2025	Semantic Segmentationvehicle detection	CodeCode Available	1
MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought Thinking	Jan 20, 2025	Decision MakingGSM8K	CodeCode Available	1
Chat3GPP: An Open-Source Retrieval-Augmented Generation Framework for 3GPP Documents	Jan 20, 2025	ChunkingRAG	CodeCode Available	1
MedicoSAM: Towards foundation models for medical image segmentation	Jan 20, 2025	Image SegmentationInteractive Segmentation	CodeCode Available	1
Glinthawk: A Two-Tiered Architecture for Offline LLM Inference	Jan 20, 2025	CPULanguage Modeling	CodeCode Available	1
Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation	Jan 20, 2025	Computational Efficiency	CodeCode Available	1
PD-SORT: Occlusion-Robust Multi-Object Tracking Using Pseudo-Depth Cues	Jan 20, 2025	Motion CompensationMulti-Object Tracking	CodeCode Available	1
Control LLM: Controlled Evolution for Intelligence Retention in LLM	Jan 19, 2025	MathMathematical Reasoning	CodeCode Available	1
Synthetic Data Generation by Supervised Neural Gas Network for Physiological Emotion Recognition Data	Jan 19, 2025	EEGEmotion Recognition	CodeCode Available	1
InsQABench: Benchmarking Chinese Insurance Domain Question Answering with Large Language Models	Jan 19, 2025	BenchmarkingQuestion Answering	CodeCode Available	1
ChaosEater: Fully Automating Chaos Engineering with Large Language Models	Jan 19, 2025	Code Generation	CodeCode Available	1
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model	Jan 19, 2025	In-Context LearningLanguage Modeling	CodeCode Available	1
Tell me about yourself: LLMs are aware of their learned behaviors	Jan 19, 2025		CodeCode Available	1
BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution	Jan 19, 2025	Optical Flow EstimationSSIM	CodeCode Available	1
GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human	Jan 19, 2025	Text Detection	CodeCode Available	1
A Remote Sensing Image Change Detection Method Integrating Layer Exchange and Channel-Spatial Differences	Jan 19, 2025	Change DetectionEarth Observation	CodeCode Available	1
The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs	Jan 19, 2025		CodeCode Available	1
Simultaneous Computation with Multiple Prioritizations in Multi-Agent Motion Planning	Jan 18, 2025	Motion PlanningMulti-Agent Path Finding	CodeCode Available	1
Graph Coloring to Reduce Computation Time in Prioritized Planning	Jan 18, 2025	Motion PlanningMulti-Agent Path Finding	CodeCode Available	1
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection	Jan 18, 2025	Contrastive LearningDecoder	CodeCode Available	1
Dynamic Trend Fusion Module for Traffic Flow Prediction	Jan 18, 2025	PredictionTraffic Prediction	CodeCode Available	1
Semi-supervised Semantic Segmentation for Remote Sensing Images via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention	Jan 18, 2025	Image SegmentationSegmentation	CodeCode Available	1
MedFILIP: Medical Fine-grained Language-Image Pre-training	Jan 18, 2025	Contrastive LearningDiagnostic	CodeCode Available	1
GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor	Jan 17, 2025		CodeCode Available	1
Evaluation and Efficiency Comparison of Evolutionary Algorithms for Service Placement Optimization in Fog Architectures	Jan 17, 2025	DiversityEvolutionary Algorithms	CodeCode Available	1
GenSC-6G: A Prototype Testbed for Integrated Generative AI, Quantum, and Semantic Communication	Jan 17, 2025	Semantic Communication	CodeCode Available	1
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks	Jan 17, 2025	Few-Shot Semantic SegmentationSegmentation	CodeCode Available	1
MSTS: A Multimodal Safety Test Suite for Vision-Language Models	Jan 17, 2025		CodeCode Available	1
When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis	Jan 17, 2025	Large Language ModelMultimodal Large Language Model	CodeCode Available	1
Agent-as-Judge for Factual Summarization of Long Narratives	Jan 17, 2025	Long-Form Narrative Summarization	CodeCode Available	1
The R-Vessel-X Project	Jan 17, 2025	AnatomySegmentation	CodeCode Available	1
Aneumo: A Large-Scale Comprehensive Synthetic Dataset of Aneurysm Hemodynamics	Jan 17, 2025		CodeCode Available	1
PandaSkill -- Player Performance and Skill Rating in Esports: Application to League of Legends	Jan 17, 2025		CodeCode Available	1
landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images	Jan 17, 2025	Pose Estimation	CodeCode Available	1
AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations	Jan 17, 2025	Contrastive LearningNavigate	CodeCode Available	1
Surrogate-based multiscale analysis of experiments on thermoplastic composites under off-axis loading	Jan 17, 2025	Transfer Learning	CodeCode Available	1
FaceXBench: Evaluating Multimodal LLMs on Face Understanding	Jan 17, 2025	FairnessMultiple-choice	CodeCode Available	1
MechIR: A Mechanistic Interpretability Framework for Information Retrieval	Jan 17, 2025	DiagnosticInformation Retrieval	CodeCode Available	1
A Unified Comparative Study with Generalized Conformity Scores for Multi-Output Conformal Regression	Jan 17, 2025	Conformal PredictionPrediction	CodeCode Available	1
OpticFusion: Multi-Modal Neural Implicit 3D Reconstruction of Microstructures by Fusing White Light Interferometry and Optical Microscopy	Jan 16, 2025	3D geometry3D Reconstruction	CodeCode Available	1
HSPFormer: Hierarchical Spatial Perception Transformer for Semantic Segmentation	Jan 16, 2025	Depth EstimationMonocular Depth Estimation	CodeCode Available	1
DSTIGCN: Deformable Spatial-Temporal Interaction Graph Convolution Network for Pedestrian Trajectory Prediction	Jan 16, 2025	Autonomous DrivingPedestrian Trajectory Prediction	CodeCode Available	1
Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging	Jan 16, 2025		CodeCode Available	1
Lossy Compression with Pretrained Diffusion Models	Jan 16, 2025		CodeCode Available	1