The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers


Showing 19,101–19,150 of 474,278 papers

Title	Date	Tasks	Status
Dynamic Malware Classification of Windows PE Files using CNNs and Greyscale Images Derived from Runtime API Call Argument Conversion	May 30, 2025	Malware Classification, Malware Detection	Unverified
Towards Scalable Schema Mapping using Large Language Models	May 30, 2025	Data Integration	Unverified
HESEIA: A community-based dataset for evaluating social biases in large language models, co-designed in real school settings in Latin America	May 30, 2025	Diversity	Unverified
Robust Federated Learning against Model Perturbation in Edge Networks	May 30, 2025	Federated Learning	Unverified
Online Fair Division with Additional Information	May 30, 2025	Fairness	Unverified
Guiding Generative Storytelling with Knowledge Graphs	May 30, 2025	Knowledge Graphs, RAG	Unverified
Coordinated Beamforming for RIS-Empowered ISAC Systems over Secure Low-Altitude Networks	May 30, 2025	Integrated sensing and communication, ISAC	Unverified
Interactive Video Generation via Domain Adaptation	May 30, 2025	Attribute, Denoising	Unverified
DexMachina: Functional Retargeting for Bimanual Dexterous Manipulation	May 30, 2025	Object	Unverified
Bi-Manual Joint Camera Calibration and Scene Representation	May 30, 2025	Camera Calibration, Robot Manipulation	Unverified
DiG-Net: Enhancing Quality of Life through Hyper-Range Dynamic Gesture Recognition in Assistive Robotics	May 30, 2025	Gesture Recognition	Unverified
Black-box Adversarial Attacks on CNN-based SLAM Algorithms	May 30, 2025	Simultaneous Localization and Mapping	Unverified
SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping	May 30, 2025	3D Object Reconstruction, 3D Reconstruction	Unverified
Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction	May 30, 2025	Action Generation, Optical Flow Estimation	Unverified
MGS3: A Multi-Granularity Self-Supervised Code Search Framework	May 30, 2025	Code Search, Contrastive Learning	Unverified
Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation	May 30, 2025	Knowledge Graphs, Misinformation	Unverified
SentinelAgent: Graph-based Anomaly Detection in Multi-Agent Systems	May 30, 2025	Anomaly Detection, Large Language Model	Unverified
Bootstrapping LLM Robustness for VLM Safety via Reducing the Pretraining Modality Gap	May 30, 2025	Safety Alignment	Unverified
E^2GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness	May 30, 2025	RAG, Retrieval	Unverified
Three Kinds of Negation in Knowledge and Their Mathematical Foundations	May 30, 2025	Negation, Philosophy	Unverified
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning	May 30, 2025	ARC, Reinforcement Learning (RL)	Unverified
P: A Universal Measure of Predictive Intelligence	May 30, 2025	Protein Folding	Unverified
Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction	May 30, 2025	Domain Generalization, Mixture-of-Experts	Unverified
Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning	May 30, 2025	Chunking, graph construction	Unverified
AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models	May 30, 2025	Deep Reinforcement Learning	Unverified
The Butterfly Effect in Pathology: Exploring Security in Pathology Foundation Models	May 30, 2025	Adversarial Robustness	Code Available
Attractor learning for spatiotemporally chaotic dynamical systems using echo state networks with transfer learning	May 30, 2025	Transfer Learning	Unverified
Beyond Linear Steering: Unified Multi-Attribute Control for Language Models	May 30, 2025	Attribute	Unverified
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs	May 30, 2025	Diagnostic, Image Comprehension	Unverified
Sparsity-Driven Parallel Imaging Consistency for Improved Self-Supervised MRI Reconstruction	May 30, 2025	MRI Reconstruction, Self-Supervised Learning	Unverified
S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation	May 30, 2025	Autonomous Driving, Autonomous Vehicles	Unverified
RCCDA: Adaptive Model Updates in the Presence of Concept Drift under a Constrained Resource Budget	May 30, 2025	Domain Generalization, Drift Detection	Unverified
LKD-KGC: Domain-Specific KG Construction via LLM-driven Knowledge Dependency Parsing	May 30, 2025	Dependency Parsing, graph construction	Unverified
Towards Unified Modeling in Federated Multi-Task Learning via Subspace Decoupling	May 30, 2025	Multi-Task Learning, Privacy Preserving	Unverified
Benchmarking Foundation Models for Zero-Shot Biometric Tasks	May 30, 2025	Attribute, Benchmarking	Unverified
Reasoning Can Hurt the Inductive Abilities of Large Language Models	May 30, 2025	Diagnostic	Unverified
LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework	May 30, 2025	3D Generation, 3D Shape Generation	Unverified
Faithful and Robust LLM-Driven Theorem Proving for NLI Explanations	May 30, 2025	Automated Theorem Proving, Natural Language Inference	Unverified
Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering	May 30, 2025	Language Modeling, Language Modelling	Unverified
The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It	May 30, 2025	Diversity, Survey	Unverified
SASP: Strip-Aware Spatial Perception for Fine-Grained Bird Image Classification	May 30, 2025	image-classification, Image Classification	Unverified
CREFT: Sequential Multi-Agent LLM for Character Relation Extraction	May 30, 2025	Knowledge Distillation, Language Modeling	Unverified
Boosting Automatic Exercise Evaluation Through Musculoskeletal Simulation-Based IMU Data Augmentation	May 30, 2025	Data Augmentation	Unverified
Localizing Persona Representations in LLMs	May 30, 2025	Decoder, Dimensionality Reduction	Unverified
Deep Learning Weather Models for Subregional Ocean Forecasting: A Case Study on the Canary Current Upwelling System	May 30, 2025	Graph Neural Network, Weather Forecasting	Unverified
Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing	May 30, 2025	object-detection, Object Detection	Unverified
Object Centric Concept Bottlenecks	May 30, 2025	Decision Making, Object	Unverified
Cross-Attention Speculative Decoding	May 30, 2025	Decoder	Unverified
Eye of Judgement: Dissecting the Evaluation of Russian-speaking LLMs with POLLUX	May 30, 2025	Code Generation	Unverified
Cloud Optical Thickness Retrievals Using Angle Invariant Attention Based Deep Learning Models	May 30, 2025	Retrieval	Unverified