The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 9951–9975 of 474278 papers

Title	Date	Tasks	Status	Hype
Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio	Feb 14, 2024	Audio ClassificationDecoder	CodeCode Available	2
LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset	Feb 14, 2024	Drug Discovery	CodeCode Available	2
Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference	Feb 14, 2024		CodeCode Available	2
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents	Feb 14, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Generalized Portrait Quality Assessment	Feb 14, 2024	Face Image Quality Assessment	CodeCode Available	2
PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments	Feb 14, 2024	3D Reconstruction3D Scene Reconstruction	CodeCode Available	2
YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection	Feb 14, 2024	Fracture detectionmedical image detection	CodeCode Available	2
Extreme Video Compression with Pre-trained Diffusion Models	Feb 14, 2024	DecoderImage Compression	CodeCode Available	2
Personalized Large Language Models	Feb 14, 2024	Emotion RecognitionHate Speech Detection	CodeCode Available	2
MultiMedEval: A Benchmark and a Toolkit for Evaluating Medical Vision-Language Models	Feb 14, 2024	BenchmarkingDiversity	CodeCode Available	2
Learning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback	Feb 13, 2024	Video Synopsis	CodeCode Available	2
BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation	Feb 13, 2024	Image SegmentationMedical Image Segmentation	CodeCode Available	2
An Embarrassingly Simple Approach for LLM with Strong ASR Capacity	Feb 13, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	2
A Survey of Generative AI for de novo Drug Design: New Frontiers in Molecule and Protein Generation	Feb 13, 2024	Drug Design	CodeCode Available	2
RBF-PINN: Non-Fourier Positional Embedding in Physics-Informed Neural Networks	Feb 13, 2024		CodeCode Available	2
Learning Continuous 3D Words for Text-to-Image Generation	Feb 13, 2024	Image GenerationText to Image Generation	CodeCode Available	2
THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation	Feb 13, 2024	Robot Manipulation Generalization	CodeCode Available	2
Transductive Active Learning: Theory and Applications	Feb 13, 2024	Active LearningBayesian Optimization	CodeCode Available	2
DNABERT-S: Pioneering Species Differentiation with Species-Aware DNA Embeddings	Feb 13, 2024	Contrastive Learning	CodeCode Available	2
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast	Feb 13, 2024	Language ModellingLarge Language Model	CodeCode Available	2
Can LLMs Learn New Concepts Incrementally without Forgetting?	Feb 13, 2024	In-Context LearningIncremental Learning	CodeCode Available	2
Higher Layers Need More LoRA Experts	Feb 13, 2024	Mixture-of-Experts	CodeCode Available	2
ChatCell: Facilitating Single-Cell Analysis with Natural Language	Feb 13, 2024		CodeCode Available	2
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents	Feb 13, 2024	BenchmarkingModel Selection	CodeCode Available	2
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability	Feb 13, 2024	Text Generation	CodeCode Available	2