The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 3,651–3,700 of 659,983 papers

Title	Date	Tasks	Status	Hype
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs	Apr 11, 2024		Code Available	3
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving	Apr 11, 2024	Autonomous Driving, NeRF	Code Available	3
Rho-1: Not All Tokens Are What You Need	Apr 11, 2024	All, Continual Pretraining	Code Available	3
Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs	Apr 10, 2024		Code Available	3
Addressing the Abstraction and Reasoning Corpus via Procedural Example Generation	Apr 10, 2024	ARC, Diversity	Code Available	3
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection	Apr 9, 2024	Anomaly Detection, Decoder	Code Available	3
ZeST: Zero-Shot Material Transfer from a Single Image	Apr 9, 2024	Appearance Transfer, Object	Code Available	3
RoadBEV: Road Surface Reconstruction in Bird's Eye View	Apr 9, 2024	Autonomous Driving, Autonomous Vehicles	Code Available	3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python	Apr 9, 2024	Decision Making, Language Modeling	Code Available	3
HPNet: Dynamic Trajectory Forecasting with Historical Prediction Attention	Apr 9, 2024	Autonomous Driving, Prediction	Code Available	3
pfl-research: simulation framework for accelerating research in Private Federated Learning	Apr 9, 2024	Federated Learning	Code Available	3
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation	Apr 8, 2024	Image Generation, Image-to-Image Translation	Code Available	3
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection	Apr 8, 2024	Anomaly Detection, Language Modeling	Code Available	3
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding	Apr 8, 2024	GPU, Multiple-choice	Code Available	3
AI2Apps: A Visual IDE for Building LLM-based AI Agent Applications	Apr 7, 2024	AI Agent, Management	Code Available	3
Allo: A Programming Model for Composable Accelerator Design	Apr 7, 2024	GPU, High-Level Synthesis	Code Available	3
Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making	Apr 6, 2024	Decision Making	Code Available	3
Lossless and Near-Lossless Compression for Foundation Models	Apr 5, 2024		Code Available	3
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation	Apr 5, 2024	Decoder, Mamba	Code Available	3
3D Facial Expressions through Analysis-by-Neural-Synthesis	Apr 5, 2024	3D Face Reconstruction, Face Reconstruction	Code Available	3
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions	Apr 4, 2024	Survey	Code Available	3
LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis	Apr 3, 2024	3D Reconstruction, 4D reconstruction	Code Available	3
RS-Mamba for Large Remote Sensing Image Dense Prediction	Apr 3, 2024	Building change detection for remote sensing images, Change Detection	Code Available	3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models	Apr 3, 2024	GPU, Math	Code Available	3
Faster Diffusion via Temporal Attention Decomposition	Apr 3, 2024		Code Available	3
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models	Apr 3, 2024	GSM8K, Quantization	Code Available	3
Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining	Apr 2, 2024	Image Reconstruction, Rain Removal	Code Available	3
Tensorized NeuroEvolution of Augmenting Topologies for GPU Acceleration	Apr 2, 2024	Computational Efficiency, GPU	Code Available	3
Advancing LLM Reasoning Generalists with Preference Trees	Apr 2, 2024	Benchmarking, Code Generation	Code Available	3
SPMamba: State-space model is all you need in speech separation	Apr 2, 2024	All, Mamba	Code Available	3
GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views	Apr 2, 2024	3DGS, Novel View Synthesis	Code Available	3
ViTamin: Designing Scalable Vision Models in the Vision-Language Era	Apr 2, 2024		Code Available	3
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks	Apr 2, 2024	In-Context Learning	Code Available	3
Evalverse: Unified and Accessible Library for Large Language Model Evaluation	Apr 1, 2024	Language Model Evaluation, Language Modeling	Code Available	3
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA	Apr 1, 2024	GPU, Multiobjective Optimization	Code Available	3
HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach	Apr 1, 2024		Code Available	3
An RML-FNML module for Python user-defined functions in Morph-KGC	Apr 1, 2024	Data Integration, Knowledge Graphs	Code Available	3
Evaluating Text-to-Visual Generation with Image-to-Text Generation	Apr 1, 2024	Image to text, Question Answering	Code Available	3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models	Mar 31, 2024	Image-text Retrieval, Language Modeling	Code Available	3
Towards Realistic Scene Generation with LiDAR Diffusion Models	Mar 31, 2024	3D geometry, Image Generation	Code Available	3
DRCT: Saving Image Super-resolution away from Information Bottleneck	Mar 31, 2024	Image Super-Resolution, Super-Resolution	Code Available	3
94% on CIFAR-10 in 3.29 Seconds on a Single GPU	Mar 30, 2024	GPU	Code Available	3
Rewrite the Stars	Mar 29, 2024		Code Available	3
UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation	Mar 29, 2024	Image Segmentation, Lesion Segmentation	Code Available	3
Are We on the Right Way for Evaluating Large Vision-Language Models?	Mar 29, 2024	World Knowledge	Code Available	3
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios	Mar 28, 2024	Language Modeling, Language Modelling	Code Available	3
RSMamba: Remote Sensing Image Classification with State Space Model	Mar 28, 2024	Classification, image-classification	Code Available	3
Navigating Eukaryotic Genome Annotation Pipelines: A Route Map to BRAKER, Galba, and TSEBRA	Mar 28, 2024		Code Available	3
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models	Mar 28, 2024	Language Modeling, Language Modelling	Code Available	3
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions	Mar 28, 2024	Image Retrieval, Implicit Relations	Code Available	3