The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers


Showing 10351–10400 of 661570 papers

Title	Date	Tasks	Status	Hype
Making Large Language Models Perform Better in Knowledge Graph Completion	Oct 10, 2023	In-Context Learning, Knowledge Graph Completion	Code Available	2
SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving	Jun 15, 2023	3D Semantic Scene Completion, 3D Semantic Scene Completion from a single 2D image	Code Available	2
SUNet: Swin Transformer UNet for Image Denoising	Feb 28, 2022	Denoising, Image Denoising	Code Available	2
GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?	Feb 7, 2025	8k, Information Retrieval	Code Available	2
SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image	Apr 2, 2022	NeRF, Novel View Synthesis	Code Available	2
Towards Knowledge-driven Autonomous Driving	Dec 7, 2023	Autonomous Driving, Neural Rendering	Code Available	2
Ring Attention with Blockwise Transformers for Near-Infinite Context	Oct 3, 2023	Language Modeling, Language Modelling	Code Available	2
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models	Jun 17, 2024		Code Available	2
TokenSHAP: Interpreting Large Language Models with Monte Carlo Shapley Value Estimation	Jul 14, 2024	Computational Efficiency, Prompt Engineering	Code Available	2
Language models scale reliably with over-training and on downstream tasks	Mar 13, 2024	Language Modelling	Code Available	2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement	May 13, 2025	Benchmarking, Language Modeling	Code Available	2
Editing Language Model-based Knowledge Graph Embeddings	Jan 25, 2023	EDIT Task, knowledge editing	Code Available	2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap	Mar 27, 2025	Autonomous Driving, In-Context Learning	Code Available	2
STAMP: Scalable Task And Model-agnostic Collaborative Perception	Jan 24, 2025	Autonomous Driving	Code Available	2
Dual Diffusion Implicit Bridges for Image-to-Image Translation	Mar 16, 2022	Image-to-Image Translation, Translation	Code Available	2
PartGS: Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics	Aug 20, 2024	3D Reconstruction	Code Available	2
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection	Mar 9, 2024	3D Object Detection, Autonomous Driving	Code Available	2
Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization	Feb 3, 2025	model	Code Available	2
Simple Online and Realtime Tracking	Feb 2, 2016	Multi-Object Tracking, Multiple Object Tracking	Code Available	2
Forecasting Global Weather with Graph Neural Networks	Feb 15, 2022		Code Available	2
Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving	Mar 27, 2025	3D Semantic Segmentation, Autonomous Driving	Code Available	2
Learning representations of learning representations	Apr 12, 2024	Sentence	Code Available	2
DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding	May 23, 2025	Language Modeling, Language Modelling	Code Available	2
Non-stationary Diffusion For Probabilistic Time Series Forecasting	May 7, 2025	Denoising, Probabilistic Time Series Forecasting	Code Available	2
Rethinking Efficient Lane Detection via Curve Modeling	Mar 4, 2022	Lane Detection	Code Available	2
Generative Auto-Bidding with Value-Guided Explorations	Apr 20, 2025	Reinforcement Learning (RL)	Code Available	2
MonoCD: Monocular 3D Object Detection with Complementary Depths	Apr 4, 2024	3D Object Detection, Depth Estimation	Code Available	2
ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data	Dec 16, 2024	Language Modeling, Language Modelling	Code Available	2
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching	Dec 22, 2024	Image Generation, Text to Image Generation	Code Available	2
OSSO: Obtaining Skeletal Shape from Outside	Apr 21, 2022		Code Available	2
Composed Video Retrieval via Enriched Context and Discriminative Embeddings	Mar 25, 2024	Composed Video Retrieval (CoVR), Retrieval	Code Available	2
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses	Jun 3, 2024		Code Available	2
BRIO: Bringing Order to Abstractive Summarization	Mar 31, 2022	Abstractive Text Summarization, Text Summarization	Code Available	2
Towards Measuring and Modeling "Culture" in LLMs: A Survey	Mar 5, 2024	Survey	Code Available	2
Vript: A Video Is Worth Thousands of Words	Jun 10, 2024	Video Captioning, Video Understanding	Code Available	2
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability	Feb 13, 2024	Text Generation	Code Available	2
Tensor-Var: Variational Data Assimilation in Tensor Product Feature Space	Jan 23, 2025		Code Available	2
CleanDIFT: Diffusion Features without Noise	Dec 4, 2024	Semantic correspondence	Code Available	2
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness	Mar 13, 2025	3D Human Pose Estimation, 3D Human Shape Estimation	Code Available	2
CAnDOIT: Causal Discovery with Observational and Interventional Data from Time-Series	Oct 3, 2024	Causal Discovery, Time Series	Code Available	2
GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting	Jan 22, 2025	Autonomous Driving, NeRF	Code Available	2
Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior	Apr 29, 2024	Image Compression, Image Reconstruction	Code Available	2
SRFormerV2: Taking a Closer Look at Permuted Self-Attention for Image Super-Resolution	Mar 17, 2023	Image Super-Resolution, Super-Resolution	Code Available	2
ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases	Jun 8, 2023		Code Available	2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Feb 19, 2025	GPU, Quantization	Code Available	2
Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting	Oct 7, 2024	3DGS	Code Available	2
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration	Apr 2, 2024	All, Decoder	Code Available	2
Towards Training-free Anomaly Detection with Vision and Language Foundation Models	Mar 24, 2025	Anomaly Detection	Code Available	2
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer	Apr 7, 2022	Video Generation	Code Available	2
LLM As DBA	Aug 10, 2023		Code Available	2