The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6951–7000 of 661570 papers

Title	Date	Tasks	Status	Hype
Towards Generative Ray Path Sampling for Faster Point-to-Point Ray Tracing	Oct 31, 2024	valid	CodeCode Available	2
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators	Oct 31, 2024	BenchmarkingText Generation	CodeCode Available	2
Language Models can Self-Lengthen to Generate Long Texts	Oct 31, 2024	Text Generation	CodeCode Available	2
The Importance of Being Scalable: Improving the Speed and Accuracy of Neural Network Interatomic Potentials Across Chemical Domains	Oct 31, 2024	GPUPhilosophy	CodeCode Available	2
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs	Oct 31, 2024	Knowledge GraphsLanguage Modeling	CodeCode Available	2
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images	Oct 31, 2024	3D Object DetectionDepth Estimation	CodeCode Available	2
EgoMimic: Scaling Imitation Learning via Egocentric Video	Oct 31, 2024	DiversityImitation Learning	CodeCode Available	2
RSL-SQL: Robust Schema Linking in Text-to-SQL Generation	Oct 31, 2024	Text to SQLText-To-SQL	CodeCode Available	2
GPT or BERT: why not both?	Oct 31, 2024	Causal Language ModelingLanguage Modeling	CodeCode Available	2
Ada-MSHyper: Adaptive Multi-Scale Hypergraph Transformer for Time Series Forecasting	Oct 31, 2024	Time SeriesTime Series Forecasting	CodeCode Available	2
VecCity: A Taxonomy-guided Library for Map Entity Representation Learning	Oct 31, 2024	Representation Learning	CodeCode Available	2
End-to-End Ontology Learning with Large Language Models	Oct 31, 2024		CodeCode Available	2
What is Wrong with Perplexity for Long-context Language Modeling?	Oct 31, 2024	Document SummarizationIn-Context Learning	CodeCode Available	2
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models	Oct 30, 2024	Benchmarking	CodeCode Available	2
SciPIP: An LLM-based Scientific Paper Idea Proposer	Oct 30, 2024	Retrieval	CodeCode Available	2
Multi-Agent Large Language Models for Conversational Task-Solving	Oct 30, 2024	FairnessQuestion Answering	CodeCode Available	2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks	Oct 30, 2024	General Reinforcement LearningReinforcement Learning (RL)	CodeCode Available	2
Controlling Language and Diffusion Models by Transporting Activations	Oct 30, 2024	Negation	CodeCode Available	2
Consistency Diffusion Bridge Models	Oct 30, 2024	DenoisingImage-to-Image Translation	CodeCode Available	2
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation	Oct 30, 2024	Domain AdaptationDomain Generalization	CodeCode Available	2
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation	Oct 30, 2024	BenchmarkingPassage Retrieval	CodeCode Available	2
MassSpecGym: A benchmark for the discovery and identification of molecules	Oct 30, 2024	De novo molecule generation from MS/MS spectrumDe novo molecule generation from MS/MS spectrum (bonus chemical formulae)	CodeCode Available	2
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources	Oct 30, 2024	GPU	CodeCode Available	2
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis	Oct 30, 2024	Speech Synthesistext-to-speech	CodeCode Available	2
EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models	Oct 30, 2024	DeblurringEnsemble Learning	CodeCode Available	2
Multi-Programming Language Sandbox for LLMs	Oct 30, 2024		CodeCode Available	2
Very fast Bayesian Additive Regression Trees on GPU	Oct 30, 2024	CPUGPU	CodeCode Available	2
CHORDONOMICON: A Dataset of 666,000 Songs and their Chord Progressions	Oct 29, 2024		CodeCode Available	2
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Oct 29, 2024	Image RetrievalRAG	CodeCode Available	2
PC-Gym: Benchmark Environments For Process Control Problems	Oct 29, 2024	BenchmarkingChemical Process	CodeCode Available	2
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench	Oct 29, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation	Oct 29, 2024	Few-shot 3D Point Cloud Semantic SegmentationPoint Cloud Segmentation	CodeCode Available	2
ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation	Oct 29, 2024	Drug Discovery	CodeCode Available	2
A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection	Oct 29, 2024	Anomaly Detection	CodeCode Available	2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting	Oct 29, 2024	Active 3D ReconstructionDecision Making	CodeCode Available	2
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts	Oct 29, 2024		CodeCode Available	2
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance	Oct 29, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
RecFlow: An Industrial Full Flow Recommendation Dataset	Oct 28, 2024	Recommendation SystemsSelection bias	CodeCode Available	2
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior	Oct 28, 2024	Video GenerationVideo Reconstruction	CodeCode Available	2
Domain Adaptation with a Single Vision-Language Embedding	Oct 28, 2024	Domain AdaptationOne-shot Unsupervised Domain Adaptation	CodeCode Available	2
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning	Oct 28, 2024	Binary ClassificationContrastive Learning	CodeCode Available	2
Skinned Motion Retargeting with Dense Geometric Interaction Perception	Oct 28, 2024	motion retargeting	CodeCode Available	2
ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings	Oct 28, 2024	3D Reconstruction3D Scene Reconstruction	CodeCode Available	2
Fast Calibrated Explanations: Efficient and Uncertainty-Aware Explanations for Machine Learning Models	Oct 28, 2024	Computational EfficiencyFeature Importance	CodeCode Available	2
Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks	Oct 28, 2024		CodeCode Available	2
BSD: a Bayesian framework for parametric models of neural spectra	Oct 28, 2024	Bayesian InferenceEEG	CodeCode Available	2
Trajectory Flow Matching with Applications to Clinical Time Series Modeling	Oct 28, 2024	Time Series	CodeCode Available	2
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders	Oct 28, 2024	Denoising	CodeCode Available	2
Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval	Oct 28, 2024	Image RetrievalImage to text	CodeCode Available	2
Flaming-hot Initiation with Regular Execution Sampling for Large Language Models	Oct 28, 2024	DiversityMath	CodeCode Available	2