The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,142 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3901–3950 of 661570 papers

Title	Date	Tasks	Status	Hype
LoRA+: Efficient Low Rank Adaptation of Large Models	Feb 19, 2024		CodeCode Available	3
ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models	Feb 18, 2024	Language ModellingQuestion Answering	CodeCode Available	3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations	Feb 18, 2024	DenoisingRobot Manipulation	CodeCode Available	3
EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models	Feb 18, 2024	Event ExtractionHallucination	CodeCode Available	3
GenAD: Generative End-to-End Autonomous Driving	Feb 18, 2024	Autonomous DrivingBench2Drive	CodeCode Available	3
OneBit: Towards Extremely Low-bit Large Language Models	Feb 17, 2024	Quantization	CodeCode Available	3
LLMDFA: Analyzing Dataflow in Code with Large Language Models	Feb 16, 2024	Hallucination	CodeCode Available	3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations	Feb 16, 2024	DenoisingRobot Manipulation	CodeCode Available	3
Discovering and exploring cases of educational source code plagiarism with Dolos	Feb 16, 2024		CodeCode Available	3
BitDelta: Your Fine-Tune May Only Be Worth One Bit	Feb 15, 2024	GPU	CodeCode Available	3
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips	Feb 15, 2024		CodeCode Available	3
QuRating: Selecting High-Quality Data for Training Language Models	Feb 15, 2024	In-Context Learning	CodeCode Available	3
Data Engineering for Scaling Language Models to 128K Context	Feb 15, 2024	4kContinual Pretraining	CodeCode Available	3
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models	Feb 15, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning	Feb 15, 2024	Data AugmentationInstruction Following	CodeCode Available	3
GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering	Feb 15, 2024	3D ReconstructionNovel View Synthesis	CodeCode Available	3
Traj-LIO: A Resilient Multi-LiDAR Multi-IMU State Estimator Through Sparse Gaussian Process	Feb 14, 2024		CodeCode Available	3
Magic-Me: Identity-Specific Video Customized Diffusion	Feb 14, 2024	Image GenerationText to Image Generation	CodeCode Available	3
PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers	Feb 13, 2024	Question AnsweringRetrieval	CodeCode Available	3
VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search	Feb 13, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
SPO: Sequential Monte Carlo Policy Optimisation	Feb 12, 2024	Decision MakingModel-based Reinforcement Learning	CodeCode Available	3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models	Feb 12, 2024	Answer GenerationHallucination	CodeCode Available	3
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models	Feb 12, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
Scaling Laws for Fine-Grained Mixture of Experts	Feb 12, 2024	Mixture-of-Experts	CodeCode Available	3
Q-Bench+: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to Pairs	Feb 11, 2024	Image Quality AssessmentQuestion Answering	CodeCode Available	3
X-LoRA: Mixture of Low-Rank Adapter Experts, a Flexible Framework for Large Language Models with Applications in Protein Mechanics and Molecular Design	Feb 11, 2024	graph constructionKnowledge Graphs	CodeCode Available	3
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning	Feb 10, 2024	Federated LearningInstruction Following	CodeCode Available	3
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models	Feb 10, 2024	CPUGPU	CodeCode Available	3
ResumeFlow: An LLM-facilitated Pipeline for Personalized Resume Generation and Refinement	Feb 9, 2024	HallucinationLanguage Modelling	CodeCode Available	3
FNSPID: A Comprehensive Financial News Dataset in Time Series	Feb 9, 2024	Financial AnalysisTime Series	CodeCode Available	3
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics	Feb 9, 2024		CodeCode Available	3
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting	Feb 9, 2024		CodeCode Available	3
The boundary of neural network trainability is fractal	Feb 9, 2024		CodeCode Available	3
Noise Contrastive Alignment of Language Models with Explicit Rewards	Feb 8, 2024	Language ModellingMath	CodeCode Available	3
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey	Feb 8, 2024	ArticlesEntity Alignment	CodeCode Available	3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents	Feb 8, 2024	Autonomous DrivingLanguage Modeling	CodeCode Available	3
Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design	Feb 7, 2024		CodeCode Available	3
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models	Feb 7, 2024	counterfactualImage Generation	CodeCode Available	3
MEMORYLLM: Towards Self-Updatable Large Language Models	Feb 7, 2024	Model Editing	CodeCode Available	3
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory	Feb 7, 2024		CodeCode Available	3
Temporal Graph Analysis with TGX	Feb 6, 2024		CodeCode Available	3
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation	Feb 6, 2024	Image to Video GenerationVideo Generation	CodeCode Available	3
Does confidence calibration improve conformal prediction?	Feb 6, 2024	Conformal PredictionPrediction	CodeCode Available	3
OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving	Feb 6, 2024	Autonomous DrivingNeural Rendering	CodeCode Available	3
CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations	Feb 6, 2024	Visual Reasoning	CodeCode Available	3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls	Feb 6, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
DistiLLM: Towards Streamlined Distillation for Large Language Models	Feb 6, 2024	Instruction FollowingKnowledge Distillation	CodeCode Available	3
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry	Feb 6, 2024		CodeCode Available	3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs	Feb 6, 2024	BinarizationGPU	CodeCode Available	3
Deep Learning for Multivariate Time Series Imputation: A Survey	Feb 6, 2024	Deep LearningImputation	CodeCode Available	3