The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 11,601–11,650 of 661,570 papers

Title	Date	Tasks	Status	Hype
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models	May 24, 2023	Image Generation, Indoor Scene Synthesis	Code Available	2
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition	May 24, 2023	Denoising, Knowledge Distillation	Code Available	2
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence	May 24, 2023	Dense Pixel Correspondence Estimation, Representation Learning	Code Available	2
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario	May 24, 2023	Autonomous Driving, Question Answering	Code Available	2
Enabling Large Language Models to Generate Text with Citations	May 24, 2023	Hallucination, Retrieval	Code Available	2
gRNAde: Geometric Deep Learning for 3D RNA inverse design	May 24, 2023	3D geometry, Deep Learning	Code Available	2
A New Era in Software Security: Towards Self-Healing Software via Large Language Models and Formal Verification	May 24, 2023	C++ code, Mathematical Proofs	Code Available	2
torchgfn: A PyTorch GFlowNet library	May 24, 2023		Code Available	2
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts	May 24, 2023	In-Context Learning, Instruction Following	Code Available	2
Adapting Language Models to Compress Contexts	May 24, 2023	In-Context Learning, Language Modeling	Code Available	2
Lawyer LLaMA Technical Report	May 24, 2023	Articles, Hallucination	Code Available	2
Unpaired Image-to-Image Translation via Neural Schrödinger Bridge	May 24, 2023	Image-to-Image Translation, Translation	Code Available	2
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models	May 24, 2023	Chatbot, Natural Language Understanding	Code Available	2
Sparse4D v2: Recurrent Temporal Fusion with Sparse Model	May 23, 2023		Code Available	2
Improving Factuality and Reasoning in Language Models through Multiagent Debate	May 23, 2023	Few-Shot Learning, Language Modeling	Code Available	2
Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning	May 23, 2023	Code Generation, Constituency Parsing	Code Available	2
Link Prediction without Graph Neural Networks	May 23, 2023	Attribute, Graph Learning	Code Available	2
SAD: Segment Any RGBD	May 23, 2023	3D Panoptic Segmentation, Open Vocabulary Semantic Segmentation	Code Available	2
DetGPT: Detect What You Need via Reasoning	May 23, 2023	Autonomous Driving, Object	Code Available	2
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models	May 23, 2023	Common Sense Reasoning, Image Generation	Code Available	2
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach	May 23, 2023	GPU, Image Generation	Code Available	2
Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning	May 23, 2023	Image Generation, Optical Flow Estimation	Code Available	2
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training	May 23, 2023	Language Modeling, Language Modelling	Code Available	2
Efficient Multi-Scale Attention Module with Cross-Spatial Learning	May 23, 2023	Dimensionality Reduction, image-classification	Code Available	2
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning	May 23, 2023	Common Sense Reasoning, Common Sense Reasoning (Zero-Shot)	Code Available	2
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation	May 23, 2023	Form, Language Modelling	Code Available	2
REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos	May 23, 2023	Garment Reconstruction, Neural Rendering	Code Available	2
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra	May 23, 2023	Decoder, Denoising	Code Available	2
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models	May 23, 2023	Retrieval	Code Available	2
Perception Test: A Diagnostic Benchmark for Multimodal Video Models	May 23, 2023	Diagnostic, Grounded Video Question Answering	Code Available	2
SMT 2.0: A Surrogate Modeling Toolbox with a focus on Hierarchical and Mixed Variables Gaussian Processes	May 23, 2023	Gaussian Processes	Code Available	2
MAGE: Machine-generated Text Detection in the Wild	May 22, 2023	Binary text classification, Face Swapping	Code Available	2
Hierarchical Integration Diffusion Model for Realistic Image Deblurring	May 22, 2023	Deblurring, Image Deblurring	Code Available	2
Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection	May 22, 2023	Fairness	Code Available	2
Multimodal Automated Fact-Checking: A Survey	May 22, 2023	Fact Checking, Misinformation	Code Available	2
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars	May 22, 2023	2k, Image Matting	Code Available	2
Training Diffusion Models with Reinforcement Learning	May 22, 2023	Decision Making, Denoising	Code Available	2
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation	May 22, 2023	Imitation Learning, Motion Planning	Code Available	2
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching	May 22, 2023	All, Few-Shot Semantic Segmentation	Code Available	2
VDT: General-purpose Video Diffusion Transformers via Mask Modeling	May 22, 2023	Autonomous Driving, Video Generation	Code Available	2
Boosting Knowledge Graph Generation from Tabular Data with RML Views	May 22, 2023	Data Integration, Graph Generation	Code Available	2
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities	May 22, 2023	Event Extraction, graph construction	Code Available	2
Mist: Towards Improved Adversarial Examples for Diffusion Models	May 22, 2023	Adversarial Defense	Code Available	2
LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On	May 22, 2023	Virtual Try-on	Code Available	2
Lion: Adversarial Distillation of Proprietary Large Language Models	May 22, 2023	Instruction Following, Knowledge Distillation	Code Available	2
ControlVideo: Training-free Controllable Text-to-Video Generation	May 22, 2023	Image Generation, Text-to-Video Generation	Code Available	2
VanillaNet: the Power of Minimalism in Deep Learning	May 22, 2023	Deep Learning, Philosophy	Code Available	2
Evaluating the Performance of Large Language Models on GAOKAO Benchmark	May 21, 2023		Code Available	2
Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning	May 20, 2023	Logical Reasoning	Code Available	2
Knowledge-Design: Pushing the Limit of Protein Design via Knowledge Refinement	May 20, 2023	Protein Design, Retrieval	Code Available	2