The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11201–11250 of 661570 papers

Title	Date	Tasks	Status	Hype
SSLRec: A Self-Supervised Learning Framework for Recommendation	Aug 10, 2023	Collaborative FilteringData Augmentation	CodeCode Available	2
LLM As DBA	Aug 10, 2023		CodeCode Available	2
Follow Anything: Open-set detection, tracking, and following in real-time	Aug 10, 2023		CodeCode Available	2
Flexible Isosurface Extraction for Gradient-Based Mesh Optimization	Aug 10, 2023		CodeCode Available	2
PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences	Aug 10, 2023	Deep Learningvalid	CodeCode Available	2
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection	Aug 10, 2023	Objectobject-detection	CodeCode Available	2
Fuzz4All: Universal Fuzzing with Large Language Models	Aug 9, 2023		CodeCode Available	2
PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning	Aug 8, 2023	Representation Learning	CodeCode Available	2
Cumulative Reasoning with Large Language Models	Aug 8, 2023	Decision MakingLogical Reasoning	CodeCode Available	2
LATR: 3D Lane Detection from Monocular Images with Transformer	Aug 8, 2023	3D Lane DetectionAutonomous Driving	CodeCode Available	2
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection	Aug 8, 2023	3D Object DetectionAutonomous Driving	CodeCode Available	2
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment	Aug 8, 2023	3D Question Answering (3D-QA)Dense Captioning	CodeCode Available	2
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool	Aug 8, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Shepherd: A Critic for Language Model Generation	Aug 8, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions	Aug 8, 2023	Caption GenerationImage Captioning	CodeCode Available	2
AgentSims: An Open-Source Sandbox for Large Language Model Evaluation	Aug 8, 2023	Language Model EvaluationLanguage Modeling	CodeCode Available	2
ConDistFL: Conditional Distillation for Federated Learning from Partially Annotated Data	Aug 8, 2023	Federated LearningKnowledge Distillation	CodeCode Available	2
PokerKit: A Comprehensive Python Library for Fine-Grained Multi-Variant Poker Game Simulations	Aug 8, 2023		CodeCode Available	2
UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition	Aug 7, 2023	named-entity-recognitionNamed Entity Recognition	CodeCode Available	2
TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models	Aug 7, 2023	HallucinationObject Hallucination	CodeCode Available	2
Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue	Aug 7, 2023	Instruction FollowingLanguage Modeling	CodeCode Available	2
SynJax: Structured Probability Distributions for JAX	Aug 7, 2023		CodeCode Available	2
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning	Aug 7, 2023	Offline RLreinforcement-learning	CodeCode Available	2
Dual Aggregation Transformer for Image Super-Resolution	Aug 7, 2023	Image Super-ResolutionSuper-Resolution	CodeCode Available	2
Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model	Aug 7, 2023	DenoisingImage Denoising	CodeCode Available	2
Spanish Pre-trained BERT Model and Evaluation Data	Aug 6, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies	Aug 6, 2023	Hallucination	CodeCode Available	2
Early Detection and Localization of Pancreatic Cancer by Label-Free Tumor Synthesis	Aug 6, 2023	Specificity	CodeCode Available	2
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education	Aug 5, 2023	ChatbotLanguage Modeling	CodeCode Available	2
PowerSimulationsDynamics.jl -- An Open Source Modeling Package for Modern Power Systems with Inverter-Based Resources	Aug 5, 2023		CodeCode Available	2
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP	Aug 4, 2023	Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation	CodeCode Available	2
Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data	Aug 4, 2023	Question AnsweringVisual Question Answering	CodeCode Available	2
FB-BEV: BEV Representation from Forward-Backward View Transformations	Aug 4, 2023		CodeCode Available	2
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities	Aug 4, 2023	MathMM-Vet	CodeCode Available	2
UniSim: A Neural Closed-Loop Sensor Simulator	Aug 3, 2023		CodeCode Available	2
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models	Aug 3, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	2
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints	Aug 3, 2023	Image GenerationLanguage Modelling	CodeCode Available	2
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World	Aug 3, 2023	AllQuestion Answering	CodeCode Available	2
DETR Doesn't Need Multi-Scale or Locality Design	Aug 3, 2023	Decoder	CodeCode Available	2
From Sparse to Soft Mixtures of Experts	Aug 2, 2023		CodeCode Available	2
Flows: Building Blocks of Reasoning and Collaborating AI	Aug 2, 2023	Prompt Engineering	CodeCode Available	2
Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking	Aug 1, 2023	Multi-Object TrackingMultiple Object Tracking	CodeCode Available	2
AnyLoc: Towards Universal Visual Place Recognition	Aug 1, 2023	Image RetrievalVisual Place Recognition	CodeCode Available	2
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving	Aug 1, 2023	Autonomous DrivingBench2Drive	CodeCode Available	2
FLatten Transformer: Vision Transformer using Focused Linear Attention	Aug 1, 2023	Diversity	CodeCode Available	2
UniVTG: Towards Unified Video-Language Temporal Grounding	Jul 31, 2023	Highlight DetectionMoment Retrieval	CodeCode Available	2
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding	Jul 31, 2023	Multiple-choiceQuestion Answering	CodeCode Available	2
LP-MusicCaps: LLM-Based Pseudo Music Captioning	Jul 31, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio	Jul 31, 2023	AllDownbeat Tracking	CodeCode Available	2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design	Jul 31, 2023	Computational Efficiencytext-to-speech	CodeCode Available	2