The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1851–1875 of 661570 papers

Title	Date	Tasks	Status	Hype
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models	Feb 27, 2024	MarketingVideo Generation	CodeCode Available	4
LLM Inference Unveiled: Survey and Roofline Model Insights	Feb 26, 2024	Knowledge DistillationLanguage Modelling	CodeCode Available	4
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation	Feb 26, 2024	Code Documentation GenerationCode Generation	CodeCode Available	4
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT	Feb 26, 2024		CodeCode Available	4
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering	Feb 26, 2024	Evidence SelectionOpen-Ended Question Answering	CodeCode Available	4
Neural Operators with Localized Integral and Differential Kernels	Feb 26, 2024	Operator learning	CodeCode Available	4
Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step	Feb 25, 2024	Code GenerationHumanEval	CodeCode Available	4
Knowledge Fusion of Chat LLMs: A Preliminary Technical Report	Feb 25, 2024		CodeCode Available	4
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning	Feb 23, 2024		CodeCode Available	4
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System	Feb 23, 2024	AI Agent	CodeCode Available	4
Self-Supervised Pre-Training for Table Structure Recognition Transformer	Feb 23, 2024	Representation Learning	CodeCode Available	4
Cameras as Rays: Pose Estimation via Ray Diffusion	Feb 22, 2024	3D ReconstructionCamera Pose Estimation	CodeCode Available	4
2D Matryoshka Sentence Embeddings	Feb 22, 2024	RAGRepresentation Learning	CodeCode Available	4
TinyLLaVA: A Framework of Small-scale Large Multimodal Models	Feb 22, 2024	Visual Question Answering	CodeCode Available	4
Large Language Models for Data Annotation and Synthesis: A Survey	Feb 21, 2024	Survey	CodeCode Available	4
Benchmarking Retrieval-Augmented Generation for Medicine	Feb 20, 2024	BenchmarkingInformation Retrieval	CodeCode Available	4
Neural Network Diffusion	Feb 20, 2024	Decoder	CodeCode Available	4
FinBen: A Holistic Financial Benchmark for Large Language Models	Feb 20, 2024	Question AnsweringRAG	CodeCode Available	4
Aria Everyday Activities Dataset	Feb 20, 2024		CodeCode Available	4
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling	Feb 19, 2024	Language ModelingLanguage Modelling	CodeCode Available	4
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs	Feb 19, 2024	Knowledge Distillation	CodeCode Available	4
GIM: Learning Generalizable Image Matcher From Internet Videos	Feb 16, 2024	3D ReconstructionCamera Pose Estimation	CodeCode Available	4
In Search of Needles in a 11M Haystack: Recurrent Memory Finds What LLMs Miss	Feb 16, 2024	RAG	CodeCode Available	4
Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation	Feb 16, 2024	Cardiac SegmentationDecoder	CodeCode Available	4
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation	Feb 16, 2024	Knowledge DistillationQuantization	CodeCode Available	4