The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10851–10900 of 661570 papers

Title	Date	Tasks	Status	Hype
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents	Oct 18, 2023		CodeCode Available	2
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture	Oct 18, 2023	4kimage-classification	CodeCode Available	2
Iterative Methods for Vecchia-Laplace Approximations for Latent Gaussian Process Models	Oct 18, 2023		CodeCode Available	2
LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks	Oct 17, 2023	In-Context Learning	CodeCode Available	2
BitNet: Scaling 1-bit Transformers for Large Language Models	Oct 17, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment	Oct 17, 2023	AttributeObject	CodeCode Available	2
Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting	Oct 16, 2023		CodeCode Available	2
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models	Oct 16, 2023	General Reinforcement LearningGPU	CodeCode Available	2
AdaLomo: Low-memory Optimization with Adaptive Learning Rate	Oct 16, 2023		CodeCode Available	2
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation	Oct 16, 2023	GPUImage Animation	CodeCode Available	2
IDRNet: Intervention-Driven Relation Network for Semantic Segmentation	Oct 16, 2023	RelationRelation Network	CodeCode Available	2
FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models	Oct 16, 2023	Federated Learningparameter-efficient fine-tuning	CodeCode Available	2
HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending	Oct 16, 2023	Attribute	CodeCode Available	2
On Generative Agents in Recommendation	Oct 16, 2023	Collaborative FilteringMovie Recommendation	CodeCode Available	2
Character-LLM: A Trainable Agent for Role-Playing	Oct 16, 2023		CodeCode Available	2
Few-Shot Learning Patterns in Financial Time-Series for Trend-Following Strategies	Oct 16, 2023	Few-Shot LearningTime Series	CodeCode Available	2
The Calysto Scheme Project	Oct 16, 2023		CodeCode Available	2
Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling	Oct 14, 2023	Speech Synthesistext-to-speech	CodeCode Available	2
An Expression Tree Decoding Strategy for Mathematical Equation Generation	Oct 14, 2023	MathMathematical Reasoning	CodeCode Available	2
Hawkeye: A PyTorch-based Library for Fine-Grained Image Recognition with Deep Learning	Oct 14, 2023	Fine-Grained Image Recognition	CodeCode Available	2
A Setwise Approach for Effective and Highly Efficient Zero-shot Ranking with Large Language Models	Oct 14, 2023	Document Ranking	CodeCode Available	2
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models	Oct 13, 2023	HallucinationImage Captioning	CodeCode Available	2
ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models	Oct 13, 2023	Knowledge Base Question AnsweringKnowledge Graphs	CodeCode Available	2
X-Pose: Detecting Any Keypoints	Oct 12, 2023	2D Human Pose Estimation2D Pose Estimation	CodeCode Available	2
PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm	Oct 12, 2023	3D Object Detection3D Reconstruction	CodeCode Available	2
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models	Oct 12, 2023	GPUText to 3D	CodeCode Available	2
Jailbreaking Black Box Large Language Models in Twenty Queries	Oct 12, 2023		CodeCode Available	2
Learning to Act from Actionless Videos through Dense Correspondences	Oct 12, 2023		CodeCode Available	2
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving	Oct 12, 2023	3D Object Detection3D Semantic Segmentation	CodeCode Available	2
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing	Oct 12, 2023	text-guided-image-editing	CodeCode Available	2
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models	Oct 12, 2023	Natural Language UnderstandingQuantization	CodeCode Available	2
OmniControl: Control Any Joint at Any Time for Human Motion Generation	Oct 12, 2023	Motion Generation	CodeCode Available	2
Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes	Oct 12, 2023	GPUNovel View Synthesis	CodeCode Available	2
Octopus: Embodied Vision-Language Programmer from Environmental Feedback	Oct 12, 2023	BenchmarkingCode Generation	CodeCode Available	2
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity	Oct 11, 2023	RetrievalSpecificity	CodeCode Available	2
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models	Oct 11, 2023	Image Generation	CodeCode Available	2
ProbTS: Benchmarking Point and Distributional Forecasting across Diverse Prediction Horizons	Oct 11, 2023	BenchmarkingPosition	CodeCode Available	2
VeCLIP: Improving CLIP Training via Visual-enriched Captions	Oct 11, 2023	Image-text RetrievalRetrieval	CodeCode Available	2
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models	Oct 11, 2023	Code GenerationImage Generation	CodeCode Available	2
LLark: A Multimodal Instruction-Following Language Model for Music	Oct 11, 2023	Instruction FollowingLanguage Modeling	CodeCode Available	2
Large Language Models Are Zero-Shot Time Series Forecasters	Oct 11, 2023	ImputationTime Series	CodeCode Available	2
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model	Oct 11, 2023	Autonomous DrivingImage Generation	CodeCode Available	2
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition	Oct 10, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	2
Making Large Language Models Perform Better in Knowledge Graph Completion	Oct 10, 2023	In-Context LearningKnowledge Graph Completion	CodeCode Available	2
TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning	Oct 10, 2023	3D Lane DetectionAutonomous Driving	CodeCode Available	2
A Semantic Invariant Robust Watermark for Large Language Models	Oct 10, 2023		CodeCode Available	2
Lemur: Harmonizing Natural Language and Code for Language Agents	Oct 10, 2023		CodeCode Available	2
Uni3D: Exploring Unified 3D Representation at Scale	Oct 10, 2023	3D Object ClassificationRetrieval	CodeCode Available	2
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning	Oct 10, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Conformal Prediction for Deep Classifier via Label Ranking	Oct 10, 2023	Conformal PredictionPrediction	CodeCode Available	2