The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 10751–10800 of 661570 papers

Title	Date	Tasks	Status	Hype
BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis	Nov 9, 2023	Face Reenactment, NeRF	Code Available	2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving	Nov 9, 2023	Autonomous Driving, Common Sense Reasoning	Code Available	2
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents	Nov 9, 2023	Instruction Following, LLM real-life tasks	Code Available	2
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions	Nov 9, 2023	Hallucination, Information Retrieval	Code Available	2
Agent Lumos: Unified and Modular Training for Open-Source Language Agents	Nov 9, 2023	Math, Question Answering	Code Available	2
A differentiable brain simulator bridging brain simulation and brain-inspired computing	Nov 9, 2023		Code Available	2
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings	Nov 9, 2023	Language Modeling, Language Modelling	Code Available	2
High-Performance Transformers for Table Structure Recognition Need Early Convolutions	Nov 9, 2023	Decoder, Representation Learning	Code Available	2
CellPhoneDB v5: inferring cell-cell communication from single-cell multiomics data	Nov 8, 2023		Code Available	2
Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation	Nov 8, 2023	Style Transfer, Voice Conversion	Code Available	2
Euclidean, Projective, Conformal: Choosing a Geometric Algebra for Equivariant Transformers	Nov 8, 2023		Code Available	2
NExT-Chat: An LMM for Chat, Detection and Segmentation	Nov 8, 2023	Referring Expression, Referring Expression Segmentation	Code Available	2
Rethinking Benchmark and Contamination for Language Models with Rephrased Samples	Nov 8, 2023	HumanEval, MMLU	Code Available	2
Neuro-GPT: Towards A Foundation Model for EEG	Nov 7, 2023	Brain Computer Interface, EEG	Code Available	2
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training	Nov 7, 2023	GPU	Code Available	2
A Survey of Large Language Models Attribution	Nov 7, 2023	Survey	Code Available	2
Towards Garment Sewing Pattern Reconstruction from a Single Image	Nov 7, 2023	Garment Reconstruction, Texture Synthesis	Code Available	2
A Foundation Model for Music Informatics	Nov 6, 2023	Information Retrieval, model	Code Available	2
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch	Nov 6, 2023	Decoder, GSM8K	Code Available	2
PhoGPT: Generative Pre-training for Vietnamese	Nov 6, 2023	Instruction Following	Code Available	2
Can LLMs Follow Simple Rules?	Nov 6, 2023		Code Available	2
GLaMM: Pixel Grounding Large Multimodal Model	Nov 6, 2023	Conversational Question Answering, Image Captioning	Code Available	2
QECO: A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge Computing	Nov 4, 2023	Deep Reinforcement Learning, Edge-computing	Code Available	2
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning	Nov 4, 2023	Multi-Task Learning	Code Available	2
Simplifying Transformer Blocks	Nov 3, 2023	Decoder	Code Available	2
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision	Nov 3, 2023	Optical Flow Estimation, Semantic Segmentation	Code Available	2
Medical Image Segmentation with Domain Adaptation: A Survey	Nov 3, 2023	Domain Adaptation, Image Segmentation	Code Available	2
Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review	Nov 3, 2023	Diagnostic	Code Available	2
PPI++: Efficient Prediction-Powered Inference	Nov 2, 2023	Prediction	Code Available	2
Diffusion Models for Reinforcement Learning: A Survey	Nov 2, 2023	reinforcement-learning, Reinforcement Learning	Code Available	2
Adapting Frechet Audio Distance for Generative Music Evaluation	Nov 2, 2023	FAD	Code Available	2
ProAgent: From Robotic Process Automation to Agentic Process Automation	Nov 2, 2023	Decision Making	Code Available	2
TopicGPT: A Prompt-based Topic Modeling Framework	Nov 2, 2023	Specificity, Topic Models	Code Available	2
Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers	Nov 2, 2023	Prompt Engineering	Code Available	2
JADE: A Linguistics-based Safety Evaluation Platform for Large Language Models	Nov 1, 2023	Natural Questions	Code Available	2
OpenForest: A data catalogue for machine learning in forest monitoring	Nov 1, 2023		Code Available	2
SoulChat: Improving LLMs' Empathy, Listening, and Comfort Abilities through Fine-tuning with Multi-turn Empathy Conversations	Nov 1, 2023		Code Available	2
Efficient LLM Inference on CPUs	Nov 1, 2023	Quantization	Code Available	2
Low-latency Real-time Voice Conversion on CPU	Nov 1, 2023	CPU, Knowledge Distillation	Code Available	2
What's In My Big Data?	Oct 31, 2023	Benchmarking	Code Available	2
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction	Oct 31, 2023	Prediction, Semantic Similarity	Code Available	2
CapsFusion: Rethinking Image-Text Data at Scale	Oct 31, 2023	World Knowledge	Code Available	2
ZoomNeXt: A Unified Collaborative Pyramid Network for Camouflaged Object Detection	Oct 31, 2023	Camouflaged Object Segmentation	Code Available	2
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory	Oct 31, 2023	Deep Learning	Code Available	2
Modular Boundaries in Recurrent Neural Networks	Oct 31, 2023	Community Detection, Dimensionality Reduction	Code Available	2
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition	Oct 30, 2023	Image Classification, Object Detection	Code Available	2
Evaluating Large Language Models: A Comprehensive Survey	Oct 30, 2023	Survey	Code Available	2
Large Trajectory Models are Scalable Motion Predictors and Planners	Oct 30, 2023	Autonomous Driving, Language Modeling	Code Available	2
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks	Oct 30, 2023	Benchmarking, object-detection	Code Available	2
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving	Oct 29, 2023	GPU, Quantization	Code Available	2