The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10101–10150 of 661570 papers

Title	Date	Tasks	Status	Hype
Federated Learning with New Knowledge: Fundamentals, Advances, and Futures	Feb 3, 2024	Federated LearningPrivacy Preserving	CodeCode Available	2
Cross-view Masked Diffusion Transformers for Person Image Synthesis	Feb 2, 2024	DenoisingImage Generation	CodeCode Available	2
DeepAAT: Deep Automated Aerial Triangulation for Fast UAV-based Mapping	Feb 2, 2024	3D ReconstructionEarth Observation	CodeCode Available	2
Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram	Feb 2, 2024	DiagnosticECG Classification	CodeCode Available	2
A Single Simple Patch is All You Need for AI-generated Image Detection	Feb 2, 2024	All	CodeCode Available	2
SynthCLIP: Are We Ready for a Fully Synthetic CLIP Training?	Feb 2, 2024		CodeCode Available	2
Improving Sequential Recommendations with LLMs	Feb 2, 2024	Sequential Recommendation	CodeCode Available	2
LitLLM: A Toolkit for Scientific Literature Review	Feb 2, 2024	RAGRetrieval	CodeCode Available	2
TrustAgent: Towards Safe and Trustworthy LLM-based Agents	Feb 2, 2024	Task Planning	CodeCode Available	2
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback	Feb 2, 2024	Code CompletionCode Generation	CodeCode Available	2
Efficient and Effective Time-Series Forecasting with Spiking Neural Networks	Feb 2, 2024	Model SelectionTime Series	CodeCode Available	2
InfMAE: A Foundation Model in the Infrared Modality	Feb 1, 2024	DecoderSelf-Supervised Learning	CodeCode Available	2
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models	Feb 1, 2024		CodeCode Available	2
Towards Efficient Exact Optimization of Language Model Alignment	Feb 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
A Survey on Hallucination in Large Vision-Language Models	Feb 1, 2024	HallucinationSurvey	CodeCode Available	2
Graph Domain Adaptation: Challenges, Progress and Prospects	Feb 1, 2024	Domain AdaptationGRAPH DOMAIN ADAPTATION	CodeCode Available	2
Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents	Feb 1, 2024		CodeCode Available	2
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning	Feb 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
On the Challenges of Fuzzing Techniques via Large Language Models	Feb 1, 2024	software testingSurvey	CodeCode Available	2
CapHuman: Capture Your Moments in Parallel Universes	Feb 1, 2024	Image Generation	CodeCode Available	2
Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management	Feb 1, 2024	Deep Reinforcement LearningManagement	CodeCode Available	2
PAM: Prompting Audio-Language Models for Audio Quality Assessment	Feb 1, 2024	Audio Quality AssessmentMusic Generation	CodeCode Available	2
CF4J: Collaborative Filtering for Java	Feb 1, 2024	Collaborative FilteringRecommendation Systems	CodeCode Available	2
Improved Scene Landmark Detection for Camera Localization	Jan 31, 2024	Camera LocalizationPose Estimation	CodeCode Available	2
EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning	Jan 31, 2024	AudioCapsAudio captioning	CodeCode Available	2
On Prompt-Driven Safeguarding for Large Language Models	Jan 31, 2024		CodeCode Available	2
SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks	Jan 31, 2024	Sentence	CodeCode Available	2
Fin-GAN: forecasting and classifying financial time series via generative adversarial networks	Jan 31, 2024	Generative Adversarial NetworkProbabilistic Time Series Forecasting	CodeCode Available	2
AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error	Jan 31, 2024	Denoising	CodeCode Available	2
ControlCap: Controllable Region-level Captioning	Jan 31, 2024	Dense Captioning	CodeCode Available	2
Local Feature Matching Using Deep Learning: A Survey	Jan 31, 2024	3D ReconstructionDeep Learning	CodeCode Available	2
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement	Jan 31, 2024	Autonomous DrivingLanguage Modeling	CodeCode Available	2
SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition	Jan 31, 2024	Novel View SynthesisSegmentation	CodeCode Available	2
M2-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval	Jan 31, 2024	RetrievalText Retrieval	CodeCode Available	2
EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks	Jan 31, 2024	Audio GenerationSpeech Synthesis	CodeCode Available	2
Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators	Jan 31, 2024	Multivariate Time Series ForecastingTime Series	CodeCode Available	2
EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain	Jan 30, 2024	Image ComprehensionInstruction Following	CodeCode Available	2
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens	Jan 30, 2024	Language Modelling	CodeCode Available	2
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese	Jan 30, 2024	Text Generation	CodeCode Available	2
Weak-to-Strong Jailbreaking on Large Language Models	Jan 30, 2024		CodeCode Available	2
Finetuning Large Language Models for Vulnerability Detection	Jan 30, 2024	Transfer LearningVulnerability Detection	CodeCode Available	2
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks	Jan 30, 2024		CodeCode Available	2
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models	Jan 30, 2024	Data CompressionLanguage Modelling	CodeCode Available	2
Multi-granularity Correspondence Learning from Long-term Noisy Videos	Jan 30, 2024	Action SegmentationLong Video Retrieval (Background Removed)	CodeCode Available	2
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation	Jan 30, 2024	HallucinationKnowledge Distillation	CodeCode Available	2
An Open Software Suite for Event-Based Video	Jan 30, 2024		CodeCode Available	2
MF-MOS: A Motion-Focused Model for Moving Object Segmentation	Jan 30, 2024	Autonomous DrivingObject	CodeCode Available	2
MouSi: Poly-Visual-Expert Vision-Language Models	Jan 30, 2024	Image SegmentationImage-text matching	CodeCode Available	2
Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios	Jan 30, 2024	Benchmarking	CodeCode Available	2
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models	Jan 30, 2024		CodeCode Available	2