The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers


Showing 11051–11100 of 661570 papers

Title	Date	Tasks	Status	Hype
Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite	Sep 15, 2023	Question Answering	Code Available	2
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context	Sep 15, 2023		Code Available	2
Optimization of Rank Losses for Image Retrieval	Sep 15, 2023	Image Retrieval, Retrieval	Code Available	2
PromptASR for contextualized ASR with controllable style	Sep 14, 2023	Automatic Speech Recognition, speech-recognition	Code Available	2
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec	Sep 14, 2023	Automatic Speech Recognition, speech-recognition	Code Available	2
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning	Sep 14, 2023	Hallucination, In-Context Learning	Code Available	2
VerilogEval: Evaluating Large Language Models for Verilog Code Generation	Sep 14, 2023	Benchmarking, Code Generation	Code Available	2
Generative Image Dynamics	Sep 14, 2023		Code Available	2
Unified Human-Scene Interaction via Prompted Chain-of-Contacts	Sep 14, 2023	Language Modeling, Language Modelling	Code Available	2
Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting	Sep 13, 2023	Image Segmentation, Medical Image Segmentation	Code Available	2
PILOT: A Pre-Trained Model-Based Continual Learning Toolbox	Sep 13, 2023	class-incremental learning, Class Incremental Learning	Code Available	2
SafetyBench: Evaluating the Safety of Large Language Models	Sep 13, 2023	Multiple-choice	Code Available	2
CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics	Sep 13, 2023		Code Available	2
BHASA: A Holistic Southeast Asian Linguistic and Cultural Evaluation Suite for Large Language Models	Sep 12, 2023	Diagnostic, Natural Language Understanding	Code Available	2
Commands as AI Conversations	Sep 12, 2023		Code Available	2
Temporal Action Localization with Enhanced Instant Discriminability	Sep 11, 2023	Action Detection, Action Localization	Code Available	2
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning	Sep 11, 2023	Mixture-of-Experts, parameter-efficient fine-tuning	Code Available	2
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation	Sep 11, 2023	Autonomous Driving, Domain Generalization	Code Available	2
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning	Sep 11, 2023	Math, Mathematical Reasoning	Code Available	2
Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications	Sep 11, 2023	Language Modeling, Language Modelling	Code Available	2
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase	Sep 11, 2023	3D Semantic Segmentation, LIDAR Semantic Segmentation	Code Available	2
A physics-informed and attention-based graph learning approach for regional electric vehicle charging demand prediction	Sep 11, 2023	Graph Learning, Meta-Learning	Code Available	2
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation	Sep 10, 2023	Talking Head Generation	Code Available	2
VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching	Sep 10, 2023	text-to-speech, Text to Speech	Code Available	2
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization	Sep 9, 2023	Language Modelling, Large Language Model	Code Available	2
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks	Sep 7, 2023	Keypoint Detection	Code Available	2
A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation	Sep 7, 2023	Organ Segmentation, Segmentation	Code Available	2
XGen-7B Technical Report	Sep 7, 2023	2k, 8k	Code Available	2
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models	Sep 7, 2023	TruthfulQA	Code Available	2
PyGraft: Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips	Sep 7, 2023	Benchmarking, Knowledge Graphs	Code Available	2
Natural and Robust Walking using Reinforcement Learning without Demonstrations in High-Dimensional Musculoskeletal Models	Sep 6, 2023	Reinforcement Learning (RL)	Code Available	2
GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models	Sep 6, 2023		Code Available	2
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network	Sep 6, 2023	Generative Adversarial Network, Speech Synthesis	Code Available	2
Automated Bioinformatics Analysis via AutoBA	Sep 6, 2023	AI Agent, Language Modeling	Code Available	2
GPT Can Solve Mathematical Problems Without a Calculator	Sep 6, 2023	Language Modeling, Language Modelling	Code Available	2
CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra	Sep 6, 2023	CoLA, Gaussian Processes	Code Available	2
Dynamic Brain Transformer with Multi-level Attention for Functional Brain Network Analysis	Sep 5, 2023		Code Available	2
GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction	Sep 5, 2023	3D Reconstruction, global-optimization	Code Available	2
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning	Sep 5, 2023	Decoder, Image Generation	Code Available	2
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention	Sep 4, 2023	Image Classification, Instance Segmentation	Code Available	2
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis	Sep 4, 2023	Image Generation	Code Available	2
Benchmarking Large Language Models in Retrieval-Augmented Generation	Sep 4, 2023	Benchmarking, counterfactual	Code Available	2
NLLB-CLIP -- train performant multilingual image retrieval model on a budget	Sep 4, 2023	Image Retrieval, Retrieval	Code Available	2
Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images	Sep 4, 2023	Change Detection, Interactive Segmentation	Code Available	2
Orientation-Independent Chinese Text Recognition in Scene Images	Sep 3, 2023	Benchmarking, Image Reconstruction	Code Available	2
Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning	Sep 3, 2023	Scene Text Recognition	Code Available	2
RevColV2: Exploring Disentangled Representations in Masked Image Modeling	Sep 2, 2023	Decoder, image-classification	Code Available	2
CityDreamer: Compositional Generative Model of Unbounded 3D Cities	Sep 1, 2023	model, Scene Generation	Code Available	2
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation	Sep 1, 2023	3D Open-Vocabulary Instance Segmentation, 3D Open-Vocabulary Object Detection	Code Available	2
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following	Sep 1, 2023	3D Generation, 3D Question Answering (3D-QA)	Code Available	2