The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 1576–1600 of 661,570 papers

Title	Date	Tasks	Status	Hype
On the limits of agency in agent-based models	Sep 14, 2024	Computational Efficiency, counterfactual	Code Available	4
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval	Sep 14, 2024	Contrastive Learning, Image Retrieval	Code Available	4
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale	Sep 12, 2024		Code Available	4
GeoCalib: Learning Single-image Calibration with Geometric Optimization	Sep 10, 2024	3D geometry, Visual Localization	Code Available	4
RealisDance: Equip controllable character animation with realistic hands	Sep 10, 2024		Code Available	4
One-Shot Diffusion Mimicker for Handwritten Text Generation	Sep 6, 2024	Handwriting generation, Text Generation	Code Available	4
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation	Sep 6, 2024	Image Generation, Image Reconstruction	Code Available	4
xLAM: A Family of Large Action Models to Empower AI Agent Systems	Sep 5, 2024	AI Agent	Code Available	4
iText2KG: Incremental Knowledge Graphs Construction Using Large Language Models	Sep 5, 2024	Few-Shot Learning, Information Retrieval	Code Available	4
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA	Sep 4, 2024	Question Answering, Sentence	Code Available	4
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark	Sep 4, 2024	Optical Character Recognition (OCR)	Code Available	4
Large Language Model-Based Agents for Software Engineering: A Survey	Sep 4, 2024	AI Agent, Language Modeling	Code Available	4
OLMoE: Open Mixture-of-Experts Language Models	Sep 3, 2024	Language Modeling, Language Modelling	Code Available	4
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching	Sep 1, 2024	Patch Matching, Stereo Matching	Code Available	4
Diffusion Policy Policy Optimization	Sep 1, 2024	continuous-control, Continuous Control	Code Available	4
CrisperWhisper: Accurate Timestamps on Verbatim Speech Transcriptions	Aug 29, 2024	Dynamic Time Warping, speech-recognition	Code Available	4
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders	Aug 28, 2024	Optical Character Recognition	Code Available	4
MegActor-Σ: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer	Aug 27, 2024	Portrait Animation	Code Available	4
Text2SQL is Not Enough: Unifying AI and Databases with TAG	Aug 27, 2024	RAG, Retrieval-augmented Generation	Code Available	4
Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web	Aug 26, 2024	Decision Making, Multi-class Classification	Code Available	4
EmbodiedSAM: Online Segment Any 3D Thing in Real Time	Aug 21, 2024	3D Instance Segmentation, GPU	Code Available	4
SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition	Aug 20, 2024	Emotion Recognition, Multimodal Emotion Recognition	Code Available	4
RUMI: Rummaging Using Mutual Information	Aug 19, 2024	Model Predictive Control, Object	Code Available	4
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance	Aug 15, 2024	TAR, Video Generation	Code Available	4
FuseChat: Knowledge Fusion of Chat Models	Aug 15, 2024	Instruction Following	Code Available	4