| Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles | Dec 5, 2016 | Image Classificationregression | VerifiedCommunity Verified — 1 reproduction | 2 | 23 |
| Energy-Based Transformers are Scalable Learners and Thinkers | Jul 2, 2025 | DenoisingImage Denoising | VerifiedCommunity Verified — 1 reproduction | 5 | 18 |
| Training independent subnetworks for robust prediction | Oct 13, 2020 | Image Classification | VerifiedCommunity Verified — 1 reproduction | 1 | 18 |
| Deep Ensembles: A Loss Landscape Perspective | Dec 5, 2019 | | VerifiedCommunity Verified — 1 reproduction | 1 | 18 |
| Universal Reasoning Model | Dec 26, 2025 | | VerifiedCommunity Verified — 1 reproduction | 1 | 18 |
| OpenHands: An Open Platform for AI Software Developers as Generalist Agents | Jul 23, 2024 | | CodeCode Available | 16 | 5 |
| YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information | Feb 21, 2024 | object-detectionObject Detection | CodeCode Available | 16 | 5 |
| MinerU: An Open-Source Solution for Precise Document Content Extraction | Sep 27, 2024 | DiversityOptical Character Recognition (OCR) | CodeCode Available | 16 | 5 |
| Docling Technical Report | Aug 19, 2024 | | CodeCode Available | 16 | 5 |
| DeepSeek-V3 Technical Report | Dec 27, 2024 | GPULanguage Modeling | CodeCode Available | 16 | 5 |
| AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems | Aug 9, 2024 | | CodeCode Available | 16 | 5 |
| Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory | Apr 28, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 16 | 5 |
| SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion | Mar 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 15 | 5 |
| DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning | Jan 22, 2025 | Mathematical ReasoningMulti-task Language Understanding | CodeCode Available | 15 | 5 |
| LightRAG: Simple and Fast Retrieval-Augmented Generation | Oct 8, 2024 | Information RetrievalRAG | CodeCode Available | 14 | 5 |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 14 | 5 |
| Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models | Feb 22, 2024 | ArticlesRetrieval | CodeCode Available | 14 | 5 |
| TradingAgents: Multi-Agents LLM Financial Trading Framework | Dec 28, 2024 | Management | CodeCode Available | 14 | 5 |
| ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools | Jun 18, 2024 | AllGSM8K | CodeCode Available | 14 | 5 |
| From Local to Global: A Graph RAG Approach to Query-Focused Summarization | Apr 24, 2024 | Query-focused SummarizationQuestion Answering | CodeCode Available | 14 | 5 |
| 1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs | Oct 21, 2024 | | CodeCode Available | 14 | 5 |
| FLUX that Plays Music | Sep 1, 2024 | Music GenerationText-to-Music Generation | CodeCode Available | 14 | 5 |
| Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference | Mar 7, 2024 | Chatbot | CodeCode Available | 14 | 5 |
| UI-TARS: Pioneering Automated GUI Interaction with Native Agents | Jan 21, 2025 | | CodeCode Available | 14 | 5 |
| Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k | Mar 12, 2025 | Video Generation | CodeCode Available | 14 | 5 |
| Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria Reranking | Mar 14, 2025 | AllLarge Language Model | CodeCode Available | 14 | 5 |
| Autonomous Agents for Collaborative Task under Information Asymmetry | Jun 21, 2024 | Language ModellingLarge Language Model | CodeCode Available | 14 | 5 |
| Qwen3 Technical Report | May 14, 2025 | Code GenerationMathematical Reasoning | CodeCode Available | 14 | 5 |
| Qwen2.5 Technical Report | Dec 19, 2024 | Common Sense Reasoning | CodeCode Available | 13 | 5 |
| Qwen2 Technical Report | Jul 15, 2024 | Arithmetic ReasoningGSM8K | CodeCode Available | 13 | 5 |
| R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization | May 21, 2025 | Code GenerationModel Optimization | CodeCode Available | 13 | 5 |
| Open-Sora: Democratizing Efficient Video Production for All | Dec 29, 2024 | AllImage Generation | CodeCode Available | 13 | 5 |
| Bitnet.cpp: Efficient Edge Inference for Ternary LLMs | Feb 17, 2025 | | CodeCode Available | 13 | 5 |
| MiniCPM-V: A GPT-4V Level MLLM on Your Phone | Aug 3, 2024 | HallucinationMultiple-choice | CodeCode Available | 12 | 5 |
| Zep: A Temporal Knowledge Graph Architecture for Agent Memory | Jan 20, 2025 | Large Language ModelRAG | CodeCode Available | 12 | 5 |
| OmniParser for Pure Vision Based GUI Agent | Aug 1, 2024 | Natural Language Visual Grounding | CodeCode Available | 12 | 5 |
| SAM 2: Segment Anything in Images and Videos | Aug 1, 2024 | Image SegmentationRobot Manipulation Generalization | CodeCode Available | 12 | 5 |
| FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision | Jul 11, 2024 | GPUQuantization | CodeCode Available | 12 | 5 |
| SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics | Jun 2, 2025 | Action GenerationGPU | CodeCode Available | 12 | 5 |
| DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence | Jan 25, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 11 | 5 |
| Qwen2.5-Coder Technical Report | Sep 18, 2024 | Code Generation | CodeCode Available | 11 | 5 |
| EAP4EMSIG -- Experiment Automation Pipeline for Event-Driven Microscopy to Smart Microfluidic Single-Cells Analysis | Nov 6, 2024 | | CodeCode Available | 11 | 5 |
| AgentScope: A Flexible yet Robust Multi-Agent Platform | Feb 21, 2024 | Multi-agent Integration | CodeCode Available | 11 | 5 |
| NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security | Jun 8, 2024 | Task PlanningVulnerability Detection | CodeCode Available | 11 | 5 |
| WebWalker: Benchmarking LLMs in Web Traversal | Jan 13, 2025 | BenchmarkingOpen-Domain Question Answering | CodeCode Available | 11 | 5 |
| Gymnasium: A Standard Interface for Reinforcement Learning Environments | Jul 24, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 11 | 5 |
| KAN: Kolmogorov-Arnold Networks | Apr 30, 2024 | Kolmogorov-Arnold Networks | CodeCode Available | 11 | 5 |
| F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching | Oct 9, 2024 | Denoisingtext-to-speech | CodeCode Available | 11 | 5 |
| HunyuanVideo: A Systematic Framework For Large Video Generative Models | Dec 3, 2024 | Video AlignmentVideo Generation | CodeCode Available | 11 | 5 |
| Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution | Sep 18, 2024 | Natural Language Visual Grounding | CodeCode Available | 11 | 5 |