| NeuMan: Neural Human Radiance Field from a Single Video | Mar 23, 2022 | NeRF | CodeCode Available | 3 | 5 |
| Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving | Jul 8, 2025 | Code RepairTransfer Learning | CodeCode Available | 3 | 5 |
| One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale | Mar 12, 2023 | AllImage Generation | CodeCode Available | 3 | 5 |
| Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket | Jan 4, 2024 | image-classificationImage Classification | CodeCode Available | 3 | 5 |
| Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity Analysis | Oct 9, 2023 | BenchmarkingMultivariate Time Series Forecasting | CodeCode Available | 3 | 5 |
| Bridging Language and Items for Retrieval and Recommendation | Mar 6, 2024 | RetrievalSentence | CodeCode Available | 3 | 5 |
| Graph Retrieval-Augmented Generation: A Survey | Aug 15, 2024 | HallucinationRAG | CodeCode Available | 3 | 5 |
| GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Apr 2, 2024 | 3DGSNovel View Synthesis | CodeCode Available | 3 | 5 |
| ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer | Nov 4, 2024 | PositionTime Series | CodeCode Available | 3 | 5 |
| FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework | Aug 12, 2024 | Object TrackingOptical Flow Estimation | CodeCode Available | 3 | 5 |
| HuatuoGPT, towards Taming Language Model to Be a Doctor | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| Improving Transformers with Dynamically Composable Multi-Head Attention | May 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis | Mar 19, 2020 | Generalizable Novel View SynthesisLow-Dose X-Ray Ct Reconstruction | CodeCode Available | 3 | 5 |
| GARField: Group Anything with Radiance Fields | Jan 17, 2024 | Scene Understanding | CodeCode Available | 3 | 5 |
| Do We Need Anisotropic Graph Neural Networks? | Apr 3, 2021 | Graph Neural Network | CodeCode Available | 3 | 5 |
| Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Jan 9, 2024 | GPU | CodeCode Available | 3 | 5 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense ReasoningGPU | CodeCode Available | 3 | 5 |
| WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild | Sep 18, 2024 | 3D Hand Pose EstimationHand Detection | CodeCode Available | 3 | 5 |
| T^3Bench: Benchmarking Current Progress in Text-to-3D Generation | Oct 4, 2023 | 3D GenerationBenchmarking | CodeCode Available | 3 | 5 |
| Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences | Jun 16, 2025 | Document SummarizationGPU | CodeCode Available | 3 | 5 |
| Human-like Episodic Memory for Infinite Context LLMs | Jul 12, 2024 | Computational EfficiencyEvent Segmentation | CodeCode Available | 3 | 5 |
| OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Aug 27, 2024 | 3D Generation3D Shape Generation | CodeCode Available | 3 | 5 |
| OneBit: Towards Extremely Low-bit Large Language Models | Feb 17, 2024 | Quantization | CodeCode Available | 3 | 5 |
| MoMask: Generative Masked Modeling of 3D Human Motions | Nov 29, 2023 | Human motion predictionMotion Forecasting | CodeCode Available | 3 | 5 |
| Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Nov 21, 2024 | Visual Reasoning | CodeCode Available | 3 | 5 |
| SlimPajama-DC: Understanding Data Combinations for LLM Training | Sep 19, 2023 | | CodeCode Available | 3 | 5 |
| Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs | Feb 20, 2025 | Quantization | CodeCode Available | 3 | 5 |
| Splatter Image: Ultra-Fast Single-View 3D Reconstruction | Dec 20, 2023 | 3D Object Reconstruction3D Reconstruction | CodeCode Available | 3 | 5 |
| MatterGen: a generative model for inorganic materials design | Dec 6, 2023 | model | CodeCode Available | 3 | 5 |
| MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Dec 28, 2023 | AutoMLCPU | CodeCode Available | 3 | 5 |
| Universal Instance Perception as Object Discovery and Retrieval | Mar 12, 2023 | Described Object DetectionGeneralized Referring Expression Comprehension | CodeCode Available | 3 | 5 |
| EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks | May 28, 2019 | Action RecognitionDomain Generalization | CodeCode Available | 3 | 5 |
| Beat this! Accurate beat tracking without DBN postprocessing | Jul 31, 2024 | Beat TrackingDownbeat Tracking | CodeCode Available | 3 | 5 |
| BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection | Mar 31, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 3 | 5 |
| Parametric Retrieval Augmented Generation | Jan 27, 2025 | Domain AdaptationRAG | CodeCode Available | 3 | 5 |
| LLMs Get Lost In Multi-Turn Conversation | May 9, 2025 | | CodeCode Available | 3 | 5 |
| ORLM: A Customizable Framework in Training Large Models for Automated Optimization Modeling | May 28, 2024 | Prompt Engineering | CodeCode Available | 3 | 5 |
| MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline | Jan 16, 2024 | GSM8KMath | CodeCode Available | 3 | 5 |
| ViNT: A Foundation Model for Visual Navigation | Jun 26, 2023 | modelVisual Navigation | CodeCode Available | 3 | 5 |
| The Prusti project: Formal verification for Rust | May 20, 2022 | | CodeCode Available | 3 | 5 |
| UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation | Oct 14, 2024 | Semantic SegmentationSemi-supervised Change Detection | CodeCode Available | 3 | 5 |
| RAKG:Document-level Retrieval Augmented Knowledge Graph Construction | Apr 14, 2025 | coreference-resolutionCoreference Resolution | CodeCode Available | 3 | 5 |
| ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels | Oct 29, 2019 | General ClassificationTime Series | CodeCode Available | 3 | 5 |
| AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Feb 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text | May 22, 2023 | Language ModellingLarge Language Model | CodeCode Available | 3 | 5 |
| Punica: Multi-Tenant LoRA Serving | Oct 28, 2023 | GPU | CodeCode Available | 3 | 5 |
| ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation | Apr 12, 2023 | Image GenerationPreference Mapping | CodeCode Available | 3 | 5 |
| RepViT-SAM: Towards Real-Time Segmenting Anything | Dec 10, 2023 | | CodeCode Available | 3 | 5 |
| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action GenerationDecision Making | CodeCode Available | 3 | 5 |
| Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | Mar 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |