| Navigation World Models | Dec 4, 2024 | Robot Navigation, Video Generation | Code Available | 4 | 5 |
| Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models | Apr 21, 2025 | MME, Video MME | Code Available | 4 | 5 |
| Diffusion-Based Planning for Autonomous Driving with Flexible Guidance | Jan 26, 2025 | Autonomous Driving, Imitation Learning | Code Available | 4 | 5 |
| Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed | Mar 7, 2024 | 3D Reconstruction, Image Retrieval | Code Available | 4 | 5 |
| VideoChat: Chat-Centric Video Understanding | May 10, 2023 | Question Answering, Video-based Generative Performance Benchmarking | Code Available | 4 | 5 |
| HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition | Dec 2, 2024 | Gesture Recognition, Hand Detection | Code Available | 4 | 5 |
| Contextual Multilingual Spellchecker for User Queries | May 1, 2023 | | Code Available | 4 | 5 |
| Panoptic Feature Pyramid Networks | Jan 8, 2019 | Instance Segmentation, Panoptic Segmentation | Code Available | 4 | 5 |
| Evolution Transformer: In-Context Evolutionary Optimization | Mar 5, 2024 | | Code Available | 4 | 5 |
| Segment and Track Anything | May 11, 2023 | Autonomous Driving, multimodal interaction | Code Available | 4 | 5 |
| SmoothGrad: removing noise by adding noise | Jun 12, 2017 | Interpretable Machine Learning, Sensitivity | Code Available | 4 | 5 |
| A Comprehensive Survey on 3D Content Generation | Feb 2, 2024 | Survey | Code Available | 4 | 5 |
| Autoregressive Models in Vision: A Survey | Nov 8, 2024 | 3D Generation, Image Generation | Code Available | 4 | 5 |
| SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Dec 10, 2024 | 4D reconstruction, Video Generation | Code Available | 4 | 5 |
| Ray: A Distributed Framework for Emerging AI Applications | Dec 16, 2017 | reinforcement-learning, Reinforcement Learning | Code Available | 4 | 5 |
| RegNet: Self-Regulated Network for Image Classification | Jan 3, 2021 | Classification, General Classification | Code Available | 4 | 5 |
| MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | May 20, 2024 | NeRF, Novel View Synthesis | Code Available | 4 | 5 |
| CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset | Feb 27, 2020 | Dialogue State Tracking, Task-Oriented Dialogue Systems | Code Available | 4 | 5 |
| On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages with Fewer Resources than English | Sep 1, 2021 | Language Modelling | Code Available | 4 | 5 |
| Dive into Deep Learning | Jun 21, 2021 | Deep Learning, Math | Code Available | 4 | 5 |
| RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem | Nov 25, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 4 | 5 |
| Kolmogorov-Arnold Transformer | Sep 16, 2024 | Image Classification | Code Available | 4 | 5 |
| A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery | Jun 16, 2024 | scientific discovery, Survey | Code Available | 4 | 5 |
| Sonata: Self-Supervised Learning of Reliable Point Representations | Mar 20, 2025 | 3D Semantic Segmentation, Self-Supervised Learning | Code Available | 4 | 5 |
| Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models | Feb 27, 2024 | Marketing, Video Generation | Code Available | 4 | 5 |
| fastai: A Layered API for Deep Learning | Feb 11, 2020 | Deep Learning, GPU | Code Available | 4 | 5 |
| Learning Important Features Through Propagating Activation Differences | Apr 10, 2017 | Interpretable Machine Learning | Code Available | 4 | 5 |
| DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents | Jun 13, 2025 | Information Retrieval, Retrieval | Code Available | 4 | 5 |
| Orion-14B: Open-source Multilingual Large Language Models | Jan 20, 2024 | Scheduling | Code Available | 4 | 5 |
| iText2KG: Incremental Knowledge Graphs Construction Using Large Language Models | Sep 5, 2024 | Few-Shot Learning, Information Retrieval | Code Available | 4 | 5 |
| Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge System | May 17, 2024 | Data Augmentation, Speech Dereverberation | Code Available | 4 | 5 |
| KernelBench: Can LLMs Write Efficient GPU Kernels? | Feb 14, 2025 | GPU | Code Available | 4 | 5 |
| Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in Keras | Jul 25, 2023 | Image Segmentation, Segmentation | Code Available | 4 | 5 |
| MaskNet: Introducing Feature-Wise Multiplication to CTR Ranking Models by Instance-Guided Mask | Feb 9, 2021 | Click-Through Rate Prediction, Recommendation Systems | Code Available | 4 | 5 |
| WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning | Nov 4, 2024 | | Code Available | 4 | 5 |
| Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale | Sep 12, 2024 | | Code Available | 4 | 5 |
| A Framework For Contrastive Self-Supervised Learning And Designing A New Approach | Aug 31, 2020 | Data Augmentation, Image Classification | Code Available | 4 | 5 |
| Brain-inspired Multilayer Perceptron with Spiking Neurons | Mar 28, 2022 | Inductive Bias | Code Available | 4 | 5 |
| V3D: Video Diffusion Models are Effective 3D Generators | Mar 11, 2024 | 3D Generation, Novel View Synthesis | Code Available | 4 | 5 |
| LLM4AD: A Platform for Algorithm Design with Large Language Model | Dec 23, 2024 | Language Modeling, Language Modelling | Code Available | 4 | 5 |
| An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free Counting | Apr 20, 2018 | | Code Available | 4 | 5 |
| An Extended Sequence Tagging Vocabulary for Grammatical Error Correction | Feb 12, 2023 | Grammatical Error Correction, Morphological Inflection | Code Available | 4 | 5 |
| GPUTreeShap: Massively Parallel Exact Calculation of SHAP Scores for Tree Ensembles | Oct 27, 2020 | BIG-bench Machine Learning, CPU | Code Available | 4 | 5 |
| TransPixeler: Advancing Text-to-Video Generation with Transparency | Jan 6, 2025 | Text-to-Video Generation, Video Generation | Code Available | 4 | 5 |
| BlazePose: On-device Real-time Body Pose tracking | Jun 17, 2020 | 2D Human Pose Estimation, 3D Human Pose Estimation | Code Available | 4 | 5 |
| FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on | Nov 15, 2024 | Virtual Try-on | Code Available | 4 | 5 |
| EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary Computation | Jan 29, 2023 | GPU, Navigate | Code Available | 4 | 5 |
| Amortized Planning with Large-Scale Transformers: A Case Study on Chess | Feb 7, 2024 | Memorization | Code Available | 4 | 5 |
| LISA: Reasoning Segmentation via Large Language Model | Aug 1, 2023 | Language Modeling, Language Modelling | Code Available | 4 | 5 |
| Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding | Sep 22, 2024 | Anomaly Detection, GPU | Code Available | 4 | 5 |