| MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series Forecasting | Oct 2, 2024 | Multivariate Time Series ForecastingMultivariate Time Series Forecastingm | CodeCode Available | 3 | 5 |
| TOTEM: TOkenized Time Series EMbeddings for General Time Series Analysis | Feb 26, 2024 | Anomaly DetectionImputation | CodeCode Available | 3 | 5 |
| MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels | May 13, 2024 | Information RetrievalRetrieval | CodeCode Available | 3 | 5 |
| Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative Representations | Oct 15, 2022 | Contrastive LearningData Augmentation | CodeCode Available | 3 | 5 |
| GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding | Jun 16, 2024 | | CodeCode Available | 3 | 5 |
| NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models | Jul 17, 2024 | Instruction FollowingVision and Language Navigation | CodeCode Available | 3 | 5 |
| MetaAgents: Simulating Interactions of Human Behaviors for LLM-based Task-oriented Coordination via Collaborative Generative Agents | Oct 10, 2023 | | CodeCode Available | 3 | 5 |
| MobileNetV4 -- Universal Models for the Mobile Ecosystem | Apr 16, 2024 | Image ClassificationNeural Architecture Search | CodeCode Available | 3 | 5 |
| OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation | May 6, 2025 | Robot ManipulationVision-Language-Action | CodeCode Available | 3 | 5 |
| DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing | Mar 21, 2024 | Image Generationspatial-aware image editing | CodeCode Available | 3 | 5 |
| SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation | Feb 18, 2025 | Voice Cloning | CodeCode Available | 3 | 5 |
| Open-Source Web Service with Morphological Dictionary-Supplemented Deep Learning for Morphosyntactic Analysis of Czech | Jun 18, 2024 | Deep LearningDependency Parsing | CodeCode Available | 3 | 5 |
| Model Inversion Attacks: A Survey of Approaches and Countermeasures | Nov 15, 2024 | Survey | CodeCode Available | 3 | 5 |
| GIFT-Eval: A Benchmark For General Time Series Forecasting Model Evaluation | Oct 14, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 3 | 5 |
| CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms | Nov 16, 2021 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 3 | 5 |
| Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera | Jan 5, 2025 | Data AugmentationDepth Estimation | CodeCode Available | 3 | 5 |
| Leveraging Self-Supervised Learning for Speaker Diarization | Sep 14, 2024 | Self-Supervised Learningspeaker-diarization | CodeCode Available | 3 | 5 |
| ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation | Mar 13, 2024 | Simulated Gaussian Manipulation | CodeCode Available | 3 | 5 |
| Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Jan 30, 2025 | Action RecognitionDomain Adaptation | CodeCode Available | 3 | 5 |
| REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites | Apr 15, 2025 | Autonomous Web NavigationBenchmarking | CodeCode Available | 3 | 5 |
| VideoTetris: Towards Compositional Text-to-Video Generation | Jun 6, 2024 | DenoisingText-to-Video Generation | CodeCode Available | 3 | 5 |
| FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering | Aug 15, 2024 | Computational EfficiencyScheduling | CodeCode Available | 3 | 5 |
| ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget | Jul 31, 2024 | Document-level Closed Information ExtractionEntity Linking | CodeCode Available | 3 | 5 |
| EasyVolcap: Accelerating Neural Volumetric Video Research | Dec 11, 2023 | | CodeCode Available | 3 | 5 |
| SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images | Oct 2, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 3 | 5 |
| High-Speed Stereo Visual SLAM for Low-Powered Computing Devices | Oct 5, 2024 | GPU | CodeCode Available | 3 | 5 |
| Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining | Feb 5, 2024 | Image SegmentationMamba | CodeCode Available | 3 | 5 |
| Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video | Mar 27, 2025 | Camera Pose EstimationDepth Estimation | CodeCode Available | 3 | 5 |
| Automated Movie Generation via Multi-Agent CoT Planning | Mar 10, 2025 | Video Generation | CodeCode Available | 3 | 5 |
| OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data | May 24, 2025 | Image Stylization | CodeCode Available | 3 | 5 |
| KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding | Mar 4, 2025 | HumanEvalmbpp | CodeCode Available | 3 | 5 |
| EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video | Sep 3, 2024 | 3D ReconstructionScene Understanding | CodeCode Available | 3 | 5 |
| UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs | Apr 11, 2024 | | CodeCode Available | 3 | 5 |
| Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models | Mar 14, 2022 | Text Classification | CodeCode Available | 3 | 5 |
| Thinkless: LLM Learns When to Think | May 19, 2025 | GSM8KMath | CodeCode Available | 3 | 5 |
| Rethinking Vision Transformers for MobileNet Size and Speed | Dec 15, 2022 | | CodeCode Available | 3 | 5 |
| Sentiment Reasoning for Healthcare | Jul 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 | 5 |
| MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer | Jan 18, 2024 | | CodeCode Available | 3 | 5 |
| DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks | Feb 24, 2025 | Conditional Image GenerationImage Generation | CodeCode Available | 3 | 5 |
| MuSc: Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled Images | Jan 30, 2024 | Anomaly ClassificationAnomaly Detection | CodeCode Available | 3 | 5 |
| HtFLlib: A Comprehensive Heterogeneous Federated Learning Library and Benchmark | Jun 4, 2025 | Federated LearningTransfer Learning | CodeCode Available | 3 | 5 |
| Motion Anything: Any to Motion Generation | Mar 10, 2025 | Motion GenerationMotion Synthesis | CodeCode Available | 3 | 5 |
| RAP-SAM: Towards Real-Time All-Purpose Segment Anything | Jan 18, 2024 | AllDecoder | CodeCode Available | 3 | 5 |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Jun 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 | 5 |
| EnvGS: Modeling View-Dependent Appearance with Environment Gaussian | Dec 19, 2024 | Novel View Synthesis | CodeCode Available | 3 | 5 |
| A Survey on Data Selection for Language Models | Feb 26, 2024 | SurveyUnsupervised Pre-training | CodeCode Available | 3 | 5 |
| MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Mar 28, 2024 | Image RetrievalImplicit Relations | CodeCode Available | 3 | 5 |
| FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation | Feb 7, 2025 | Computational EfficiencyText-to-Video Generation | CodeCode Available | 3 | 5 |
| A Survey on Deep Learning for Theorem Proving | Apr 15, 2024 | Automated Theorem ProvingDeep Learning | CodeCode Available | 3 | 5 |
| APOLLO: SGD-like Memory, AdamW-level Performance | Dec 6, 2024 | GPUQuantization | CodeCode Available | 3 | 5 |