| torchgfn: A PyTorch GFlowNet library | May 24, 2023 | | Code Available | 2 | 5 |
| Are Graph Augmentations Necessary? Simple Graph Contrastive Learning for Recommendation | Dec 16, 2021 | Contrastive Learning, Recommendation Systems | Code Available | 2 | 5 |
| UniVTG: Towards Unified Video-Language Temporal Grounding | Jul 31, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment | Sep 14, 2022 | Retrieval, Text Retrieval | Code Available | 2 | 5 |
| On Bringing Robots Home | Nov 27, 2023 | | Code Available | 2 | 5 |
| PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes | Sep 19, 2023 | Self-Driving Cars | Code Available | 2 | 5 |
| Adversarial Open Domain Adaptation for Sketch-to-Photo Synthesis | Apr 12, 2021 | Domain Adaptation, Image-to-Image Translation | Code Available | 2 | 5 |
| One Model for ALL: Low-Level Task Interaction Is a Key to Task-Agnostic Image Fusion | Feb 27, 2025 | All | Code Available | 2 | 5 |
| Efficient Large-Scale Traffic Forecasting with Transformers: A Spatial Data Management Perspective | Dec 13, 2024 | Management, Traffic Prediction | Code Available | 2 | 5 |
| Backtracing: Retrieving the Cause of the Query | Mar 6, 2024 | Information Retrieval, Language Modeling | Code Available | 2 | 5 |
| ConceptFusion: Open-set Multimodal 3D Mapping | Feb 14, 2023 | 3D geometry, Autonomous Driving | Code Available | 2 | 5 |
| CapHuman: Capture Your Moments in Parallel Universes | Feb 1, 2024 | Image Generation | Code Available | 2 | 5 |
| RBF-PINN: Non-Fourier Positional Embedding in Physics-Informed Neural Networks | Feb 13, 2024 | | Code Available | 2 | 5 |
| Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification | Mar 15, 2024 | Object | Code Available | 2 | 5 |
| LightSeq2: Accelerated Training for Transformer-based Models on GPUs | Oct 12, 2021 | Decoder, GPU | Code Available | 2 | 5 |
| You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANs | Mar 19, 2024 | Denoising, Image Generation | Code Available | 2 | 5 |
| Deep Learning for Time Series Forecasting: Tutorial and Literature Survey | Apr 21, 2020 | BIG-bench Machine Learning, Deep Learning | Code Available | 2 | 5 |
| BiFormer: Vision Transformer with Bi-Level Routing Attention | Mar 15, 2023 | Computational Efficiency, GPU | Code Available | 2 | 5 |
| MakeItTalk: Speaker-Aware Talking-Head Animation | Apr 27, 2020 | Talking Face Generation, Talking Head Generation | Code Available | 2 | 5 |
| Lite DETR : An Interleaved Multi-Scale Encoder for Efficient DETR | Mar 13, 2023 | object-detection, Object Detection | Code Available | 2 | 5 |
| Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion | Mar 21, 2023 | Optical Flow Estimation, Scene Flow Estimation | Code Available | 2 | 5 |
| CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control | May 29, 2024 | RAG, Response Generation | Code Available | 2 | 5 |
| Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges | Apr 24, 2024 | Drug Design, Inductive Bias | Code Available | 2 | 5 |
| Scalable Zero-shot Entity Linking with Dense Entity Retrieval | Nov 10, 2019 | Entity Embeddings, Entity Linking | Code Available | 2 | 5 |
| MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection | Mar 24, 2022 | 3D Object Detection, 3D Object Detection From Monocular Images | Code Available | 2 | 5 |
| RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Jun 11, 2024 | AI Agent, Descriptive | Code Available | 2 | 5 |
| RhythmFormer: Extracting Patterned rPPG Signals based on Periodic Sparse Attention | Feb 20, 2024 | | Code Available | 2 | 5 |
| Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future | Sep 27, 2023 | Navigate | Code Available | 2 | 5 |
| Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching | Jun 3, 2024 | Denoising | Code Available | 2 | 5 |
| MiVOLO: Multi-input Transformer for Age and Gender Estimation | Jul 10, 2023 | Age And Gender Classification, Age and Gender Estimation | Code Available | 2 | 5 |
| Graph Neural Networks in Supply Chain Analytics and Optimization: Concepts, Perspectives, Dataset and Benchmarks | Nov 13, 2024 | Anomaly Detection, Demand Forecasting | Code Available | 2 | 5 |
| Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation | Dec 5, 2024 | Semantic Segmentation, Time Series | Code Available | 2 | 5 |
| RetinaFace: Single-Shot Multi-Level Face Localisation in the Wild | Jun 1, 2020 | 3D Face Reconstruction, Face Alignment | Code Available | 2 | 5 |
| RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining | Jul 31, 2024 | Optical Flow Estimation, Rain Removal | Code Available | 2 | 5 |
| Mergenetic: a Simple Evolutionary Model Merging Library | May 16, 2025 | Evolutionary Algorithms, model | Code Available | 2 | 5 |
| Augmented Object Intelligence with XR-Objects | Apr 20, 2024 | Object, Semantic Segmentation | Code Available | 2 | 5 |
| TART: A plug-and-play Transformer module for task-agnostic reasoning | Sep 21, 2023 | | Code Available | 2 | 5 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Dec 4, 2023 | 3D Human Pose Estimation, Action Recognition | Code Available | 2 | 5 |
| Face Swap via Diffusion Model | Mar 2, 2024 | Face Alignment, Face Detection | Code Available | 2 | 5 |
| Segment anything model 2: an application to 2D and 3D medical images | Aug 1, 2024 | Computed Tomography (CT), Segmentation | Code Available | 2 | 5 |
| Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following | Sep 1, 2023 | 3D Generation, 3D Question Answering (3D-QA) | Code Available | 2 | 5 |
| A Comparative Study on Reasoning Patterns of OpenAI's o1 Model | Oct 17, 2024 | Math | Code Available | 2 | 5 |
| RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars | May 22, 2023 | 2k, Image Matting | Code Available | 2 | 5 |
| Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | Mar 24, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| SMILEtrack: SiMIlarity LEarning for Occlusion-Aware Multiple Object Tracking | Nov 16, 2022 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 2 | 5 |
| SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields | Dec 5, 2022 | 3D Reconstruction, 3D Scene Reconstruction | Code Available | 2 | 5 |
| Augraphy: A Data Augmentation Library for Document Images | Aug 30, 2022 | Data Augmentation, Denoising | Code Available | 2 | 5 |
| EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models | Feb 1, 2024 | | Code Available | 2 | 5 |
| Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects | Apr 1, 2024 | Articulated Object modelling | Code Available | 2 | 5 |
| Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation | Apr 1, 2024 | Denoising | Code Available | 2 | 5 |