| Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training | Nov 25, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| Brain Tumour Removing and Missing Modality Generation using 3D WDM | Nov 7, 2024 | GPUPrediction | CodeCode Available | 2 |
| Center-based 3D Object Detection and Tracking | Jun 19, 2020 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 2 |
| Vision6D: 3D-to-2D Interactive Visualization and Annotation Tool for 6D Pose Estimation | Apr 21, 2025 | 6D Pose EstimationPose Estimation | CodeCode Available | 2 |
| SlimSAM: 0.1% Data Makes Segment Anything Slim | Dec 8, 2023 | | CodeCode Available | 2 |
| Personality Alignment of Large Language Models | Aug 21, 2024 | Personality Alignment | CodeCode Available | 2 |
| STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction | Apr 28, 2025 | GPU | CodeCode Available | 2 |
| Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking | Dec 20, 2024 | MambaObject Tracking | CodeCode Available | 2 |
| Fast Training of Diffusion Models with Masked Transformers | Jun 15, 2023 | DecoderDenoising | CodeCode Available | 2 |
| SkiROS2: A skill-based Robot Control Platform for ROS | Jun 29, 2023 | SchedulingTask Planning | CodeCode Available | 2 |
| Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding | May 23, 2022 | | CodeCode Available | 2 |
| VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models | Oct 10, 2024 | Math | CodeCode Available | 2 |
| Prodigy: An Expeditiously Adaptive Parameter-Free Learner | Jun 9, 2023 | | CodeCode Available | 2 |
| Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Sep 2, 2024 | DiversityOffline RL | CodeCode Available | 2 |
| Associative Recurrent Memory Transformer | Jul 5, 2024 | Retrieval | CodeCode Available | 2 |
| UltraMedical: Building Specialized Generalists in Biomedicine | Jun 6, 2024 | | CodeCode Available | 2 |
| ResCLIP: Residual Attention for Training-free Dense Vision-language Inference | Nov 24, 2024 | AttributeSemantic Segmentation | CodeCode Available | 2 |
| STAR: Scale-wise Text-to-image generation via Auto-Regressive representations | Jun 16, 2024 | DiversityImage Generation | CodeCode Available | 2 |
| YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection | Aug 10, 2023 | Objectobject-detection | CodeCode Available | 2 |
| AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent Behavior | Mar 20, 2024 | | CodeCode Available | 2 |
| Low-Light Image Enhancement via Structure Modeling and Guidance | May 10, 2023 | Edge DetectionImage Enhancement | CodeCode Available | 2 |
| Vision-Language Pre-Training with Triple Contrastive Learning | Feb 21, 2022 | Contrastive Learningcross-modal alignment | CodeCode Available | 2 |
| LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs | Jun 17, 2025 | | CodeCode Available | 2 |
| UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces | Dec 25, 2023 | Image SegmentationObject | CodeCode Available | 2 |
| KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints | May 10, 2022 | 3D Face Reconstruction3D Human Reconstruction | CodeCode Available | 2 |
| RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control | Jun 6, 2023 | continuous-controlContinuous Control | CodeCode Available | 2 |
| DepthMaster: Taming Diffusion Models for Monocular Depth Estimation | Jan 5, 2025 | DenoisingDepth Estimation | CodeCode Available | 2 |
| Efficient Autoregressive Audio Modeling via Next-Scale Prediction | Aug 16, 2024 | Audio GenerationFAD | CodeCode Available | 2 |
| Accelerating DETR Convergence via Semantic-Aligned Matching | Mar 14, 2022 | Objectobject-detection | CodeCode Available | 2 |
| SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models | Mar 4, 2022 | Contrastive LearningGraph Embedding | CodeCode Available | 2 |
| Make-A-Shape: a Ten-Million-scale 3D Shape Model | Jan 20, 2024 | | CodeCode Available | 2 |
| On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection | Oct 31, 2024 | Video Forensics | CodeCode Available | 2 |
| The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG) | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Adversarial Detection and Correction by Matching Prediction Distributions | Feb 21, 2020 | Prediction | CodeCode Available | 2 |
| ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Jul 2, 2024 | PredictionText to 3D | CodeCode Available | 2 |
| ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation | Feb 20, 2025 | 3D Molecule GenerationProtein Design | CodeCode Available | 2 |
| Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching | Mar 1, 2024 | Stereo Matching | CodeCode Available | 2 |
| Zero Shot Health Trajectory Prediction Using Transformer | Jul 30, 2024 | ICU AdmissionICU Mortality | CodeCode Available | 2 |
| Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics | Feb 19, 2025 | | CodeCode Available | 2 |
| X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios | Nov 2, 2024 | Denoising | CodeCode Available | 2 |
| MHG-GNN: Combination of Molecular Hypergraph Grammar with Graph Neural Network | Sep 28, 2023 | Graph Neural NetworkPrediction | CodeCode Available | 2 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Apr 25, 2024 | Exemplar-Free CountingFew-shot Object Counting and Detection | CodeCode Available | 2 |
| Learning-Based Defect Recognitions for Autonomous UAV Inspections | Feb 13, 2023 | Crack SegmentationSegmentation | CodeCode Available | 2 |
| InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences | Dec 2, 2024 | | CodeCode Available | 2 |
| LaRE^2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection | Mar 26, 2024 | Image Generation | CodeCode Available | 2 |
| Deep Learning-Based Point Cloud Registration: A Comprehensive Survey and Taxonomy | Apr 22, 2024 | Autonomous DrivingDeep Learning | CodeCode Available | 2 |
| Geometry Aware Operator Transformer as an Efficient and Accurate Neural Surrogate for PDEs on Arbitrary Domains | May 24, 2025 | Computational EfficiencyOperator learning | CodeCode Available | 2 |
| Poison-splat: Computation Cost Attack on 3D Gaussian Splatting | Oct 10, 2024 | 3DGS | CodeCode Available | 2 |
| Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Mar 18, 2024 | Machine TranslationTranslation | CodeCode Available | 2 |
| A Survey on Diffusion Models for Recommender Systems | Sep 8, 2024 | Data AugmentationRecommendation Systems | CodeCode Available | 2 |