| Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics | Jul 11, 2022 | Depth Estimation, Monocular Depth Estimation | Code Available | 2 | 5 |
| Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning | Jul 11, 2022 | Image Classification, Instance Segmentation | Code Available | 2 | 5 |
| SenseFi: A Library and Benchmark on Deep-Learning-Empowered WiFi Human Sensing | Jul 16, 2022 | Activity Recognition, Deep Learning | Code Available | 2 | 5 |
| ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Jul 19, 2022 | Camera Pose Estimation, Motion Segmentation | Code Available | 2 | 5 |
| CGVQM+D: Computer Graphics Video Quality Metric and Dataset | Jun 13, 2025 | Denoising, Novel View Synthesis | Code Available | 2 | 5 |
| Language Models Can Teach Themselves to Program Better | Jul 29, 2022 | Code Generation | Code Available | 2 | 5 |
| Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours | Aug 2, 2022 | Text Classification | Code Available | 2 | 5 |
| MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth | Aug 4, 2022 | 3D Reconstruction, Point Clouds | Code Available | 2 | 5 |
| No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects | Aug 7, 2022 | Image Classification | Code Available | 2 | 5 |
| CitySim: A Drone-Based Vehicle Trajectory Dataset for Safety Oriented Research and Digital Twins | Aug 23, 2022 | Object Detection | Code Available | 2 | 5 |
| GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians | Dec 4, 2023 | Face Model | Code Available | 2 | 5 |
| GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Dec 4, 2023 | 2k, Depth Estimation | Code Available | 2 | 5 |
| Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data | Aug 4, 2023 | Question Answering, Visual Question Answering | Code Available | 2 | 5 |
| CW-ERM: Improving Autonomous Driving Planning with Closed-loop Weighted Empirical Risk Minimization | Oct 5, 2022 | Autonomous Driving, Imitation Learning | Code Available | 2 | 5 |
| Text Detection Forgot About Document OCR | Oct 14, 2022 | Optical Character Recognition (OCR) | Code Available | 2 | 5 |
| Thinking Image Color Aesthetics Assessment: Models, Datasets and Benchmarks | Jan 1, 2023 | Image Quality Assessment | Code Available | 2 | 5 |
| NVIDIA FLARE: Federated Learning from Simulation to Real-World | Oct 24, 2022 | Federated Learning, Privacy Preserving | Code Available | 2 | 5 |
| Current Trends in Deep Learning for Earth Observation: An Open-source Benchmark Arena for Image Classification | Jul 14, 2022 | Classification, Earth Observation | Code Available | 2 | 5 |
| A Survey of Deep Face Restoration: Denoise, Super-Resolution, Deblur, Artifact Removal | Nov 5, 2022 | Deep Learning, Image Restoration | Code Available | 2 | 5 |
| Open-Vocabulary Online Semantic Mapping for SLAM | Nov 22, 2024 | Segmentation, Semantic SLAM | Code Available | 2 | 5 |
| Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring | Nov 22, 2022 | Deblurring, Decoder | Code Available | 2 | 5 |
| High-fidelity 3D GAN Inversion by Pseudo-multi-view Optimization | Nov 28, 2022 | Attribute, Generative Adversarial Network | Code Available | 2 | 5 |
| Fine-tuned CLIP Models are Efficient Video Learners | Dec 6, 2022 | | Code Available | 2 | 5 |
| End-to-End Modeling Hierarchical Time Series Using Autoregressive Transformer and Conditional Normalizing Flow based Reconciliation | Dec 28, 2022 | Multivariate Time Series Forecasting, Time Series | Code Available | 2 | 5 |
| H-CoT: Hijacking the Chain-of-Thought Safety Reasoning Mechanism to Jailbreak Large Reasoning Models, Including OpenAI o1/o3, DeepSeek-R1, and Gemini 2.0 Flash Thinking | Feb 18, 2025 | | Code Available | 2 | 5 |