| Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping | Sep 27, 2022 | NeRFVisual Odometry | CodeCode Available | 2 |
| Liquid Structural State-Space Models | Sep 26, 2022 | Heart rate estimationLong-range modeling | CodeCode Available | 2 |
| STD: Stable Triangle Descriptor for 3D place recognition | Sep 26, 2022 | 3D Place Recognition | CodeCode Available | 2 |
| Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps | Sep 26, 2022 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 2 |
| Generalized Parametric Contrastive Learning | Sep 26, 2022 | Contrastive LearningDomain Generalization | CodeCode Available | 2 |
| Learning to Learn with Generative Models of Neural Network Checkpoints | Sep 26, 2022 | | CodeCode Available | 2 |
| Learning GFlowNets from partial episodes for improved convergence and stability | Sep 26, 2022 | | CodeCode Available | 2 |
| PL-EVIO: Robust Monocular Event-based Visual Inertial Odometry with Point and Line Features | Sep 25, 2022 | Management | CodeCode Available | 2 |
| Personalizing Text-to-Image Generation via Aesthetic Gradients | Sep 25, 2022 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| Accurate and Efficient Stereo Matching via Attention Concatenation Volume | Sep 23, 2022 | Stereo Matching | CodeCode Available | 2 |
| On Efficient Reinforcement Learning for Full-length Game of StarCraft II | Sep 23, 2022 | CPUreinforcement-learning | CodeCode Available | 2 |
| Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration | Sep 22, 2022 | Compressed Image Super-resolutionImage Restoration | CodeCode Available | 2 |
| UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer | Sep 22, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement | Sep 22, 2022 | Audio Super-ResolutionAutomatic Speech Recognition | CodeCode Available | 2 |
| A Generalist Neural Algorithmic Learner | Sep 22, 2022 | Graph Neural NetworkLearning to Execute | CodeCode Available | 2 |
| A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases | Sep 22, 2022 | Inductive Bias | CodeCode Available | 2 |
| Poisson Flow Generative Models | Sep 22, 2022 | Image Generation | CodeCode Available | 2 |
| Generate rather than Retrieve: Large Language Models are Strong Context Generators | Sep 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Understanding the Tricks of Deep Learning in Medical Image Segmentation: Challenges and Future Directions | Sep 21, 2022 | Data AugmentationDomain Adaptation | CodeCode Available | 2 |
| HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification | Sep 21, 2022 | Classificationimage-classification | CodeCode Available | 2 |
| Mega: Moving Average Equipped Gated Attention | Sep 21, 2022 | Image ClassificationInductive Bias | CodeCode Available | 2 |
| BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo | Sep 21, 2022 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| Text2Light: Zero-Shot Text-Driven HDR Panorama Generation | Sep 20, 2022 | 4kinverse tone mapping | CodeCode Available | 2 |
| MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge -- Motion Prediction | Sep 20, 2022 | motion predictionPrediction | CodeCode Available | 2 |
| Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering | Sep 20, 2022 | Multimodal Deep LearningMultimodal Reasoning | CodeCode Available | 2 |
| TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization | Sep 19, 2022 | | CodeCode Available | 2 |
| Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning | Sep 18, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| Human Performance Modeling and Rendering via Neural Animated Mesh | Sep 18, 2022 | | CodeCode Available | 2 |
| SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation | Sep 18, 2022 | Real-Time Semantic SegmentationSegmentation | CodeCode Available | 2 |
| RDD2022: A multi-national image dataset for automatic Road Damage Detection | Sep 18, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Scalable SoftGroup for 3D Instance Segmentation on Point Clouds | Sep 17, 2022 | 3D Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| DytanVO: Joint Refinement of Visual Odometry and Motion Segmentation in Dynamic Environments | Sep 17, 2022 | Motion SegmentationSemantic Segmentation | CodeCode Available | 2 |
| A real-time dynamic obstacle tracking and mapping system for UAV navigation and collision avoidance with an RGB-D camera | Sep 17, 2022 | Autonomous DrivingCollision Avoidance | CodeCode Available | 2 |
| IoT Data Analytics in Dynamic Environments: From An Automated Machine Learning Perspective | Sep 16, 2022 | Anomaly DetectionAutoML | CodeCode Available | 2 |
| Omni-Dimensional Dynamic Convolution | Sep 16, 2022 | | CodeCode Available | 2 |
| ZeroEGGS: Zero-shot Example-based Gesture Generation from Speech | Sep 15, 2022 | Gesture Generation | CodeCode Available | 2 |
| Vision-aided UAV navigation and dynamic obstacle avoidance using gradient-based B-spline trajectory optimization | Sep 15, 2022 | Navigate | CodeCode Available | 2 |
| On-Device Domain Generalization | Sep 15, 2022 | Data AugmentationDomain Generalization | CodeCode Available | 2 |
| Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models | Sep 15, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Pose2Sim: An open-source Python package for multiview markerless kinematics | Sep 14, 2022 | | CodeCode Available | 2 |
| CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment | Sep 14, 2022 | RetrievalText Retrieval | CodeCode Available | 2 |
| GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots | Sep 12, 2022 | | CodeCode Available | 2 |
| CSL: A Large-scale Chinese Scientific Literature Dataset | Sep 12, 2022 | text-classificationText Classification | CodeCode Available | 2 |
| CenterFormer: Center-based Transformer for 3D Object Detection | Sep 12, 2022 | 3D Object DetectionObject | CodeCode Available | 2 |
| 3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation | Sep 12, 2022 | 3D Face AnimationDisentanglement | CodeCode Available | 2 |
| Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation | Sep 12, 2022 | Robot ManipulationRobot Manipulation Generalization | CodeCode Available | 2 |
| Git Re-Basin: Merging Models modulo Permutation Symmetries | Sep 11, 2022 | Linear Mode ConnectivityRe-basin | CodeCode Available | 2 |
| Diffusion Models in Vision: A Survey | Sep 10, 2022 | ArticlesDenoising | CodeCode Available | 2 |
| MCIBI++: Soft Mining Contextual Information Beyond Image for Semantic Segmentation | Sep 9, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| TEACH: Temporal Action Composition for 3D Humans | Sep 9, 2022 | Motion SynthesisSentence | CodeCode Available | 2 |