| Model scale versus domain knowledge in statistical forecasting of chaotic systems | Mar 13, 2023 | Time SeriesTime Series Analysis | CodeCode Available | 2 |
| ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions | Mar 12, 2023 | Image CaptioningQuestion Answering | CodeCode Available | 2 |
| Iterative Geometry Encoding Volume for Stereo Matching | Mar 12, 2023 | Omnnidirectional Stereo Depth EstimationStereo Matching | CodeCode Available | 2 |
| Stabilizing Transformer Training by Preventing Attention Entropy Collapse | Mar 11, 2023 | Automatic Speech Recognitionimage-classification | CodeCode Available | 2 |
| A Systematic Study of Joint Representation Learning on Protein Sequences and Structures | Mar 11, 2023 | Contrastive LearningProtein Function Prediction | CodeCode Available | 2 |
| DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation | Mar 11, 2023 | Image Manipulation | CodeCode Available | 2 |
| ACR: Attention Collaboration-based Regressor for Arbitrary Two-Hand Reconstruction | Mar 10, 2023 | 3D Interacting Hand Pose Estimation3D Reconstruction | CodeCode Available | 2 |
| Side-channel analysis against ANSSI’s protected AES implementation on ARM: end-to-end attacks with multi-task learning | Mar 10, 2023 | Multi-Task LearningSide Channel Analysis | CodeCode Available | 2 |
| HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining | Mar 10, 2023 | AttributeAutonomous Driving | CodeCode Available | 2 |
| StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces | Mar 10, 2023 | AttributeSuper-Resolution | CodeCode Available | 2 |
| Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking | Mar 9, 2023 | Contrastive LearningDecoder | CodeCode Available | 2 |
| DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation | Mar 9, 2023 | DecoderDenoising | CodeCode Available | 2 |
| 3DGen: Triplane Latent Diffusion for Textured Mesh Generation | Mar 9, 2023 | DiversityGPU | CodeCode Available | 2 |
| X-Avatar: Expressive Human Avatars | Mar 8, 2023 | 3D Human Reconstruction | CodeCode Available | 2 |
| Video-P2P: Video Editing with Cross-attention Control | Mar 8, 2023 | Image GenerationVideo Editing | CodeCode Available | 2 |
| Video-P2P: Video Editing with Cross-attention Control | Mar 8, 2023 | Image GenerationVideo Editing | CodeCode Available | 2 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning | Mar 8, 2023 | Semantic Segmentation | CodeCode Available | 2 |
| Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models? | Mar 7, 2023 | | CodeCode Available | 2 |
| OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception | Mar 7, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks | Mar 7, 2023 | CPUGPU | CodeCode Available | 2 |
| 3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction | Mar 6, 2023 | Drug Design | CodeCode Available | 2 |
| PaLM-E: An Embodied Multimodal Language Model | Mar 6, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| OpenICL: An Open-Source Framework for In-context Learning | Mar 6, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| KBNet: Kernel Basis Network for Image Restoration | Mar 6, 2023 | Color Image DenoisingDeblurring | CodeCode Available | 2 |
| HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling | Mar 5, 2023 | | CodeCode Available | 2 |
| Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Mar 5, 2023 | 3D Human Pose EstimationHuman Detection | CodeCode Available | 2 |
| Streaming Active Learning with Deep Neural Networks | Mar 5, 2023 | Active LearningDiversity | CodeCode Available | 2 |
| Virtual Sparse Convolution for Multimodal 3D Object Detection | Mar 4, 2023 | 3D Multi-Object Tracking3D Object Detection | CodeCode Available | 2 |
| Extended Agriculture-Vision: An Extension of a Large Aerial Image Dataset for Agricultural Pattern Analysis | Mar 4, 2023 | BenchmarkingContrastive Learning | CodeCode Available | 2 |
| Chasing Low-Carbon Electricity for Practical and Sustainable DNN Training | Mar 4, 2023 | | CodeCode Available | 2 |
| FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation | Mar 4, 2023 | BenchmarkingGPU | CodeCode Available | 2 |
| Towards Democratizing Joint-Embedding Self-Supervised Learning | Mar 3, 2023 | Data AugmentationMisconceptions | CodeCode Available | 2 |
| MixVPR: Feature Mixing for Visual Place Recognition | Mar 3, 2023 | Autonomous DrivingImage Retrieval | CodeCode Available | 2 |
| Unleashing Text-to-Image Diffusion Models for Visual Perception | Mar 3, 2023 | DenoisingDepth Estimation | CodeCode Available | 2 |
| Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement | Mar 3, 2023 | 3D ReconstructionNeRF | CodeCode Available | 2 |
| Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners | Mar 3, 2023 | Few-Shot LearningRepresentation Learning | CodeCode Available | 2 |
| POPGym: Benchmarking Partially Observable Reinforcement Learning | Mar 3, 2023 | BenchmarkingGPU | CodeCode Available | 2 |
| Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering | Mar 3, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Dropout Reduces Underfitting | Mar 2, 2023 | | CodeCode Available | 2 |
| UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy | Mar 2, 2023 | Motion Planning | CodeCode Available | 2 |
| Human Motion Diffusion as a Generative Prior | Mar 2, 2023 | DenoisingMotion Synthesis | CodeCode Available | 2 |
| Image as Set of Points | Mar 2, 2023 | Clustering | CodeCode Available | 2 |
| Delivering Arbitrary-Modal Semantic Segmentation | Mar 2, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Mar 1, 2023 | Audio-Visual Speech RecognitionRobust Speech Recognition | CodeCode Available | 2 |
| Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation | Mar 1, 2023 | Video Frame Interpolation | CodeCode Available | 2 |
| UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers | Mar 1, 2023 | Domain AdaptationInformation Retrieval | CodeCode Available | 2 |
| Multimodal Industrial Anomaly Detection via Hybrid Fusion | Mar 1, 2023 | 3D Anomaly DetectionAnomaly Detection | CodeCode Available | 2 |
| Efficient and Explicit Modelling of Image Hierarchies for Image Restoration | Mar 1, 2023 | Image DeblurringImage Defocus Deblurring | CodeCode Available | 2 |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images | Feb 28, 2023 | 3D Face ReconstructionDisentanglement | CodeCode Available | 2 |