| A Physics-informed Diffusion Model for High-fidelity Flow Field Reconstruction | Nov 26, 2022 | Vocal Bursts Intensity Prediction | CodeCode Available | 2 |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Nov 26, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction | Nov 25, 2022 | 3D Face ReconstructionDecoder | CodeCode Available | 2 |
| Fine-Grained Face Swapping via Regional GAN Inversion | Nov 25, 2022 | DisentanglementFace Swapping | CodeCode Available | 2 |
| CLIP-ReID: Exploiting Vision-Language Model for Image Re-Identification without Concrete Text Labels | Nov 25, 2022 | image-classificationImage Classification | CodeCode Available | 2 |
| Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism | Nov 25, 2022 | GPU | CodeCode Available | 2 |
| Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark | Nov 24, 2022 | 2D Object DetectionImage Retrieval | CodeCode Available | 2 |
| MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning | Nov 24, 2022 | Deep Reinforcement LearningLayout Design | CodeCode Available | 2 |
| Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation | Nov 24, 2022 | FairnessFraud Detection | CodeCode Available | 2 |
| Immersive Neural Graphics Primitives | Nov 24, 2022 | BenchmarkingNeRF | CodeCode Available | 2 |
| A Self-Attention Ansatz for Ab-initio Quantum Chemistry | Nov 24, 2022 | | CodeCode Available | 2 |
| Melting Pot 2.0 | Nov 24, 2022 | Artificial LifeNavigate | CodeCode Available | 2 |
| Certified data-driven physics-informed greedy auto-encoder simulator | Nov 24, 2022 | | CodeCode Available | 2 |
| Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration | Nov 23, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| Inversion-Based Style Transfer with Diffusion Models | Nov 23, 2022 | DenoisingImage Generation | CodeCode Available | 2 |
| Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation | Nov 23, 2022 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 |
| HouseDiffusion: Vector Floorplan Generation via a Diffusion Model with Discrete and Continuous Denoising | Nov 23, 2022 | DenoisingVector Graphics | CodeCode Available | 2 |
| BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Nov 23, 2022 | 3D Scene ReconstructionDeblurring | CodeCode Available | 2 |
| Latent Video Diffusion Models for High-Fidelity Long Video Generation | Nov 23, 2022 | DenoisingImage Generation | CodeCode Available | 2 |
| GhostNetV2: Enhance Cheap Operation with Long-Range Attention | Nov 23, 2022 | | CodeCode Available | 2 |
| AERO: Audio Super Resolution in the Spectral Domain | Nov 22, 2022 | Audio Super-ResolutionBandwidth Extension | CodeCode Available | 2 |
| Instant Volumetric Head Avatars | Nov 22, 2022 | Face ModelGPU | CodeCode Available | 2 |
| X^2-VLM: All-In-One Pre-trained Model For Vision-Language Tasks | Nov 22, 2022 | AllCross-Modal Retrieval | CodeCode Available | 2 |
| Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Nov 22, 2022 | Image GenerationImage-to-Image Translation | CodeCode Available | 2 |
| Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition | Nov 22, 2022 | NeRFTalking Face Generation | CodeCode Available | 2 |
| Efficient Frequency Domain-based Transformers for High-Quality Image Deblurring | Nov 22, 2022 | DeblurringDecoder | CodeCode Available | 2 |
| OpenFE: Automated Feature Generation with Expert-level Performance | Nov 22, 2022 | Feature Importance | CodeCode Available | 2 |
| EDICT: Exact Diffusion Inversion via Coupled Transformations | Nov 22, 2022 | DenoisingImage Reconstruction | CodeCode Available | 2 |
| SinDiffusion: Learning a Diffusion Model from a Single Natural Image | Nov 22, 2022 | DenoisingDiversity | CodeCode Available | 2 |
| PermutoSDF: Fast Multi-View Reconstruction with Implicit Surfaces using Permutohedral Lattices | Nov 22, 2022 | | CodeCode Available | 2 |
| Person Image Synthesis via Denoising Diffusion Model | Nov 22, 2022 | DenoisingDiversity | CodeCode Available | 2 |
| Discovering Evolution Strategies via Meta-Black-Box Optimization | Nov 21, 2022 | continuous-controlContinuous Control | CodeCode Available | 2 |
| UniMSE: Towards Unified Multimodal Sentiment Analysis and Emotion Recognition | Nov 21, 2022 | Contrastive LearningEmotion Recognition | CodeCode Available | 2 |
| SPARF: Neural Radiance Fields from Sparse and Noisy Poses | Nov 21, 2022 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| DiffBP: Generative Diffusion of 3D Molecules for Target Protein Binding | Nov 21, 2022 | Drug Discovery | CodeCode Available | 2 |
| Blur Interpolation Transformer for Real-World Motion from Blur | Nov 21, 2022 | DeblurringSuper-Resolution | CodeCode Available | 2 |
| PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning | Nov 21, 2022 | 3D Classification3D Object Detection | CodeCode Available | 2 |
| ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Nov 21, 2022 | 3D ReconstructionCamera Localization | CodeCode Available | 2 |
| Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion | Nov 21, 2022 | 3D ReconstructionNeRF | CodeCode Available | 2 |
| NeRF-RPN: A general framework for object detection in NeRFs | Nov 21, 2022 | NeRFobject-detection | CodeCode Available | 2 |
| Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars | Nov 21, 2022 | Face Model | CodeCode Available | 2 |
| Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering | Nov 21, 2022 | Dynamic ReconstructionTensor Decomposition | CodeCode Available | 2 |
| DynIBaR: Neural Dynamic Image-Based Rendering | Nov 20, 2022 | Dynamic Reconstruction | CodeCode Available | 2 |
| Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models | Nov 20, 2022 | Story ContinuationStory Visualization | CodeCode Available | 2 |
| MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception | Nov 19, 2022 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 2 |
| Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion | Nov 19, 2022 | 3D Object Detectionobject-detection | CodeCode Available | 2 |
| EDGE: Editable Dance Generation From Music | Nov 19, 2022 | DiversityMotion Synthesis | CodeCode Available | 2 |
| DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting | Nov 19, 2022 | DecoderScene Text Detection | CodeCode Available | 2 |
| GNS: A generalizable Graph Neural Network-based simulator for particulate and fluid modeling | Nov 18, 2022 | Graph Neural Network | CodeCode Available | 2 |
| Visual Programming: Compositional visual reasoning without training | Nov 18, 2022 | In-Context LearningQuestion Answering | CodeCode Available | 2 |