| Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey | Nov 30, 2024 | Anomaly Detectionobject-detection | —Unverified | 0 |
| Graph Canvas for Controllable 3D Scene Generation | Nov 27, 2024 | In-Context LearningScene Generation | CodeCode Available | 0 |
| CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models | Nov 27, 2024 | 4D reconstructionNovel View Synthesis | —Unverified | 0 |
| ROOT: VLM based System for Indoor Scene Understanding and Beyond | Nov 24, 2024 | Scene GenerationScene Understanding | CodeCode Available | 1 |
| What Makes a Scene ? Scene Graph-based Evaluation and Feedback for Controllable Generation | Nov 23, 2024 | Image GenerationScene Generation | CodeCode Available | 2 |
| Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction | Nov 21, 2024 | 3D GenerationGPU | —Unverified | 0 |
| SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model | Nov 19, 2024 | Scene Generation | CodeCode Available | 0 |
| Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Nov 14, 2024 | Depth EstimationImage Inpainting | —Unverified | 0 |
| DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion | Nov 7, 2024 | 3D GenerationDenoising | —Unverified | 0 |
| DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Oct 31, 2024 | Scene Generation | CodeCode Available | 1 |
| SceneGenAgent: Precise Industrial Scene Generation with Coding Agent | Oct 29, 2024 | C++ codeScene Generation | CodeCode Available | 1 |
| CompGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians | Oct 28, 2024 | 3D GenerationImage Generation | —Unverified | 0 |
| SCube: Instant Large-Scale Scene Reconstruction using VoxSplats | Oct 26, 2024 | 3D ReconstructionScene Generation | —Unverified | 0 |
| Learning Global Object-Centric Representations via Disentangled Slot Attention | Oct 24, 2024 | ObjectPosition | —Unverified | 0 |
| DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes | Oct 23, 2024 | Scene Generation | CodeCode Available | 3 |
| The Scene Language: Representing Scenes with Programs, Words, and Embeddings | Oct 22, 2024 | Scene Generation | —Unverified | 0 |
| L3DG: Latent 3D Gaussian Diffusion | Oct 17, 2024 | Scene Generation | —Unverified | 0 |
| SceneCraft: Layout-Guided 3D Scene Generation | Oct 11, 2024 | 3D GenerationImage Generation | CodeCode Available | 3 |
| Skyeyes: Ground Roaming using Aerial View Images | Sep 25, 2024 | Autonomous DrivingScene Generation | —Unverified | 0 |
| SpaceBlender: Creating Context-Rich Collaborative Spaces Through Generative 3D Scene Blending | Sep 20, 2024 | Depth EstimationScene Generation | —Unverified | 0 |
| LT3SD: Latent Trees for 3D Scene Diffusion | Sep 12, 2024 | Scene Generation | —Unverified | 0 |
| EarthGen: Generating the World from Top-Down Views | Sep 2, 2024 | Scene GenerationSuper-Resolution | CodeCode Available | 0 |
| Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation | Aug 27, 2024 | Image GenerationObject | —Unverified | 0 |
| Alfie: Democratising RGBA Image Generation With No $ | Aug 27, 2024 | Image GenerationImage Matting | CodeCode Available | 2 |
| SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting | Aug 25, 2024 | 3DGSImage Generation | CodeCode Available | 2 |
| Draw Like an Artist: Complex Scene Generation with Diffusion Model via Composition, Painting, and Retouching | Aug 25, 2024 | Scene Generation | —Unverified | 0 |
| LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation | Aug 23, 2024 | Scene Generation | —Unverified | 0 |
| Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE | Aug 10, 2024 | Scene GenerationVideo Generation | —Unverified | 0 |
| EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents | Aug 8, 2024 | Scene GenerationTask Planning | CodeCode Available | 1 |
| PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aug 4, 2024 | GPUImage Generation | —Unverified | 0 |
| iControl3D: An Interactive System for Controllable 3D Scene Generation | Aug 3, 2024 | NavigateNeural Rendering | CodeCode Available | 0 |
| SceneTeller: Language-to-3D Scene Generation | Jul 30, 2024 | In-Context LearningScene Generation | —Unverified | 0 |
| Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering | Jul 30, 2024 | Inverse RenderingNeRF | CodeCode Available | 1 |
| CityX: Controllable Procedural Content Generation for Unbounded 3D Cities | Jul 24, 2024 | Autonomous VehiclesScene Generation | —Unverified | 0 |
| HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions | Jul 21, 2024 | Scene Generation | —Unverified | 0 |
| Training-free Composite Scene Generation for Layout-to-Image Synthesis | Jul 18, 2024 | Image GenerationLayout-to-Image Generation | CodeCode Available | 1 |
| The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation | Jul 17, 2024 | DiversityImage Generation | —Unverified | 0 |
| Sketch-Guided Scene Image Generation | Jul 9, 2024 | Image GenerationObject | —Unverified | 0 |
| MMIS: Multimodal Dataset for Interior Scene Visual Generation and Recognition | Jul 8, 2024 | Image GenerationRepresentation Learning | —Unverified | 0 |
| Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis | Jul 7, 2024 | Indoor Scene SynthesisScene Generation | —Unverified | 0 |
| Solving Motion Planning Tasks with a Scalable Generative Model | Jul 3, 2024 | Autonomous DrivingMotion Planning | CodeCode Available | 2 |
| MultiDiff: Consistent Novel View Synthesis from a Single Image | Jun 26, 2024 | Image GenerationNovel View Synthesis | —Unverified | 0 |
| Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models | Jun 26, 2024 | Collision AvoidanceHuman-Object Interaction Detection | —Unverified | 0 |
| Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text | Jun 25, 2024 | 3D GenerationDenoising | CodeCode Available | 3 |
| WonderWorld: Interactive 3D Scene Generation from a Single Image | Jun 13, 2024 | Depth EstimationGPU | —Unverified | 0 |
| SimGen: Simulator-conditioned Driving Scene Generation | Jun 13, 2024 | Autonomous DrivingData Augmentation | —Unverified | 0 |
| 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Jun 11, 2024 | Scene GenerationVideo Generation | —Unverified | 0 |
| GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation | Jun 10, 2024 | 3D GenerationNeRF | CodeCode Available | 3 |
| CityCraft: A Real Crafter for 3D City Generation | Jun 7, 2024 | Autonomous DrivingDiversity | —Unverified | 0 |
| REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment | May 28, 2024 | Image to 3DObject | CodeCode Available | 2 |