| SG-Tailor: Inter-Object Commonsense Relationship Reasoning for Scene Graph Manipulation | Mar 23, 2025 | Scene Generation | CodeCode Available | 0 |
| Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes | Mar 23, 2025 | ObjectRetrieval | —Unverified | 0 |
| HSM: Hierarchical Scene Motifs for Multi-Scale Indoor Scene Generation | Mar 21, 2025 | Layout GenerationScene Generation | —Unverified | 0 |
| NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes | Mar 20, 2025 | Scene Generation | CodeCode Available | 2 |
| DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation | Mar 19, 2025 | Novel View SynthesisScene Generation | —Unverified | 0 |
| Cube: A Roblox View of 3D Intelligence | Mar 19, 2025 | Scene GenerationText Generation | CodeCode Available | 4 |
| SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis | Mar 18, 2025 | Indoor Scene SynthesisScene Generation | —Unverified | 0 |
| Advances in 4D Generation: A Survey | Mar 18, 2025 | Autonomous DrivingComputational Efficiency | CodeCode Available | 2 |
| ChatBEV: A Visual Language Model that Understands BEV Maps | Mar 18, 2025 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model | Mar 18, 2025 | Autonomous DrivingImage Generation | CodeCode Available | 1 |
| Bolt3D: Generating 3D Scenes in Seconds | Mar 18, 2025 | 3D geometry3D Reconstruction | —Unverified | 0 |
| Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception | Mar 17, 2025 | Future predictionScene Generation | CodeCode Available | 2 |
| SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering | Mar 15, 2025 | Scene GenerationVideo Generation | CodeCode Available | 2 |
| WonderVerse: Extendable 3D Scene Generation with Video Generative Models | Mar 12, 2025 | 3D ReconstructionDepth Estimation | —Unverified | 0 |
| Controllable 3D Outdoor Scene Generation via Scene Graphs | Mar 10, 2025 | Autonomous DrivingScene Generation | CodeCode Available | 2 |
| Unlocking Generalization for Robotics via Modularity and Scale | Mar 10, 2025 | Scene Generation | —Unverified | 0 |
| DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance | Mar 5, 2025 | 3D Object DetectionBEV Segmentation | CodeCode Available | 1 |
| Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection | Mar 3, 2025 | Domain AdaptationDomain Generalization | CodeCode Available | 1 |
| A Survey on Text-Driven 360-Degree Panorama Generation | Feb 20, 2025 | Scene GenerationSurvey | —Unverified | 0 |
| Rolling Ahead Diffusion for Traffic Scene Simulation | Feb 13, 2025 | Computational EfficiencyModel Predictive Control | —Unverified | 0 |
| MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation | Feb 9, 2025 | Scene Generation | CodeCode Available | 1 |
| Functional 3D Scene Synthesis through Human-Scene Optimization | Feb 5, 2025 | Human-Object Interaction DetectionScene Generation | —Unverified | 0 |
| LAYOUTDREAMER: Physics-guided Layout for Text-to-3D Compositional Scene Generation | Feb 4, 2025 | 3DGSScene Generation | —Unverified | 0 |
| PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation | Feb 2, 2025 | Scene GenerationText to 3D | —Unverified | 0 |
| HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Jan 24, 2025 | Autonomous DrivingLanguage Modeling | CodeCode Available | 3 |
| BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation | Jan 15, 2025 | Point cloud reconstructionScene Generation | CodeCode Available | 0 |
| CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities | Jan 15, 2025 | Scene Generation | CodeCode Available | 2 |
| StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation | Jan 10, 2025 | Perpetual View GenerationScene Generation | —Unverified | 0 |
| Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors | Jan 5, 2025 | Scene GenerationText to 3D | —Unverified | 0 |
| MRG: A Multi-Robot Manufacturing Digital Scene Generation Method Using Multi-Instance Point Cloud Registration | Jan 3, 2025 | Industrial RobotsPoint Cloud Registration | —Unverified | 0 |
| FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement | Jan 1, 2025 | 3D geometryCommon Sense Reasoning | —Unverified | 0 |
| Robust Multi-Object 4D Generation for In-the-wild Videos | Jan 1, 2025 | ObjectScene Generation | —Unverified | 0 |
| SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model | Jan 1, 2025 | Scene Generation | —Unverified | 0 |
| DreamDrive: Generative 4D Scene Modeling from Street View Images | Dec 31, 2024 | Autonomous DrivingNeural Rendering | —Unverified | 0 |
| Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Dec 30, 2024 | 3D GenerationImage Generation | —Unverified | 0 |
| Toward Scene Graph and Layout Guided Complex 3D Scene Generation | Dec 29, 2024 | 3D GenerationScene Generation | —Unverified | 0 |
| DepthLab: From Partial to Complete | Dec 24, 2024 | Depth CompletionMissing Values | —Unverified | 0 |
| OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | Dec 23, 2024 | Autonomous DrivingObject | —Unverified | 0 |
| T^3-S2S: Training-free Triplet Tuning for Sketch to Scene Generation | Dec 18, 2024 | Scene GenerationTriplet | CodeCode Available | 0 |
| Wonderland: Navigating 3D Scenes from a Single Image | Dec 16, 2024 | 3D ReconstructionScene Generation | —Unverified | 0 |
| OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation | Dec 15, 2024 | MambaScene Generation | —Unverified | 0 |
| GPD-1: Generative Pre-training for Driving | Dec 11, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations | Dec 11, 2024 | AttributeImage Generation | CodeCode Available | 2 |
| UniScene: Unified Occupancy-centric Driving Scene Generation | Dec 6, 2024 | Autonomous DrivingScene Generation | CodeCode Available | 4 |
| SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout | Dec 5, 2024 | DenoisingLarge Language Model | —Unverified | 0 |
| PaintScene4D: Consistent 4D Scene Generation from Text Prompts | Dec 5, 2024 | Scene GenerationVideo Generation | —Unverified | 0 |
| InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models | Dec 5, 2024 | Scene Generation | —Unverified | 0 |
| Supertoroid fitting of objects with holes for robotic grasping and scene generation | Dec 5, 2024 | Robotic GraspingScene Generation | CodeCode Available | 0 |
| SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation | Dec 2, 2024 | Scene Generation | —Unverified | 0 |
| HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving | Dec 2, 2024 | Autonomous DrivingDepth Estimation | —Unverified | 0 |