| Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | Jul 3, 2024 | Domain GeneralizationKnowledge Distillation | CodeCode Available | 2 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 2 |
| VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Jul 3, 2024 | Neural Rendering | CodeCode Available | 2 |
| Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jul 3, 2024 | 3DGS3D Reconstruction | CodeCode Available | 2 |
| HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation | Jul 3, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents | Jul 3, 2024 | Image GenerationMolecular Docking | CodeCode Available | 2 |
| CoIR: A Comprehensive Benchmark for Code Information Retrieval Models | Jul 3, 2024 | BenchmarkingCode Search | CodeCode Available | 2 |
| Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion | Jul 3, 2024 | Point Cloud Completion | CodeCode Available | 2 |
| CATT: Character-based Arabic Tashkeel Transformer | Jul 3, 2024 | Arabic Text DiacritizationDecoder | CodeCode Available | 2 |
| Context-Aware Video Instance Segmentation | Jul 3, 2024 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| MHNet: Multi-view High-order Network for Diagnosing Neurodevelopmental Disorders Using Resting-state fMRI | Jul 3, 2024 | Functional ConnectivityGraph Neural Network | CodeCode Available | 2 |
| Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages | Jul 3, 2024 | Language Modellingvalid | CodeCode Available | 2 |
| Solving Motion Planning Tasks with a Scalable Generative Model | Jul 3, 2024 | Autonomous DrivingMotion Planning | CodeCode Available | 2 |
| SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding | Jul 3, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Jul 2, 2024 | PredictionText to 3D | CodeCode Available | 2 |
| BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Jul 2, 2024 | NeRF | CodeCode Available | 2 |
| AXIAL: Attention-based eXplainability for Interpretable Alzheimer's Localized Diagnosis using 2D CNNs on 3D MRI brain scans | Jul 2, 2024 | 3D ClassificationAlzheimer's Disease Detection | CodeCode Available | 2 |
| MG-Verilog: Multi-grained Dataset Towards Enhanced LLM-assisted Verilog Generation | Jul 2, 2024 | In-Context Learning | CodeCode Available | 2 |
| GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models | Jul 2, 2024 | Marketing | CodeCode Available | 2 |
| MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation | Jul 2, 2024 | HallucinationRAG | CodeCode Available | 2 |
| A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding | Jul 2, 2024 | document understandingKey Information Extraction | CodeCode Available | 2 |
| Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models | Jul 2, 2024 | Story Visualization | CodeCode Available | 2 |
| Safety-Driven Deep Reinforcement Learning Framework for Cobots: A Sim2Real Approach | Jul 2, 2024 | Deep Reinforcement Learning | CodeCode Available | 2 |
| VFIMamba: Video Frame Interpolation with State Space Models | Jul 2, 2024 | 2k4k | CodeCode Available | 2 |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Jul 2, 2024 | Data AugmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |