| How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments | Mar 18, 2024 | Decision Making | Code Available | 2 |
| Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail | Mar 18, 2024 | Lifelike 3D Human Generation | Code Available | 2 |
| Fed3DGS: Scalable 3D Gaussian Splatting with Federated Learning | Mar 18, 2024 | 3DGS, 3D Reconstruction | Code Available | 2 |
| CRS-Diff: Controllable Remote Sensing Image Generation with Diffusion Model | Mar 18, 2024 | Image Generation | Code Available | 2 |
| SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction | Mar 18, 2024 | Autonomous Vehicles, motion prediction | Code Available | 2 |
| ReGenNet: Towards Human Action-Reaction Synthesis | Mar 18, 2024 | Decoder | Code Available | 2 |
| LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Mar 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Mar 18, 2024 | Instance Segmentation, NeRF | Code Available | 2 |
| Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems | Mar 18, 2024 | Machine Translation, Translation | Code Available | 2 |
| MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control | Mar 18, 2024 | Instruction Following, Minecraft | Code Available | 2 |
| Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Mar 18, 2024 | Mixture-of-Experts, parameter-efficient fine-tuning | Code Available | 2 |
| HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Mar 18, 2024 | Scene Graph Generation | Code Available | 2 |
| LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models | Mar 18, 2024 | | Code Available | 2 |
| Graph Neural Networks for Learning Equivariant Representations of Neural Networks | Mar 18, 2024 | | Code Available | 2 |
| Counting-Stars: A Multi-evidence, Position-aware, and Scalable Benchmark for Evaluating Long-Context Large Language Models | Mar 18, 2024 | 4k, Position | Code Available | 2 |
| Continual Forgetting for Pre-trained Vision Models | Mar 18, 2024 | Continual Forgetting, Face Recognition | Code Available | 2 |
| DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation | Mar 18, 2024 | Feature Engineering, Image Manipulation | Code Available | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Mar 18, 2024 | Decision Making, Scene Segmentation | Code Available | 2 |
| Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt | Mar 18, 2024 | Attribute, Decoder | Code Available | 2 |
| ThermoNeRF: Joint RGB and Thermal Novel View Synthesis for Building Facades using Multimodal Neural Radiance Fields | Mar 18, 2024 | 3D geometry, Image Generation | Code Available | 2 |
| Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning | Mar 18, 2024 | class-incremental learning, Class Incremental Learning | Code Available | 2 |
| NetTrack: Tracking Highly Dynamic Objects with a Net | Mar 17, 2024 | Multi-Object Tracking, Object | Code Available | 2 |
| DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation | Mar 17, 2024 | Semantic Segmentation, Weakly supervised Semantic Segmentation | Code Available | 2 |
| MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data | Mar 17, 2024 | Image Retrieval, Retrieval | Code Available | 2 |
| CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations | Mar 17, 2024 | Object, object-detection | Code Available | 2 |