| Evolving Reservoirs for Meta Reinforcement Learning | Dec 9, 2023 | Meta Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| Steering Llama 2 via Contrastive Activation Addition | Dec 9, 2023 | Multiple-choice | CodeCode Available | 2 |
| DPoser: Diffusion Model as Robust 3D Human Pose Prior | Dec 9, 2023 | DenoisingHuman Mesh Recovery | CodeCode Available | 2 |
| SlimSAM: 0.1% Data Makes Segment Anything Slim | Dec 8, 2023 | | CodeCode Available | 2 |
| Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review | Dec 8, 2023 | Autonomous DrivingRetrieval | CodeCode Available | 2 |
| Zoology: Measuring and Improving Recall in Efficient Language Models | Dec 8, 2023 | | CodeCode Available | 2 |
| Reconstructing Hands in 3D with Transformers | Dec 8, 2023 | 3D Hand Pose Estimation | CodeCode Available | 2 |
| UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models | Dec 8, 2023 | Image GenerationScene Text Editing | CodeCode Available | 2 |
| FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation | Dec 7, 2023 | 3D Semantic SegmentationAutonomous Driving | CodeCode Available | 2 |
| Towards Knowledge-driven Autonomous Driving | Dec 7, 2023 | Autonomous DrivingNeural Rendering | CodeCode Available | 2 |
| RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models | Dec 7, 2023 | AttributeVideo Editing | CodeCode Available | 2 |
| Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation | Dec 7, 2023 | Domain Generalization | CodeCode Available | 2 |
| Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models | Dec 7, 2023 | | CodeCode Available | 2 |
| Inversion-Free Image Editing with Natural Language | Dec 7, 2023 | Image ManipulationText-based Image Editing | CodeCode Available | 2 |
| Scaling Laws of Synthetic Images for Model Training ... for Now | Dec 7, 2023 | | CodeCode Available | 2 |
| Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future | Dec 6, 2023 | Autonomous Driving | CodeCode Available | 2 |
| Online Vectorized HD Map Construction using Geometry | Dec 6, 2023 | Online Vectorized HD Map Construction | CodeCode Available | 2 |
| OneLLM: One Framework to Align All Modalities with Language | Dec 6, 2023 | AllQuestion Answering | CodeCode Available | 2 |
| Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Dec 6, 2023 | 3DGS3D scene Editing | CodeCode Available | 2 |
| XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies | Dec 6, 2023 | 3D Shape GenerationScene Generation | CodeCode Available | 2 |
| AnimateZero: Video Diffusion Models are Zero-Shot Image Animators | Dec 6, 2023 | Image AnimationVideo Generation | CodeCode Available | 2 |
| DiffusionSat: A Generative Foundation Model for Satellite Imagery | Dec 6, 2023 | Crop Yield PredictionImage Generation | CodeCode Available | 2 |
| Kandinsky 3.0 Technical Report | Dec 6, 2023 | Image GenerationImage to Video Generation | CodeCode Available | 2 |
| Alpha-CLIP: A CLIP Model Focusing on Wherever You Want | Dec 6, 2023 | 3D Generation | CodeCode Available | 2 |
| VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation | Dec 6, 2023 | Language ModellingNavigate | CodeCode Available | 2 |
| Return of Unconditional Generation: A Self-supervised Representation Generation Method | Dec 6, 2023 | Conditional Image GenerationImage Generation | CodeCode Available | 2 |
| Analyzing and Improving the Training Dynamics of Diffusion Models | Dec 5, 2023 | Image GenerationPhilosophy | CodeCode Available | 2 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Dec 5, 2023 | Model OptimizationNovel Concepts | CodeCode Available | 2 |
| LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models | Dec 5, 2023 | Decoder | CodeCode Available | 2 |
| Customization Assistant for Text-to-image Generation | Dec 5, 2023 | DescriptiveImage Generation | CodeCode Available | 2 |
| mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs | Dec 5, 2023 | GPULarge Language Model | CodeCode Available | 2 |
| Foundation Models for Weather and Climate Data Understanding: A Comprehensive Survey | Dec 5, 2023 | | CodeCode Available | 2 |
| Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving? | Dec 5, 2023 | AllAutonomous Driving | CodeCode Available | 2 |
| HHAvatar: Gaussian Head Avatar with Dynamic Hairs | Dec 5, 2023 | 2k | CodeCode Available | 2 |
| Towards Automatic Power Battery Detection: New Challenge, Benchmark Dataset and Baseline | Dec 5, 2023 | Crowd Countingobject-detection | CodeCode Available | 2 |
| Large Language Models on Graphs: A Comprehensive Survey | Dec 5, 2023 | Language ModellingSurvey | CodeCode Available | 2 |
| GPT4Point: A Unified Framework for Point-Language Understanding and Generation | Dec 5, 2023 | 3D GenerationImage Generation | CodeCode Available | 2 |
| RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze! | Dec 5, 2023 | Information RetrievalReranking | CodeCode Available | 2 |
| GauHuman: Articulated Gaussian Splatting from Monocular Human Videos | Dec 5, 2023 | Generalizable Novel View SynthesisNeRF | CodeCode Available | 2 |
| Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation | Dec 5, 2023 | Logical Reasoning | CodeCode Available | 2 |
| Tree of Attacks: Jailbreaking Black-Box LLMs Automatically | Dec 4, 2023 | Navigate | CodeCode Available | 2 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Dec 4, 2023 | AllObject | CodeCode Available | 2 |
| TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Dec 4, 2023 | Dense CaptioningHighlight Detection | CodeCode Available | 2 |
| PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness | Dec 4, 2023 | Autonomous Driving | CodeCode Available | 2 |
| GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians | Dec 4, 2023 | Face Model | CodeCode Available | 2 |
| PixelLM: Pixel Reasoning with Large Multimodal Model | Dec 4, 2023 | Decodermodel | CodeCode Available | 2 |
| SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes | Dec 4, 2023 | Novel View Synthesis | CodeCode Available | 2 |
| DiffiT: Diffusion Vision Transformers for Image Generation | Dec 4, 2023 | DenoisingImage Generation | CodeCode Available | 2 |
| The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning | Dec 4, 2023 | In-Context Learning | CodeCode Available | 2 |
| GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Dec 4, 2023 | 2kDepth Estimation | CodeCode Available | 2 |