| PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting | Jun 14, 2024 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis | Jun 14, 2024 | SuperpixelsVector Graphics | CodeCode Available | 2 |
| ControlVAR: Exploring Controllable Visual Autoregressive Modeling | Jun 14, 2024 | Image Generation | CodeCode Available | 2 |
| Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs | Jun 14, 2024 | Memorization | CodeCode Available | 2 |
| DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications | Jun 14, 2024 | Autonomous DrivingDepth Estimation | CodeCode Available | 2 |
| Consistency-diversity-realism Pareto fronts of conditional image generative models | Jun 14, 2024 | Diversity | CodeCode Available | 2 |
| ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation | Jun 14, 2024 | Code Generation | CodeCode Available | 2 |
| SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages | Jun 14, 2024 | Diversity | CodeCode Available | 2 |
| Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation | Jun 14, 2024 | NavigateVision and Language Navigation | CodeCode Available | 2 |
| QQQ: Quality Quattuor-Bit Quantization for Large Language Models | Jun 14, 2024 | Quantization | CodeCode Available | 2 |
| RaNeuS: Ray-adaptive Neural Surface Reconstruction | Jun 14, 2024 | NeRFNovel View Synthesis | CodeCode Available | 2 |
| BEACON: Benchmark for Comprehensive RNA Tasks and Language Models | Jun 14, 2024 | Language Modelling | CodeCode Available | 2 |
| PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance | Jun 13, 2024 | Motion GenerationPosition | CodeCode Available | 2 |
| JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models | Jun 13, 2024 | | CodeCode Available | 2 |
| Are We There Yet? A Brief Survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges | Jun 13, 2024 | Emotion RecognitionMusic Emotion Recognition | CodeCode Available | 2 |
| Yo'LLaVA: Your Personalized Language and Vision Assistant | Jun 13, 2024 | Image CaptioningQuestion Answering | CodeCode Available | 2 |
| An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records | Jun 13, 2024 | Adversarial RobustnessExplainable Artificial Intelligence (XAI) | CodeCode Available | 2 |
| Fredformer: Frequency Debiased Transformer for Time Series Forecasting | Jun 13, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 2 |
| BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection | Jun 13, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer | Jun 13, 2024 | Face Image QualityFace Image Quality Assessment | CodeCode Available | 2 |
| Understanding Hallucinations in Diffusion Models through Mode Interpolation | Jun 13, 2024 | HallucinationImage Generation | CodeCode Available | 2 |
| An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios | Jun 13, 2024 | Language IdentificationSelf-Supervised Learning | CodeCode Available | 2 |
| Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models | Jun 13, 2024 | MathQuantization | CodeCode Available | 2 |
| Classic GNNs are Strong Baselines: Reassessing GNNs for Node Classification | Jun 13, 2024 | Node ClassificationNode Property Prediction | CodeCode Available | 2 |
| Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors | Jun 13, 2024 | Data AugmentationText Detection | CodeCode Available | 2 |