| Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Aug 27, 2024 | DeblurringVideo Enhancement | CodeCode Available | 2 |
| SAM & SAM 2 in 3D Slicer: SegmentWithSAM Extension for Annotating Medical Images | Aug 27, 2024 | | CodeCode Available | 2 |
| HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling | Aug 27, 2024 | Domain GeneralizationPrompt Engineering | CodeCode Available | 2 |
| LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Aug 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals | Aug 27, 2024 | EEGElectroencephalogram (EEG) | CodeCode Available | 2 |
| CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation | Aug 26, 2024 | Continual Learning | CodeCode Available | 2 |
| Training-Free Activation Sparsity in Large Language Models | Aug 26, 2024 | Quantization | CodeCode Available | 2 |
| A Practitioner's Guide to Continual Multimodal Pretraining | Aug 26, 2024 | Continual LearningContinual Pretraining | CodeCode Available | 2 |
| GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned Policy | Aug 26, 2024 | Few-Shot LearningImage Generation | CodeCode Available | 2 |
| MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Video-CCAM: Enhancing Video-Language Understanding with Causal Cross-Attention Masks for Short and Long Videos | Aug 26, 2024 | Large Language ModelMVBench | CodeCode Available | 2 |
| LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings | Aug 25, 2024 | Language ModellingLink Prediction | CodeCode Available | 2 |
| MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation | Aug 25, 2024 | Image SegmentationMamba | CodeCode Available | 2 |
| MobileQuant: Mobile-friendly Quantization for On-device Language Models | Aug 25, 2024 | Quantization | CodeCode Available | 2 |
| SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting | Aug 25, 2024 | 3DGSImage Generation | CodeCode Available | 2 |
| TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather | Aug 25, 2024 | Autonomous DrivingDenoising | CodeCode Available | 2 |
| 3D-RCNet: Learning from Transformer to Build a 3D Relational ConvNet for Hyperspectral Image Classification | Aug 25, 2024 | Computational EfficiencyHyperspectral Image Classification | CodeCode Available | 2 |
| Segment Any Mesh: Zero-shot Mesh Part Segmentation via Lifting Segment Anything 2 to 3D | Aug 24, 2024 | DiversitySegmentation | CodeCode Available | 2 |
| DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation | Aug 24, 2024 | Anomaly ClassificationAnomaly Detection | CodeCode Available | 2 |
| SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description | Aug 24, 2024 | DescriptiveSpeech Synthesis | CodeCode Available | 2 |
| WildFusion: Individual Animal Identification with Calibrated Similarity Fusion | Aug 23, 2024 | | CodeCode Available | 2 |
| Data-Driven Parametrization of Molecular Mechanics Force Fields for Expansive Chemical Space Coverage | Aug 23, 2024 | Computational EfficiencyDrug Discovery | CodeCode Available | 2 |
| Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler | Aug 23, 2024 | | CodeCode Available | 2 |
| LLM-PBE: Assessing Data Privacy in Large Language Models | Aug 23, 2024 | | CodeCode Available | 2 |
| DeTPP: Leveraging Object Detection for Robust Long-Horizon Event Prediction | Aug 23, 2024 | DiversityPoint Processes | CodeCode Available | 2 |