| On the Effectiveness of Dataset Alignment for Fake Image Detection | Oct 15, 2024 | Denoising, Fake Image Detection | Code Available | 1 |
| On-the-fly Modulation for Balanced Multimodal Learning | Oct 15, 2024 | | Code Available | 1 |
| Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based Backpropagation | Oct 15, 2024 | | Code Available | 1 |
| Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models | Oct 15, 2024 | Navigate, Sensitivity | Code Available | 1 |
| A State-of-the-Art Morphosyntactic Parser and Lemmatizer for Ancient Greek | Oct 15, 2024 | Lemmatization | Code Available | 1 |
| From promise to practice: realizing high-performance decentralized training | Oct 15, 2024 | | Code Available | 1 |
| Safety Filtering While Training: Improving the Performance and Sample Efficiency of Reinforcement Learning Agents | Oct 15, 2024 | Reinforcement Learning (RL) | Code Available | 1 |
| Cognitive Overload Attack: Prompt Injection for Long Context | Oct 15, 2024 | In-Context Learning, LLM Jailbreak | Code Available | 1 |
| Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation | Oct 15, 2024 | Knowledge Distillation, Rgb-T Tracking | Code Available | 1 |
| Leveraging Multi-Temporal Sentinel 1 and 2 Satellite Data for Leaf Area Index Estimation With Deep Learning | Oct 15, 2024 | Decoder | Code Available | 1 |
| Offline Model-Based Optimization by Learning to Rank | Oct 15, 2024 | Ensemble Learning, Learning-To-Rank | Code Available | 1 |
| Why Go Full? Elevating Federated Learning Through Partial Network Updates | Oct 15, 2024 | Federated Learning | Code Available | 1 |
| TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Oct 15, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 1 |
| TraM: Enhancing User Sleep Prediction with Transformer-based Multivariate Time Series Modeling and Machine Learning Ensembles | Oct 15, 2024 | Time Series | Code Available | 1 |
| Dual-frame Fluid Motion Estimation with Test-time Optimization and Zero-divergence Loss | Oct 15, 2024 | Motion Estimation | Code Available | 1 |
| Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws | Oct 15, 2024 | Computational Efficiency | Code Available | 1 |
| V3D-SLAM: Robust RGB-D SLAM in Dynamic Environments with 3D Semantic Geometry Voting | Oct 15, 2024 | Simultaneous Localization and Mapping | Code Available | 1 |
| DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Oct 15, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| DARNet: Dual Attention Refinement Network with Spatiotemporal Construction for Auditory Attention Detection | Oct 15, 2024 | EEG | Code Available | 1 |
| Zero-shot Model-based Reinforcement Learning using Large Language Models | Oct 15, 2024 | In-Context Learning, Model-based Reinforcement Learning | Code Available | 1 |
| RuleRAG: Rule-guided retrieval-augmented generation with language models for question answering | Oct 15, 2024 | In-Context Learning, Instruction Following | Code Available | 1 |
| Anatomical feature-prioritized loss for enhanced MR to CT translation | Oct 14, 2024 | Image Generation, Image Reconstruction | Code Available | 1 |
| MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models | Oct 14, 2024 | Multiple-choice | Code Available | 1 |
| Interaction-Guided Two-Branch Image Dehazing Network | Oct 14, 2024 | Image Dehazing | Code Available | 1 |
| Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models | Oct 14, 2024 | Federated Learning, Mixture-of-Experts | Code Available | 1 |
| On Information-Theoretic Measures of Predictive Uncertainty | Oct 14, 2024 | Out-of-Distribution Detection | Code Available | 1 |
| PointNet with KAN versus PointNet with MLP for 3D Classification and Segmentation of Point Sets | Oct 14, 2024 | 3D Classification, 3D Object Classification | Code Available | 1 |
| Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models | Oct 14, 2024 | | Code Available | 1 |
| AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality | Oct 14, 2024 | Mixture-of-Experts, parameter-efficient fine-tuning | Code Available | 1 |
| V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Oct 14, 2024 | Instance Segmentation, Mamba | Code Available | 1 |
| Is Parameter Collision Hindering Continual Learning in LLMs? | Oct 14, 2024 | Continual Learning | Code Available | 1 |
| Adaptive Diffusion Terrain Generator for Autonomous Uneven Terrain Navigation | Oct 14, 2024 | Denoising, Diversity | Code Available | 1 |
| α-DPO: Adaptive Reward Margin is What Direct Preference Optimization Needs | Oct 14, 2024 | Computational Efficiency | Code Available | 1 |
| Persistent Topological Features in Large Language Models | Oct 14, 2024 | Decision Making, Topological Data Analysis | Code Available | 1 |
| Denial-of-Service Poisoning Attacks against Large Language Models | Oct 14, 2024 | 16k, Speech-to-Text | Code Available | 1 |
| MagicEraser: Erasing Any Objects via Semantics-Aware Control | Oct 14, 2024 | Image Inpainting, Object | Code Available | 1 |
| Towards Foundation Models for 3D Vision: How Close Are We? | Oct 14, 2024 | Question Answering, Visual Question Answering | Code Available | 1 |
| EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning | Oct 14, 2024 | Code Generation | Code Available | 1 |
| Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations | Oct 14, 2024 | Dimensionality Reduction, MuJoCo | Code Available | 1 |
| GlobalMamba: Global Image Serialization for Vision Mamba | Oct 14, 2024 | image-classification, Image Classification | Code Available | 1 |
| BookWorm: A Dataset for Character Description and Analysis | Oct 14, 2024 | Retrieval | Code Available | 1 |
| Adapt-∞: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection | Oct 14, 2024 | | Code Available | 1 |
| First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending | Oct 14, 2024 | Image Generation, Text Generation | Code Available | 1 |
| CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning | Oct 14, 2024 | Audio Denoising, Decoder | Code Available | 1 |
| Get Rid of Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework | Oct 14, 2024 | Data Summarization, Multi-Task Learning | Code Available | 1 |
| TABCF: Counterfactual Explanations for Tabular Data Using a Transformer-Based VAE | Oct 14, 2024 | counterfactual | Code Available | 1 |
| Locking Down the Finetuned LLMs Safety | Oct 14, 2024 | Safety Alignment | Code Available | 1 |
| CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes | Oct 14, 2024 | Autonomous Driving, Panoptic Segmentation | Code Available | 1 |
| FormalAlign: Automated Alignment Evaluation for Autoformalization | Oct 14, 2024 | Mathematical Proofs, valid | Code Available | 1 |
| Beyond Graphs: Can Large Language Models Comprehend Hypergraphs? | Oct 14, 2024 | | Code Available | 1 |