| Reactive Environments for Active Inference Agents with RxEnvironments.jl | Sep 17, 2024 | | Code Available | 1 |
| Subgroups: A Python library for Subgroup Discovery | Sep 17, 2024 | Data Mining, Subgroup Discovery | Code Available | 1 |
| MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping | Sep 17, 2024 | Decoder, Few-Shot Semantic Segmentation | Code Available | 1 |
| Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark | Sep 17, 2024 | Few-Shot Semantic Segmentation, Generalized Few-Shot Semantic Segmentation | Code Available | 1 |
| Propulsion: Steering LLM with Tiny Fine-Tuning | Sep 17, 2024 | parameter-efficient fine-tuning | Code Available | 1 |
| SynthSOD: Developing an Heterogeneous Dataset for Orchestra Music Source Separation | Sep 17, 2024 | Music Source Separation | Code Available | 1 |
| SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking | Sep 17, 2024 | Multiple Object Tracking, Object Tracking | Code Available | 1 |
| CUNSB-RFIE: Context-aware Unpaired Neural Schrödinger Bridge in Retinal Fundus Image Enhancement | Sep 17, 2024 | Image Enhancement, Image-to-Image Translation | Code Available | 1 |
| MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance | Sep 17, 2024 | Face Generation, Image Generation | Code Available | 1 |
| Annealed Winner-Takes-All for Motion Forecasting | Sep 17, 2024 | All, Autonomous Driving | Code Available | 1 |
| ULOC: Learning to Localize in Complex Large-Scale Environments with Ultra-Wideband Ranges | Sep 17, 2024 | Mamba | Code Available | 1 |
| Volvo Discovery Challenge at ECML-PKDD 2024 | Sep 17, 2024 | | Code Available | 1 |
| STCMOT: Spatio-Temporal Cohesion Learning for UAV-Based Multiple Object Tracking | Sep 17, 2024 | Multiple Object Tracking, Object | Code Available | 1 |
| TTT-Unet: Enhancing U-Net with Test-Time Training Layers for Biomedical Image Segmentation | Sep 17, 2024 | Cell Segmentation, Image Segmentation | Code Available | 1 |
| Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning | Sep 17, 2024 | Incremental Learning | Code Available | 1 |
| EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage | Sep 17, 2024 | | Code Available | 1 |
| Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Sep 16, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Enhancing RL Safety with Counterfactual LLM Reasoning | Sep 16, 2024 | counterfactual, Language Modeling | Code Available | 1 |
| ScaleFlow++: Robust and Accurate Estimation of 3D Motion from Video | Sep 16, 2024 | Autonomous Driving, motion prediction | Code Available | 1 |
| Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles | Sep 16, 2024 | Causal Language Modeling, Language Modeling | Code Available | 1 |
| CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios | Sep 16, 2024 | | Code Available | 1 |
| Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots | Sep 16, 2024 | Management | Code Available | 1 |
| Towards Physically Realizable Adversarial Attacks in Embodied Vision Navigation | Sep 16, 2024 | Adversarial Robustness, object-detection | Code Available | 1 |
| AutoSafeCoder: A Multi-Agent Framework for Securing LLM Code Generation through Static Analysis and Fuzz Testing | Sep 16, 2024 | Code Generation, Program Synthesis | Code Available | 1 |
| SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps | Sep 16, 2024 | Denoising, Depth Completion | Code Available | 1 |
| E2Map: Experience-and-Emotion Map for Self-Reflective Robot Navigation with Language Models | Sep 16, 2024 | General Knowledge, Robot Navigation | Code Available | 1 |
| MusicLIME: Explainable Multimodal Music Understanding | Sep 16, 2024 | Decision Making, Fairness | Code Available | 1 |
| DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Sep 16, 2024 | 3DGS, NeRF | Code Available | 1 |
| Econometric Inference for High Dimensional Predictive Regressions | Sep 16, 2024 | regression | Code Available | 1 |
| Trustworthiness in Retrieval-Augmented Generation Systems: A Survey | Sep 16, 2024 | Fairness, Hallucination | Code Available | 1 |
| Robust image representations with counterfactual contrastive learning | Sep 16, 2024 | Contrastive Learning, counterfactual | Code Available | 1 |
| MotIF: Motion Instruction Fine-tuning | Sep 16, 2024 | | Code Available | 1 |
| NARX Transformer: A Dynamic Model for Leveraging Multicycle Data in Long-Term Battery State of Health Estimation | Sep 16, 2024 | Battery cycle life prediction, Battery diagnosis | Code Available | 1 |
| Deep-Wide Learning Assistance for Insect Pest Classification | Sep 16, 2024 | Classification, Data Augmentation | Code Available | 1 |
| Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT | Sep 16, 2024 | Acoustic Unit Discovery, Clustering | Code Available | 1 |
| Flash STU: Fast Spectral Transform Units | Sep 16, 2024 | State Space Models | Code Available | 1 |
| Manifold-Constrained Nucleus-Level Denoising Diffusion Model for Structure-Based Drug Design | Sep 16, 2024 | Denoising, Drug Design | Code Available | 1 |
| AceParse: A Comprehensive Dataset with Diverse Structured Texts for Academic Literature Parsing | Sep 16, 2024 | | Code Available | 1 |
| SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation | Sep 16, 2024 | Autonomous Driving | Code Available | 1 |
| MetaFormer and CNN Hybrid Model for Polyp Image Segmentation | Sep 16, 2024 | Benchmarking, Image Segmentation | Code Available | 1 |
| TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer | Sep 15, 2024 | text-guided-image-editing | Code Available | 1 |
| One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild | Sep 15, 2024 | GPU, Image Generation | Code Available | 1 |
| DiFSD: Ego-Centric Fully Sparse Paradigm with Uncertainty Denoising and Iterative Refinement for Efficient End-to-End Self-Driving | Sep 15, 2024 | Autonomous Driving, Bench2Drive | Code Available | 1 |
| SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks | Sep 15, 2024 | Image Classification, Object Detection | Code Available | 1 |
| Finetuning CLIP to Reason about Pairwise Differences | Sep 15, 2024 | Attribute, Contrastive Learning | Code Available | 1 |
| PROSE-FD: A Multimodal PDE Foundation Model for Learning Multiple Operators for Forecasting Fluid Dynamics | Sep 15, 2024 | Operator learning, Prediction | Code Available | 1 |
| Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion | Sep 15, 2024 | Mamba | Code Available | 1 |
| Unsupervised Hyperspectral and Multispectral Image Blind Fusion Based on Deep Tucker Decomposition Network with Spatial-Spectral Manifold Learning | Sep 15, 2024 | | Code Available | 1 |
| SITSMamba for Crop Classification based on Satellite Image Time Series | Sep 15, 2024 | Classification, Crop Classification | Code Available | 1 |
| GLCONet: Learning Multi-source Perception Representation for Camouflaged Object Detection | Sep 15, 2024 | Decoder, object-detection | Code Available | 1 |