| S^2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification | Apr 28, 2024 | Hyperspectral Image Classificationimage-classification | CodeCode Available | 2 |
| Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment | Apr 28, 2024 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 2 |
| WorldGPT: Empowering LLM as Multimodal World Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Paint by Inpaint: Learning to Add Image Objects by Removing Them First | Apr 28, 2024 | Image InpaintingLanguage Modeling | CodeCode Available | 2 |
| LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing | Apr 27, 2024 | Log Parsing | CodeCode Available | 2 |
| FRAME: A Modular Framework for Autonomous Map Merging: Advancements in the Field | Apr 27, 2024 | Point Cloud Registration | CodeCode Available | 2 |
| Generative Diffusion-based Downscaling for Climate | Apr 27, 2024 | Super-Resolution | CodeCode Available | 2 |
| UniRGB-IR: A Unified Framework for RGB-Infrared Semantic Tasks via Adapter Tuning | Apr 26, 2024 | Multispectral Object DetectionPedestrian Detection | CodeCode Available | 2 |
| PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games | Apr 26, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations | Apr 26, 2024 | Imitation Learning | CodeCode Available | 2 |
| Embedded FPGA Developments in 130nm and 28nm CMOS for Machine Learning in Particle Detector Readout | Apr 26, 2024 | | CodeCode Available | 2 |
| IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages | Apr 25, 2024 | Cross-Lingual Question AnsweringDiversity | CodeCode Available | 2 |
| Learning Visuotactile Skills with Two Multifingered Hands | Apr 25, 2024 | | CodeCode Available | 2 |
| REBEL: Reinforcement Learning via Regressing Relative Rewards | Apr 25, 2024 | continuous-controlContinuous Control | CodeCode Available | 2 |
| OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search | Apr 25, 2024 | Entity EmbeddingsImage Captioning | CodeCode Available | 2 |
| CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions | Apr 25, 2024 | MambaMultispectral Object Detection | CodeCode Available | 2 |
| Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents | Apr 25, 2024 | Decision MakingSpecificity | CodeCode Available | 2 |
| A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Apr 25, 2024 | Autonomous DrivingEvolutionary Algorithms | CodeCode Available | 2 |
| Commonsense Prototype for Outdoor Unsupervised 3D Object Detection | Apr 25, 2024 | 3D Object DetectionObject | CodeCode Available | 2 |
| Weak-to-Strong Extrapolation Expedites Alignment | Apr 25, 2024 | | CodeCode Available | 2 |
| Multi-Scale Representations by Varying Window Attention for Semantic Segmentation | Apr 25, 2024 | DecoderSemantic Segmentation | CodeCode Available | 2 |
| EEG-Deformer: A Dense Convolutional Transformer for Brain-computer Interfaces | Apr 25, 2024 | EEGElectroencephalogram (EEG) | CodeCode Available | 2 |
| List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs | Apr 25, 2024 | Visual GroundingVisual Question Answering | CodeCode Available | 2 |
| Latent Modulated Function for Computational Optimal Continuous Image Representation | Apr 25, 2024 | Computational EfficiencySuper-Resolution | CodeCode Available | 2 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Apr 25, 2024 | Exemplar-Free CountingFew-shot Object Counting and Detection | CodeCode Available | 2 |