| CityLLaVA: Efficient Fine-Tuning for VLMs in City Scenario | May 6, 2024 | Position, Prediction | Code Available | 2 |
| LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model | May 6, 2024 | Motion Generation | Code Available | 2 |
| Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior Knowledge | May 6, 2024 | | Code Available | 2 |
| Foundation Models for Video Understanding: A Survey | May 6, 2024 | Survey, Video Understanding | Code Available | 2 |
| Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom | May 6, 2024 | Fake News Detection, Language Modeling | Code Available | 2 |
| 3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural Representation | May 6, 2024 | Autonomous Vehicles, Decoder | Code Available | 2 |
| Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning | May 6, 2024 | Reinforcement Learning (RL) | Code Available | 2 |
| Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration | May 6, 2024 | | Code Available | 2 |
| Word2World: Generating Stories and Worlds through Large Language Models | May 6, 2024 | Game Design | Code Available | 2 |
| TimeMIL: Advancing Multivariate Time Series Classification via a Time-aware Multiple Instance Learning | May 6, 2024 | Multiple Instance Learning, Time Series | Code Available | 2 |
| DVMSR: Distillated Vision Mamba for Efficient Super-Resolution | May 5, 2024 | Image Super-Resolution, Long-range modeling | Code Available | 2 |
| Self-Reflection in LLM Agents: Effects on Problem-Solving Performance | May 5, 2024 | Multiple-choice | Code Available | 2 |
| Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning | May 5, 2024 | GSM8K, Math | Code Available | 2 |
| iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval | May 5, 2024 | Benchmarking, Composed Image Retrieval (CoIR) | Code Available | 2 |
| Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image Restoration | May 5, 2024 | Color Image Denoising, Image Restoration | Code Available | 2 |
| Parameter-Efficient Fine-Tuning with Discrete Fourier Transform | May 5, 2024 | image-classification, Image Classification | Code Available | 2 |
| UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model | May 4, 2024 | Object, Optical Flow Estimation | Code Available | 2 |
| MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | May 4, 2024 | Earth Observation, image-classification | Code Available | 2 |
| Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records | May 4, 2024 | Information Retrieval, Question Answering | Code Available | 2 |
| PropertyGPT: LLM-driven Formal Verification of Smart Contracts through Retrieval-Augmented Property Generation | May 4, 2024 | In-Context Learning, Retrieval | Code Available | 2 |
| SCIMAP: A Python Toolkit for Integrated Spatial Analysis of Multiplexed Imaging Data | May 3, 2024 | | Code Available | 2 |
| SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation | May 3, 2024 | feature selection | Code Available | 2 |
| Self-Supervised Learning for Real-World Super-Resolution from Dual and Multiple Zoomed Observations | May 3, 2024 | Optical Flow Estimation, Reference-based Super-Resolution | Code Available | 2 |
| Reinforcement Learning control strategies for Electric Vehicles and Renewable energy sources Virtual Power Plants | May 3, 2024 | | Code Available | 2 |
| FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space | May 3, 2024 | Facial Expression Recognition, Facial Expression Recognition (FER) | Code Available | 2 |