| Large Vision-Language Models for Remote Sensing Visual Question Answering | Nov 16, 2024 | Language Modeling | Unverified | 0 |
| Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model | Nov 16, 2024 | Language Modeling | Code Available | 1 |
| MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Nov 16, 2024 | Image Classification | Code Available | 1 |
| Take Package as Language: Anomaly Detection Using Transformer | Nov 15, 2024 | Anomaly Detection, Intrusion Detection | Code Available | 0 |
| Debias your Large Multi-Modal Model at Test-Time with Non-Contrastive Visual Attribute Steering | Nov 15, 2024 | Attribute, Language Modeling | Unverified | 0 |
| Chain of Alignment: Integrating Public Will with Expert Intelligence for Language Model Alignment | Nov 15, 2024 | Language Modeling | Unverified | 0 |
| Leveraging large language models for efficient representation learning for entity resolution | Nov 15, 2024 | Blocking, Contrastive Learning | Unverified | 0 |
| Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving | Nov 15, 2024 | Autonomous Driving, Decision Making | Unverified | 0 |
| Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization | Nov 15, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| TEESlice: Protecting Sensitive Neural Network Models in Trusted Execution Environments When Attackers have Pre-Trained Models | Nov 15, 2024 | GPU, Language Modeling | Unverified | 0 |