| ICARUS: An Android-Based Unmanned Aerial Vehicle (UAV) Search and Rescue Eye in the Sky | Aug 29, 2023 | Descriptive | —Unverified | 0 |
| Interpretable Image Quality Assessment via CLIP with Multiple Antonym-Prompt Pairs | Aug 24, 2023 | DescriptiveImage Quality Assessment | —Unverified | 0 |
| Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation | Aug 24, 2023 | cross-modal alignmentDescriptive | CodeCode Available | 1 |
| ViCo: Engaging Video Comment Generation with Human Preference Rewards | Aug 22, 2023 | Caption GenerationComment Generation | —Unverified | 0 |
| CiteTracker: Correlating Image and Text for Visual Tracking | Aug 22, 2023 | AttributeDescriptive | CodeCode Available | 1 |
| Data-Driven Reachability Analysis of Pedestrians Using Behavior Modes | Aug 21, 2023 | Descriptive | —Unverified | 0 |
| Epicure: Distilling Sequence Model Predictions into Patterns | Aug 16, 2023 | Descriptivemodel | —Unverified | 0 |
| TeCH: Text-guided Reconstruction of Lifelike Clothed Humans | Aug 16, 2023 | DescriptiveQuestion Answering | CodeCode Available | 2 |
| A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision | Aug 15, 2023 | DescriptiveLanguage Modelling | CodeCode Available | 1 |
| Can Knowledge Graphs Simplify Text? | Aug 14, 2023 | DescriptiveKG-to-Text Generation | CodeCode Available | 1 |