| ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions | Jan 5, 2023 | Articles, Benchmarking | Code Available | 0 |
| Towards Table-to-Text Generation with Pretrained Language Model: A Table Structure Understanding and Text Deliberating Approach | Jan 5, 2023 | Decoder, Descriptive | Code Available | 0 |
| ReGen: A good Generative Zero-Shot Video Classifier Should be Rewarded | Jan 1, 2023 | Action Classification, Action Recognition | Unverified | 0 |
| LeaF: Learning Frames for 4D Point Cloud Sequence Understanding | Jan 1, 2023 | Descriptive | Unverified | 0 |
| Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding | Jan 1, 2023 | Descriptive, Object | Code Available | 1 |
| Towards Modality-Agnostic Person Re-Identification With Descriptive Query | Jan 1, 2023 | Descriptive, Person Re-Identification | Code Available | 1 |
| L-CoIns: Language-Based Colorization With Instance Awareness | Jan 1, 2023 | Colorization, Descriptive | Unverified | 0 |
| ViewNet: A Novel Projection-Based Backbone With View Pooling for Few-Shot Point Cloud Classification | Jan 1, 2023 | Descriptive, Few-Shot Learning | Code Available | 1 |
| MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding | Dec 26, 2022 | Decoder, Descriptive | Unverified | 0 |
| Risk assessment and mitigation of e-scooter crashes with naturalistic driving data | Dec 24, 2022 | Descriptive | Unverified | 0 |