| AudioGen: Textually Guided Audio Generation | Sep 30, 2022 | Audio GenerationDescriptive | CodeCode Available | 6 |
| Learning Transferable Spatiotemporal Representations from Natural Script Knowledge | Sep 30, 2022 | DescriptiveRepresentation Learning | CodeCode Available | 1 |
| VREN: Volleyball Rally Dataset with Expression Notation Language | Sep 28, 2022 | Decision MakingDescriptive | CodeCode Available | 0 |
| Medical Image Captioning via Generative Pretrained Transformers | Sep 28, 2022 | Caption GenerationDescriptive | —Unverified | 0 |
| RepsNet: Combining Vision with Language for Automated Medical Reports | Sep 27, 2022 | Contrastive LearningDecoder | —Unverified | 0 |
| Global Semantic Descriptors for Zero-Shot Action Recognition | Sep 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal, Multi-lingual and Multi-Dialect Information Access Research | Sep 23, 2022 | DescriptiveGenre classification | —Unverified | 0 |
| XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages | Sep 22, 2022 | Data-to-Text GenerationDescriptive | —Unverified | 0 |
| Rule-adhering synthetic data -- the lingua franca of learning | Sep 12, 2022 | Descriptive | —Unverified | 0 |
| Multi-Document Scientific Summarization from a Knowledge Graph-Centric View | Sep 9, 2022 | DecoderDescriptive | CodeCode Available | 1 |