| ViewNet: A Novel Projection-Based Backbone With View Pooling for Few-Shot Point Cloud Classification | Jan 1, 2023 | DescriptiveFew-Shot Learning | CodeCode Available | 1 |
| Finetune like you pretrain: Improved finetuning of zero-shot vision models | Dec 1, 2022 | DescriptiveFew-Shot Learning | CodeCode Available | 1 |
| Language-Assisted 3D Feature Learning for Semantic Scene Understanding | Nov 25, 2022 | DescriptiveInstance Segmentation | CodeCode Available | 1 |
| Visual Classification via Description from Large Language Models | Oct 13, 2022 | ClassificationDescriptive | CodeCode Available | 1 |
| OpenCQA: Open-ended Question Answering with Charts | Oct 12, 2022 | Arithmetic ReasoningDescriptive | CodeCode Available | 1 |
| EgoTaskQA: Understanding Human Tasks in Egocentric Videos | Oct 8, 2022 | Action Localizationcounterfactual | CodeCode Available | 1 |
| Learning Transferable Spatiotemporal Representations from Natural Script Knowledge | Sep 30, 2022 | DescriptiveRepresentation Learning | CodeCode Available | 1 |
| Multi-Document Scientific Summarization from a Knowledge Graph-Centric View | Sep 9, 2022 | DecoderDescriptive | CodeCode Available | 1 |
| Contrastive Audio-Language Learning for Music | Aug 25, 2022 | Audio to Text RetrievalDescriptive | CodeCode Available | 1 |
| VAuLT: Augmenting the Vision-and-Language Transformer for Sentiment Classification on Social Media | Aug 18, 2022 | DescriptiveDiversity | CodeCode Available | 1 |