| What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights | May 31, 2024 | Descriptive, Self-Supervised Learning | Code Available | 1 | 5 |
| GOAL: Global-local Object Alignment Learning | Mar 22, 2025 | Descriptive, Object | Code Available | 1 | 5 |
| IDAS: Intent Discovery with Abstractive Summarization | May 31, 2023 | Abstractive Text Summarization, Descriptive | Code Available | 1 | 5 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | Attribute, Descriptive | Code Available | 1 | 5 |
| Can Knowledge Graphs Simplify Text? | Aug 14, 2023 | Descriptive, KG-to-Text Generation | Code Available | 1 | 5 |
| From Artificially Real to Real: Leveraging Pseudo Data from Large Language Models for Low-Resource Molecule Discovery | Sep 11, 2023 | Descriptive, Domain Adaptation | Code Available | 1 | 5 |
| First Steps of an Approach to the ARC Challenge based on Descriptive Grid Models and the Minimum Description Length Principle | Dec 1, 2021 | ARC, Descriptive | Code Available | 1 | 5 |
| Finetune like you pretrain: Improved finetuning of zero-shot vision models | Dec 1, 2022 | Descriptive, Few-Shot Learning | Code Available | 1 | 5 |
| From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering | May 30, 2022 | counterfactual, Descriptive | Code Available | 1 | 5 |
| A Visual Analytics Framework for Explaining and Diagnosing Transfer Learning Processes | Sep 15, 2020 | Deep Learning, Descriptive | Code Available | 1 | 5 |
| A Variational Algorithm for Quantum Neural Networks | Jun 15, 2020 | Descriptive, General Classification | Code Available | 1 | 5 |
| Zero-Shot Compositional Policy Learning via Language Grounding | Apr 15, 2020 | Descriptive, Domain Adaptation | Code Available | 1 | 5 |
| Automatic Generation of Topic Labels | May 29, 2020 | Descriptive, Information Retrieval | Code Available | 1 | 5 |
| Beyond Co-occurrence: Multi-modal Session-based Recommendation | Sep 29, 2023 | Contrastive Learning, Descriptive | Code Available | 1 | 5 |
| Bias Loss for Mobile Neural Networks | Jul 23, 2021 | Descriptive, Diversity | Code Available | 1 | 5 |
| Natural scene reconstruction from fMRI signals using generative latent diffusion | Mar 9, 2023 | Brain Computer Interface, Brain Decoding | Code Available | 1 | 5 |
| A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D Faces | Jun 6, 2020 | Descriptive, Face Model | Code Available | 1 | 5 |
| FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models | Nov 2, 2023 | Descriptive, Instruction Following | Code Available | 1 | 5 |
| FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes | Oct 15, 2021 | Descriptive, Image Classification | Code Available | 1 | 5 |
| FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant | Aug 19, 2024 | Descriptive, Face Swapping | Code Available | 1 | 5 |
| Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts | Jul 21, 2023 | Descriptive, Prompt Engineering | Code Available | 1 | 5 |
| Comprehensive Information Integration Modeling Framework for Video Titling | Jun 24, 2020 | Descriptive, Video Captioning | Code Available | 1 | 5 |
| CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving | Jul 26, 2022 | 3D Semantic Segmentation, Autonomous Driving | Code Available | 1 | 5 |
| Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search | Feb 2, 2021 | Descriptive, Image Generation | Code Available | 1 | 5 |
| Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding | May 10, 2025 | Descriptive, Emotion Recognition | Code Available | 1 | 5 |