LLaMA: Open and Efficient Foundation Language Models Feb 27, 2023 Arithmetic Reasoning Code Generation
Code Code Available 7GPT-4 Technical Report Mar 15, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 6MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments Feb 1, 2024 Embodied Question Answering Language Modeling
Code Code Available 5ImageBind: One Embedding Space To Bind Them All May 9, 2023 All Cross-Modal Retrieval
Code Code Available 5Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese Nov 2, 2022 Contrastive Learning image-classification
Code Code Available 5Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively Jan 5, 2024 image-classification Image Classification
Code Code Available 5Zephyr: Direct Distillation of LM Alignment Oct 25, 2023 2D Cyclist Detection Few-Shot Learning
Code Code Available 5MEDITRON-70B: Scaling Medical Pretraining for Large Language Models Nov 27, 2023 Articles Conditional Text Generation
Code Code Available 4The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot Jun 29, 2023 Image Segmentation Semantic Segmentation
Code Code Available 4Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective Oct 16, 2022 Coreference Resolution Multiple-choice
Code Code Available 4Flamingo: a Visual Language Model for Few-Shot Learning Apr 29, 2022 Few-Shot Learning Generative Visual Question Answering
Code Code Available 4A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges Jan 4, 2025 Fairness Hallucination
Code Code Available 4FG-CLIP: Fine-Grained Visual and Textual Alignment May 8, 2025 Image-text Retrieval object-detection
Code Code Available 4Long-CLIP: Unlocking the Long-Text Capability of CLIP Mar 22, 2024 Image Generation Image Retrieval
Code Code Available 4Multi-label Cluster Discrimination for Visual Representation Learning Jul 24, 2024 Contrastive Learning Image-text Retrieval
Code Code Available 4Zero-shot forecasting of chaotic systems Sep 24, 2024 Attribute In-Context Learning
Code Code Available 4Multimodal Whole Slide Foundation Model for Pathology Nov 29, 2024 Cross-Modal Retrieval model
Code Code Available 4Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning May 23, 2025 Decoder Image Captioning
Code Code Available 4Time-LLM: Time Series Forecasting by Reprogramming Large Language Models Oct 3, 2023 Time Series Time Series Forecasting
Code Code Available 4Finetuned Language Models Are Zero-Shot Learners Sep 3, 2021 ARC Common Sense Reasoning
Code Code Available 3LLM-Pruner: On the Structural Pruning of Large Language Models May 19, 2023 Text Generation zero-shot-classification
Code Code Available 3Description Boosting for Zero-Shot Entity and Relation Classification Jun 4, 2024 Relation Relation Classification
Code Code Available 3AnyGraph: Graph Foundation Model in the Wild Aug 20, 2024 Graph Learning Mixture-of-Experts
Code Code Available 3Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters Mar 18, 2024 Continual Learning Incremental Learning
Code Code Available 3Language Models are Few-Shot Learners May 28, 2020 answerability prediction Articles
Code Code Available 3MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories Jun 5, 2025 Benchmarking Optical Character Recognition
Code Code Available 2LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings Aug 25, 2024 Language Modelling Link Prediction
Code Code Available 2MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale Aug 29, 2024 Deep Reinforcement Learning Imitation Learning
Code Code Available 2BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Jun 30, 2022 Diversity Language Model Evaluation
Code Code Available 2Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP Jun 25, 2024 cross-modal alignment Image Classification
Code Code Available 2Learning Transferable Visual Models From Natural Language Supervision Feb 26, 2021 Action Recognition Benchmarking
Code Code Available 2Is ChatGPT a General-Purpose Natural Language Processing Task Solver? Feb 8, 2023 Arithmetic Reasoning Zero-Shot Learning
Code Code Available 2BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Jan 13, 2025 Articles Image-text Retrieval
Code Code Available 2BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning Mar 3, 2022 Compositional Zero-Shot Learning Contrastive Learning
Code Code Available 2Improving CLIP Fine-tuning Performance Jan 1, 2023 Diagnostic object-detection
Code Code Available 2Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding Jan 24, 2025 Anatomy Contrastive Learning
Code Code Available 2GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models May 30, 2025 Classification Disaster Response
Code Code Available 2Audio-FLAN: A Preliminary Release Feb 23, 2025 Zero-Shot Learning
Code Code Available 2ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation Jul 19, 2024 Decoder Image Segmentation
Code Code Available 2Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification Sep 1, 2024 Scene Classification Transductive Zero-Shot Classification
Code Code Available 2Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning May 31, 2023 Decision Making General Knowledge
Code Code Available 2DreamLLM: Synergistic Multimodal Comprehension and Creation Sep 20, 2023 multimodal generation Visual Question Answering
Code Code Available 2Active Prompting with Chain-of-Thought for Large Language Models Feb 23, 2023 Active Learning Zero-Shot Learning
Code Code Available 2EasyRec: Simple yet Effective Language Models for Recommendation Aug 16, 2024 Collaborative Filtering Contrastive Learning
Code Code Available 2FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models Jul 1, 2024 Benchmarking Fairness
Code Code Available 2VeCLIP: Improving CLIP Training via Visual-enriched Captions Oct 11, 2023 Image-text Retrieval Retrieval
Code Code Available 2GraphGPT: Graph Instruction Tuning for Large Language Models Oct 19, 2023 Data Augmentation Graph Learning
Code Code Available 2Cross-lingual Contextualized Topic Models with Zero-shot Learning Apr 16, 2020 Topic Models Transfer Learning
Code Code Available 2CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Nov 15, 2024 Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation
Code Code Available 2Crosslingual Generalization through Multitask Finetuning Nov 3, 2022 Coreference Resolution Cross-Lingual Transfer
Code Code Available 2