LLaMA: Open and Efficient Foundation Language Models Feb 27, 2023 Arithmetic Reasoning Code Generation
Code Code Available 75 GPT-4 Technical Report Mar 15, 2023 answerability prediction Arithmetic Reasoning
Code Code Available 65 Zephyr: Direct Distillation of LM Alignment Oct 25, 2023 2D Cyclist Detection Few-Shot Learning
Code Code Available 55 ImageBind: One Embedding Space To Bind Them All May 9, 2023 All Cross-Modal Retrieval
Code Code Available 55 Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively Jan 5, 2024 image-classification Image Classification
Code Code Available 55 MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments Feb 1, 2024 Embodied Question Answering Language Modeling
Code Code Available 55 Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese Nov 2, 2022 Contrastive Learning image-classification
Code Code Available 55 MEDITRON-70B: Scaling Medical Pretraining for Large Language Models Nov 27, 2023 Articles Conditional Text Generation
Code Code Available 45 Multimodal Whole Slide Foundation Model for Pathology Nov 29, 2024 Cross-Modal Retrieval model
Code Code Available 45 The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot Jun 29, 2023 Image Segmentation Semantic Segmentation
Code Code Available 45 Flamingo: a Visual Language Model for Few-Shot Learning Apr 29, 2022 Few-Shot Learning Generative Visual Question Answering
Code Code Available 45 A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges Jan 4, 2025 Fairness Hallucination
Code Code Available 45 FG-CLIP: Fine-Grained Visual and Textual Alignment May 8, 2025 Image-text Retrieval object-detection
Code Code Available 45 Long-CLIP: Unlocking the Long-Text Capability of CLIP Mar 22, 2024 Image Generation Image Retrieval
Code Code Available 45 Multi-label Cluster Discrimination for Visual Representation Learning Jul 24, 2024 Contrastive Learning Image-text Retrieval
Code Code Available 45 Zero-shot forecasting of chaotic systems Sep 24, 2024 Attribute In-Context Learning
Code Code Available 45 Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective Oct 16, 2022 Coreference Resolution Multiple-choice
Code Code Available 45 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models Oct 3, 2023 Time Series Time Series Forecasting
Code Code Available 45 Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning May 23, 2025 Decoder Image Captioning
Code Code Available 45 Finetuned Language Models Are Zero-Shot Learners Sep 3, 2021 ARC Common Sense Reasoning
Code Code Available 35 LLM-Pruner: On the Structural Pruning of Large Language Models May 19, 2023 Text Generation zero-shot-classification
Code Code Available 35 Description Boosting for Zero-Shot Entity and Relation Classification Jun 4, 2024 Relation Relation Classification
Code Code Available 35 AnyGraph: Graph Foundation Model in the Wild Aug 20, 2024 Graph Learning Mixture-of-Experts
Code Code Available 35 Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters Mar 18, 2024 Continual Learning Incremental Learning
Code Code Available 35 Language Models are Few-Shot Learners May 28, 2020 answerability prediction Articles
Code Code Available 35 MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories Jun 5, 2025 Benchmarking Optical Character Recognition
Code Code Available 25 LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings Aug 25, 2024 Language Modelling Link Prediction
Code Code Available 25 MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale Aug 29, 2024 Deep Reinforcement Learning Imitation Learning
Code Code Available 25 BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Jun 30, 2022 Diversity Language Model Evaluation
Code Code Available 25 Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP Jun 25, 2024 cross-modal alignment Image Classification
Code Code Available 25 Learning Transferable Visual Models From Natural Language Supervision Feb 26, 2021 Action Recognition Benchmarking
Code Code Available 25 Is ChatGPT a General-Purpose Natural Language Processing Task Solver? Feb 8, 2023 Arithmetic Reasoning Zero-Shot Learning
Code Code Available 25 BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Jan 13, 2025 Articles Image-text Retrieval
Code Code Available 25 BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning Mar 3, 2022 Compositional Zero-Shot Learning Contrastive Learning
Code Code Available 25 Improving CLIP Fine-tuning Performance Jan 1, 2023 Diagnostic object-detection
Code Code Available 25 Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding Jan 24, 2025 Anatomy Contrastive Learning
Code Code Available 25 GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models May 30, 2025 Classification Disaster Response
Code Code Available 25 Audio-FLAN: A Preliminary Release Feb 23, 2025 Zero-Shot Learning
Code Code Available 25 ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation Jul 19, 2024 Decoder Image Segmentation
Code Code Available 25 Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification Sep 1, 2024 Scene Classification Transductive Zero-Shot Classification
Code Code Available 25 Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning May 31, 2023 Decision Making General Knowledge
Code Code Available 25 DreamLLM: Synergistic Multimodal Comprehension and Creation Sep 20, 2023 multimodal generation Visual Question Answering
Code Code Available 25 Active Prompting with Chain-of-Thought for Large Language Models Feb 23, 2023 Active Learning Zero-Shot Learning
Code Code Available 25 EasyRec: Simple yet Effective Language Models for Recommendation Aug 16, 2024 Collaborative Filtering Contrastive Learning
Code Code Available 25 FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models Jul 1, 2024 Benchmarking Fairness
Code Code Available 25 VeCLIP: Improving CLIP Training via Visual-enriched Captions Oct 11, 2023 Image-text Retrieval Retrieval
Code Code Available 25 GraphGPT: Graph Instruction Tuning for Large Language Models Oct 19, 2023 Data Augmentation Graph Learning
Code Code Available 25 Cross-lingual Contextualized Topic Models with Zero-shot Learning Apr 16, 2020 Topic Models Transfer Learning
Code Code Available 25 CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Nov 15, 2024 Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation
Code Code Available 25 Crosslingual Generalization through Multitask Finetuning Nov 3, 2022 Coreference Resolution Cross-Lingual Transfer
Code Code Available 25