M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models Mar 31, 2024 Image-text Retrieval Language Modeling
Code Code Available 3PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models Mar 26, 2024 Code Completion Few-Shot Learning
Code Code Available 3The Unreasonable Ineffectiveness of the Deeper Layers Mar 26, 2024 GPU Quantization
Code Code Available 3Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity Mar 21, 2024 Question Answering RAG
Code Code Available 3AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework Mar 19, 2024 Benchmarking Financial Analysis
Code Code Available 3Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context Mar 8, 2024 1 Image, 2*2 Stitching Code Generation
Code Code Available 3Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review Mar 4, 2024 Medical Report Generation Question Answering
Code Code Available 3Towards Building Multilingual Language Model for Medicine Feb 21, 2024 Domain Adaptation Language Modeling
Code Code Available 3ALLaVA: Harnessing GPT4V-Synthesized Data for Lite Vision-Language Models Feb 18, 2024 Language Modelling Question Answering
Code Code Available 3PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers Feb 13, 2024 Question Answering Retrieval
Code Code Available 3Q-Bench+: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to Pairs Feb 11, 2024 Image Quality Assessment Question Answering
Code Code Available 3A Survey of Large Language Models in Finance (FinLLMs) Feb 4, 2024 Named Entity Recognition (NER) Question Answering
Code Code Available 3CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models Jan 30, 2024 Knowledge Base Construction Question Answering
Code Code Available 3AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning Jan 10, 2024 Question Answering
Code Code Available 3LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning Jan 1, 2024 3D dense captioning Dense Captioning
Code Code Available 3TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones Dec 28, 2023 Computational Efficiency Image Captioning
Code Code Available 3DriveLM: Driving with Graph Visual Question Answering Dec 21, 2023 Autonomous Driving Question Answering
Code Code Available 3Generative Multimodal Models are In-Context Learners Dec 20, 2023 In-Context Learning Personalized Image Generation
Code Code Available 3FinanceBench: A New Benchmark for Financial Question Answering Nov 20, 2023 How to refund a wrong transaction in PhonePe Question Answering
Code Code Available 3Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models Nov 11, 2023 Image Captioning MMR total
Code Code Available 3SALMONN: Towards Generic Hearing Abilities for Large Language Models Oct 20, 2023 Audio captioning Automatic Speech Recognition
Code Code Available 3Evaluating Hallucinations in Chinese Large Language Models Oct 5, 2023 Hallucination Question Answering
Code Code Available 3Generative Data Augmentation using LLMs improves Distributional Robustness in Question Answering Sep 3, 2023 Data Augmentation Domain Adaptation
Code Code Available 3Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs Aug 23, 2023 counterfactual Question Answering
Code Code Available 33D-LLM: Injecting the 3D World into Large Language Models Jul 24, 2023 3D Object Captioning 3D Question Answering (3D-QA)
Code Code Available 3Emu: Generative Pretraining in Multimodality Jul 11, 2023 Image Captioning Image Generation
Code Code Available 3SVIT: Scaling up Visual Instruction Tuning Jul 9, 2023 Diversity Image Captioning
Code Code Available 3WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences Jun 13, 2023 Language Modeling Language Modelling
Code Code Available 3Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models Jun 8, 2023 Question Answering VCGBench-Diverse
Code Code Available 3Self-QA: Unsupervised Knowledge Guided Language Model Alignment May 19, 2023 Diversity Language Modeling
Code Code Available 3ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities May 18, 2023 1 Image, 2*2 Stitchi Action Classification
Code Code Available 3Visual Causal Scene Refinement for Video Question Answering May 7, 2023 Contrastive Learning Question Answering
Code Code Available 3REPLUG: Retrieval-Augmented Black-Box Language Models Jan 30, 2023 Language Modeling Language Modelling
Code Code Available 3ThoughtSource: A central hub for large language model reasoning data Jan 27, 2023 Language Modeling Language Modelling
Code Code Available 3Champion Solution for the WSDM2023 Toloka VQA Challenge Jan 22, 2023 Question Answering Visual Grounding
Code Code Available 3TextBox 2.0: A Text Generation Library with Pre-trained Language Models Dec 26, 2022 Abstractive Text Summarization Data-to-Text Generation
Code Code Available 3Prompting Is Programming: A Query Language for Large Language Models Dec 12, 2022 Code Generation Language Modeling
Code Code Available 3A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and Multimodal Dec 12, 2022 General Knowledge Graph Embedding
Code Code Available 3Scaling Instruction-Finetuned Language Models Oct 20, 2022 Coreference Resolution Cross-Lingual Question Answering
Code Code Available 3Vision-Language Pre-training: Basics, Recent Advances, and Future Trends Oct 17, 2022 Few-Shot Learning Image Captioning
Code Code Available 3Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought Oct 3, 2022 Mathematical Reasoning Question Answering
Code Code Available 3Time-series Transformer Generative Adversarial Networks May 23, 2022 Question Answering Time Series
Code Code Available 3All You May Need for VQA are Image Captions May 4, 2022 All Image Captioning
Code Code Available 3ST-MoE: Designing Stable and Transferable Sparse Expert Models Feb 17, 2022 ARC Common Sense Reasoning
Code Code Available 3Finetuned Language Models Are Zero-Shot Learners Sep 3, 2021 ARC Common Sense Reasoning
Code Code Available 3Language Models are Few-Shot Learners May 28, 2020 answerability prediction Articles
Code Code Available 3Longformer: The Long-Document Transformer Apr 10, 2020 Decoder Language Modeling
Code Code Available 3ERNIE 2.0: A Continual Pre-training Framework for Language Understanding Jul 29, 2019 Chinese Named Entity Recognition Chinese Reading Comprehension
Code Code Available 3Generating Long Sequences with Sparse Transformers Apr 23, 2019 Diversity Image Generation
Code Code Available 3ERNIE: Enhanced Representation through Knowledge Integration Apr 19, 2019 Chinese Named Entity Recognition Chinese Sentence Pair Classification
Code Code Available 3