| PneumoLLM: Harnessing the Power of Large Language Model for Pneumoconiosis Diagnosis | Dec 6, 2023 | Diagnostic, Language Modeling | Code Available | 1 |
| Creative Agents: Empowering Agents with Imagination for Creative Tasks | Dec 5, 2023 | Instruction Following, Language Modelling | Code Available | 1 |
| ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference | Dec 5, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation | Dec 4, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication | Dec 4, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| StoryGPT-V: Large Language Models as Consistent Story Visualizers | Dec 4, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning | Dec 2, 2023 | Causal Language Modeling, Contrastive Learning | Code Available | 1 |
| OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Nov 30, 2023 | Descriptive, Language Modelling | Code Available | 1 |
| RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance | Nov 30, 2023 | Diagnostic, Language Modeling | Code Available | 1 |
| TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis | Nov 29, 2023 | Hate Speech Detection, Language Modeling | Code Available | 1 |
| Contrastive Vision-Language Alignment Makes Efficient Instruction Learner | Nov 29, 2023 | Contrastive Learning, Image Captioning | Code Available | 1 |
| M^2Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation | Nov 29, 2023 | Image Generation, Language Modelling | Code Available | 1 |
| ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up? | Nov 28, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Compositional Chain-of-Thought Prompting for Large Multimodal Models | Nov 27, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer | Nov 27, 2023 | In-Context Learning, Language Modeling | Code Available | 1 |
| vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training | Nov 27, 2023 | GPU, Language Modeling | Code Available | 1 |
| Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models | Nov 27, 2023 | Cross-Modal Retrieval, Image Generation | Code Available | 1 |
| InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint | Nov 27, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| Paragraph-to-Image Generation with Information-Enriched Diffusion Model | Nov 24, 2023 | Image Generation, Language Modeling | Code Available | 1 |
| Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents | Nov 22, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs | Nov 22, 2023 | document understanding, Instruction Following | Code Available | 1 |
| Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object | Nov 22, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Extracting Definienda in Mathematical Scholarly Articles with Transformers | Nov 21, 2023 | Articles, Language Modeling | Code Available | 1 |
| Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge | Nov 21, 2023 | Large Language Model, Multimodal Deep Learning | Code Available | 1 |
| Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching | Nov 21, 2023 | Drone navigation, geo-localization | Code Available | 1 |
| Oasis: Data Curation and Assessment System for Pretraining of Large Language Models | Nov 21, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Causal Structure Learning Supervised by Large Language Model | Nov 20, 2023 | Causal Discovery, Causal Inference | Code Available | 1 |
| Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks | Nov 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge | Nov 20, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| GeoSAM: Fine-tuning SAM with Multi-Modal Prompts for Mobility Infrastructure Segmentation | Nov 19, 2023 | Image Segmentation, Large Language Model | Code Available | 1 |
| Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections | Nov 17, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines | Nov 17, 2023 | Language Modelling, Large Language Model | Code Available | 1 |
| Language Generation from Brain Recordings | Nov 16, 2023 | Decoder, Language Modelling | Code Available | 1 |
| VideoCon: Robust Video-Language Alignment via Contrast Captions | Nov 15, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Towards Open-Ended Visual Recognition with Large Language Model | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LLatrieval: LLM-Verified Retrieval for Verifiable Generation | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | Benchmarking, Language Modeling | Code Available | 1 |
| Zero-shot audio captioning with audio-language model guidance and audio context keywords | Nov 14, 2023 | Audio captioning, Descriptive | Code Available | 1 |
| MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledge | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models | Nov 13, 2023 | Document-level Relation Extraction, In-Context Learning | Code Available | 1 |
| Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval | Nov 10, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model | Nov 10, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification | Nov 10, 2023 | image-classification, Image Classification | Code Available | 1 |
| CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation | Nov 10, 2023 | Benchmarking, Cloud Computing | Code Available | 1 |
| ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences | Nov 10, 2023 | Dialogue Generation, Language Modeling | Code Available | 1 |
| u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model | Nov 9, 2023 | Instruction Following, Language Modeling | Code Available | 1 |
| Chain of Images for Intuitively Reasoning | Nov 9, 2023 | Common Sense Reasoning, Language Modelling | Code Available | 1 |
| ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI Agents | Nov 6, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| DeepInception: Hypnotize Large Language Model to Be Jailbreaker | Nov 6, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| EmojiLM: Modeling the New Emoji Language | Nov 3, 2023 | Language Modeling, Language Modelling | Code Available | 1 |