| Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition | Dec 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PneumoLLM: Harnessing the Power of Large Language Model for Pneumoconiosis Diagnosis | Dec 6, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 |
| Teaching Specific Scientific Knowledge into Large Language Models through Additional Training | Dec 6, 2023 | Hyperparameter OptimizationLanguage Modeling | CodeCode Available | 0 |
| GPT vs Human for Scientific Reviews: A Dual Source Review on Applications of ChatGPT in Science | Dec 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Customization Assistant for Text-to-image Generation | Dec 5, 2023 | DescriptiveImage Generation | CodeCode Available | 2 |
| Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models | Dec 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Hardware Evaluation Framework for Large Language Model Inference | Dec 5, 2023 | GPULanguage Modeling | —Unverified | 0 |
| EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model | Dec 5, 2023 | Boundary DetectionLanguage Modeling | —Unverified | 0 |
| Weakly Supervised Detection of Hallucinations in LLM Activations | Dec 5, 2023 | HallucinationLanguage Modeling | CodeCode Available | 5 |
| mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs | Dec 5, 2023 | GPULarge Language Model | CodeCode Available | 2 |
| Creative Agents: Empowering Agents with Imagination for Creative Tasks | Dec 5, 2023 | Instruction FollowingLanguage Modelling | CodeCode Available | 1 |
| FG-MDM: Towards Zero-Shot Human Motion Generation via ChatGPT-Refined Descriptions | Dec 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Intelligent Virtual Assistants with LLM-based Process Automation | Dec 4, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication | Dec 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MedXChat: A Unified Multimodal Large Language Model Framework towards CXRs Understanding and Generation | Dec 4, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Evaluating Dependencies in Fact Editing for Language Models: Specificity and Implication Awareness | Dec 4, 2023 | knowledge editingLanguage Modeling | CodeCode Available | 0 |
| StoryGPT-V: Large Language Models as Consistent Story Visualizers | Dec 4, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and Advanced Decoding Techniques | Dec 4, 2023 | DiversityLanguage Modeling | —Unverified | 0 |
| InstructTA: Instruction-Tuned Targeted Attack for Large Vision-Language Models | Dec 4, 2023 | Adversarial AttackLanguage Modelling | CodeCode Available | 0 |
| Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation | Dec 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Dec 4, 2023 | Dense CaptioningHighlight Detection | CodeCode Available | 2 |
| Jellyfish: A Large Language Model for Data Preprocessing | Dec 4, 2023 | GPUImputation | —Unverified | 0 |
| Unleashing the Potential of Large Language Model: Zero-shot VQA for Flood Disaster Scenario | Dec 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly | Dec 4, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis | Dec 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Beginner to Expert: Modeling Medical Knowledge into General LLMs | Dec 2, 2023 | Language ModellingLarge Language Model | —Unverified | 0 |
| Self Generated Wargame AI: Double Layer Agent Task Planning Based on Large Language Model | Dec 2, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning | Dec 2, 2023 | Causal Language ModelingContrastive Learning | CodeCode Available | 1 |
| Conceptual Engineering Using Large Language Models | Dec 1, 2023 | ClassificationLanguage Modeling | CodeCode Available | 0 |
| Zero-Shot Video Question Answering with Procedural Programs | Dec 1, 2023 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Hyperparameter Optimization for Large Language Model Instruction-Tuning | Dec 1, 2023 | Hyperparameter OptimizationLanguage Modeling | —Unverified | 0 |
| LinguaLinked: A Distributed Large Language Model Inference System for Mobile Devices | Dec 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses | Dec 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games | Dec 1, 2023 | AI AgentIn-Context Learning | —Unverified | 0 |
| A Bayesian approach for prompt optimization in pre-trained language models | Dec 1, 2023 | Bayesian OptimizationCombinatorial Optimization | —Unverified | 0 |
| Evaluating Large Language Model Creativity from a Literary Perspective | Nov 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LucidDreaming: Controllable Object-Centric 3D Generation | Nov 30, 2023 | 3D GenerationBenchmarking | —Unverified | 0 |
| OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Nov 30, 2023 | DescriptiveLanguage Modelling | CodeCode Available | 1 |
| ArthModel: Enhance Arithmetic Skills to Large Language Model | Nov 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model | Nov 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| COVID-19 Vaccine Misinformation in Middle Income Countries | Nov 30, 2023 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| Detailed Human-Centric Text Description-Driven Large Scene Synthesis | Nov 30, 2023 | Image GenerationLanguage Modeling | —Unverified | 0 |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Nov 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance | Nov 30, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 |
| CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation | Nov 30, 2023 | Image GenerationIn-Context Learning | —Unverified | 0 |
| LEAP: LLM-Generation of Egocentric Action Programs | Nov 29, 2023 | Action RecognitionLanguage Modeling | —Unverified | 0 |
| TurkishBERTweet: Fast and Reliable Large Language Model for Social Media Analysis | Nov 29, 2023 | Hate Speech DetectionLanguage Modeling | CodeCode Available | 1 |
| M^2Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation | Nov 29, 2023 | Image GenerationLanguage Modelling | CodeCode Available | 1 |
| Contrastive Vision-Language Alignment Makes Efficient Instruction Learner | Nov 29, 2023 | Contrastive LearningImage Captioning | CodeCode Available | 1 |