| Fewer Truncations Improve Language Modeling | Apr 16, 2024 | Combinatorial Optimization, Hallucination | Unverified | 0 |
| Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering | Apr 16, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training | Apr 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| From a Lossless (~1.5:1) Compression Algorithm for Llama2 7B Weights to Variable Precision, Variable Range, Compressed Numeric Data Types for CNNs and LLMs | Apr 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Generative Text Steganography with Large Language Model | Apr 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Apr 16, 2024 | Feature Engineering, Language Modeling | Code Available | 3 |
| HLAT: High-quality Large Language Model Pre-trained on AWS Trainium | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Spiral of Silence: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering | Apr 16, 2024 | Information Retrieval, Language Modeling | Code Available | 1 |
| Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| White Men Lead, Black Women Help? Benchmarking and Mitigating Language Agency Social Biases in LLMs | Apr 16, 2024 | Benchmarking, Language Modelling | Unverified | 0 |
| Exact and Efficient Unlearning for Large Language Model-based Recommendation | Apr 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ChatShop: Interactive Information Seeking with Language Agents | Apr 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| in2IN: Leveraging individual Information to Generate Human INteractions | Apr 15, 2024 | Diversity, Language Modelling | Code Available | 2 |
| Memory Sharing for Large Language Model based Agents | Apr 15, 2024 | Common Sense Reasoning, Diversity | Code Available | 1 |
| Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection | Apr 15, 2024 | Anomaly Detection, Anomaly Localization | Unverified | 0 |
| Evolving Interpretable Visual Classifiers with Large Language Models | Apr 15, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| A Self-feedback Knowledge Elicitation Approach for Chemical Reaction Predictions | Apr 15, 2024 | Chemical Reaction Prediction, Drug Discovery | Code Available | 0 |
| UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark | Apr 15, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model | Apr 15, 2024 | Imitation Learning, Language Modeling | Unverified | 0 |
| TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model | Apr 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection | Apr 14, 2024 | Dense Captioning, Language Modelling | Unverified | 0 |
| Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts | Apr 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Test Code Generation for Telecom Software Systems using Two-Stage Generative Model | Apr 14, 2024 | Code Generation, Language Modeling | Unverified | 0 |
| JaFIn: Japanese Financial Instruction Dataset | Apr 14, 2024 | Domain Adaptation, Language Modeling | Unverified | 0 |
| Self-Selected Attention Span for Accelerating Large Language Model Inference | Apr 14, 2024 | Articles, Language Modeling | Unverified | 0 |
| Generative AI Agents with Large Language Model for Satellite Networks via a Mixture of Experts Transmission | Apr 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LLMSat: A Large Language Model-Based Goal-Oriented Agent for Autonomous Space Exploration | Apr 13, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Leveraging Large Language Model as Simulated Patients for Clinical Education | Apr 13, 2024 | Diagnostic, Language Modeling | Unverified | 0 |
| MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Apr 13, 2024 | Diversity, Language Modeling | Code Available | 5 |
| On Speculative Decoding for Multimodal Large Language Models | Apr 13, 2024 | Image Captioning, Language Modeling | Unverified | 0 |
| Generative AI Agent for Next-Generation MIMO Design: Fundamentals, Challenges, and Vision | Apr 13, 2024 | AI Agent, Language Modeling | Unverified | 0 |
| Adapting Mental Health Prediction Tasks for Cross-lingual Learning via Meta-Training and In-context Learning with Large Language Model | Apr 13, 2024 | Cross-Lingual Transfer, In-Context Learning | Unverified | 0 |
| CUDA-Accelerated Soft Robot Neural Evolution with Large Language Model Supervision | Apr 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Measuring the Quality of Answers in Political Q&As with Large Language Models | Apr 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Apr 12, 2024 | Image Segmentation, Language Modeling | Code Available | 2 |
| Inverse Kinematics for Neuro-Robotic Grasping with Humanoid Embodied Agents | Apr 12, 2024 | Language Modelling, Large Language Model | Code Available | 1 |
| Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian | Apr 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Pretraining and Updates of Domain-Specific LLM: A Case Study in the Japanese Business Domain | Apr 12, 2024 | Continual Pretraining, General Knowledge | Unverified | 0 |
| Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation | Apr 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| The Future of Scientific Publishing: Automated Article Generation | Apr 11, 2024 | Articles, Language Modeling | Unverified | 0 |
| Human Latency Conversational Turns for Spoken Avatar Systems | Apr 11, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Introducing L2M3, A Multilingual Medical Large Language Model to Advance Health Equity in Low-Resource Regions | Apr 11, 2024 | Diagnostic, Language Modeling | Unverified | 0 |
| CEM: A Data-Efficient Method for Large Language Models to Continue Evolving From Mistakes | Apr 11, 2024 | Continual Learning, Continual Pretraining | Unverified | 0 |
| LaVy: Vietnamese Multimodal Large Language Model | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Audio Dialogues: Dialogues dataset for audio and music understanding | Apr 11, 2024 | Audio captioning, Audio Question Answering | Unverified | 0 |
| Scalable Language Model with Generalized Continual Learning | Apr 11, 2024 | Continual Learning, Language Modeling | Code Available | 1 |
| Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 9 |
| A Multi-Expert Large Language Model Architecture for Verilog Code Generation | Apr 11, 2024 | Code Generation, Language Modeling | Unverified | 0 |
| Auctions with LLM Summaries | Apr 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |