| Self-Emotion Blended Dialogue Generation in Social Simulation Agents | Aug 3, 2024 | Decision MakingDialogue Generation | —Unverified | 0 |
| Task Prompt Vectors: Effective Initialization through Multi-Task Soft-Prompt Transfer | Aug 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Six Dragons Fly Again: Reviving 15th-Century Korean Court Music with Transformers and Novel Encoding | Aug 2, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines | Aug 2, 2024 | GPUHyperparameter Optimization | —Unverified | 0 |
| Large Language Model (LLM)-enabled In-context Learning for Wireless Network Optimization: A Case Study of Power Control | Aug 1, 2024 | Deep Reinforcement LearningIn-Context Learning | —Unverified | 0 |
| Leveraging Large Language Models (LLMs) for Traffic Management at Urban Intersections: The Case of Mixed Traffic Scenarios | Aug 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UniMoT: Unified Molecule-Text Language Model with Discrete Token Representation | Aug 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Aspect Reviewed-Item Retrieval via LLM Query Decomposition and Aspect Fusion | Aug 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Aug 1, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model | Aug 1, 2024 | EgoSchemaLanguage Modeling | —Unverified | 0 |
| Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning | Aug 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ExpertAF: Expert Actionable Feedback from Video | Aug 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | Aug 1, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Are Bigger Encoders Always Better in Vision Large Models? | Aug 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DeliLaw: A Chinese Legal Counselling System Based on a Large Language Model | Aug 1, 2024 | ArticlesHallucination | CodeCode Available | 2 |
| ABC Align: Large Language Model Alignment for Safety & Accuracy | Aug 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Intelligent Transformation: General Intelligence Theory | Jul 31, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment | Jul 31, 2024 | Contrastive LearningDecoder | —Unverified | 0 |
| Finch: Prompt-guided Key-Value Cache Compression | Jul 31, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models | Jul 31, 2024 | Dictionary LearningLanguage Modeling | CodeCode Available | 1 |
| Learning Video Context as Interleaved Multimodal Sequences | Jul 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving | Jul 31, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| Interpreting and learning voice commands with a Large Language Model for a robot system | Jul 31, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| The Llama 3 Herd of Models | Jul 31, 2024 | answerability predictionLanguage Modeling | CodeCode Available | 4 |
| KemenkeuGPT: Leveraging a Large Language Model on Indonesia's Government Financial Data and Regulations to Enhance Decision Making | Jul 31, 2024 | BenchmarkingDecision Making | —Unverified | 0 |
| Learning Effective Representations for Retrieval Using Self-Distillation with Adaptive Relevance Margins | Jul 31, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Decomposed Prompting to Answer Questions on a Course Discussion Board | Jul 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Entropy, Thermodynamics and the Geometrization of the Language Model | Jul 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A federated large language model for long-term time series forecasting | Jul 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Label-Guided Prompt for Multi-label Few-shot Aspect Category Detection | Jul 30, 2024 | Aspect Category DetectionLanguage Modeling | —Unverified | 0 |
| Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | Jul 30, 2024 | Efficient ExplorationLanguage Modeling | —Unverified | 0 |
| Faithful and Plausible Natural Language Explanations for Image Classification: A Pipeline Approach | Jul 30, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| Harvesting Textual and Structured Data from the HAL Publication Repository | Jul 30, 2024 | ArticlesAuthorship Attribution | —Unverified | 0 |
| Industrial-Grade Smart Troubleshooting through Causal Technical Language Processing: a Proof of Concept | Jul 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knesset-DictaBERT: A Hebrew Language Model for Parliamentary Proceedings | Jul 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Meltemi: The first open Large Language Model for Greek | Jul 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning | Jul 30, 2024 | Contrastive LearningDiagnostic | CodeCode Available | 1 |
| Accelerating Large Language Model Inference with Self-Supervised Early Exits | Jul 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks | Jul 29, 2024 | BenchmarkingLanguage Model Evaluation | —Unverified | 0 |
| Gender, Race, and Intersectional Bias in Resume Screening via Language Model Retrieval | Jul 29, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| Apple Intelligence Foundation Language Models | Jul 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Retrieval Augmented Language Model with Self-Reasoning | Jul 29, 2024 | Fact VerificationLanguage Modeling | —Unverified | 0 |
| OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Jul 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing | Jul 29, 2024 | DenoisingDiversity | CodeCode Available | 0 |
| Normality Addition via Normality Detection in Industrial Image Anomaly Detection Models | Jul 29, 2024 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Prometheus Chatbot: Knowledge Graph Collaborative Large Language Model for Computer Components Recommendation | Jul 29, 2024 | ChatbotKnowledge Graphs | CodeCode Available | 0 |
| VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks | Jul 29, 2024 | Deep LearningDomain Generalization | —Unverified | 0 |
| ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2 | Jul 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Bayesian Flow Network Framework for Chemistry Tasks | Jul 28, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training | Jul 28, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |