| Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs | Dec 2, 2024 | AllLanguage Modeling | CodeCode Available | 2 |
| Improved Large Language Model Jailbreak Detection via Pretrained Embeddings | Dec 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unlocking Video-LLM via Agent-of-Thoughts Distillation | Dec 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model | Dec 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PLD+: Accelerating LLM inference by leveraging Language Model Artifacts | Dec 2, 2024 | AttributeAvg | —Unverified | 0 |
| Advancing Speech Language Models by Scaling Supervised Fine-Tuning with Over 60,000 Hours of Synthetic Speech Dialogue Data | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing | Dec 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| FD-LLM: Large Language Model for Fault Diagnosis of Machines | Dec 2, 2024 | Fault DetectionFault Diagnosis | —Unverified | 0 |
| Enhancing Perception Capabilities of Multimodal LLMs with Training-Free Fusion | Dec 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WAFFLE: Multimodal Floorplan Understanding in the Wild | Dec 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Free and Customizable Code Documentation with LLMs: A Fine-Tuning Approach | Dec 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ARChef: An iOS-Based Augmented Reality Cooking Assistant Powered by Multimodal Gemini LLM | Dec 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLaMA-Gene: A General-purpose Gene Task Large Language Model Based on Instruction Fine-tuning | Nov 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI | Nov 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model | Nov 29, 2024 | Autonomous VehiclesLanguage Modeling | —Unverified | 0 |
| MoTe: Learning Motion-Text Diffusion Model for Multiple Generation Tasks | Nov 29, 2024 | DecoderDenoising | —Unverified | 0 |
| Enhancing Sentiment Analysis in Bengali Texts: A Hybrid Approach Using Lexicon-Based Algorithm and Pretrained Language Model Bangla-BERT | Nov 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KV Shifting Attention Enhances Language Modeling | Nov 29, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| CovidLLM: A Robust Large Language Model with Missing Value Adaptation and Multi-Objective Learning Strategy for Predicting Disease Severity and Clinical Outcomes in COVID-19 Patients | Nov 28, 2024 | ImputationLanguage Modeling | CodeCode Available | 0 |
| Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads | Nov 28, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Extracting Information in a Low-resource Setting: Case Study on Bioinformatics Workflows | Nov 28, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models | Nov 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection | Nov 28, 2024 | Anomaly DetectionImage-text matching | —Unverified | 0 |
| Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | Nov 28, 2024 | GPUKnowledge Distillation | —Unverified | 0 |
| Rephrasing Electronic Health Records for Pretraining Clinical Language Models | Nov 28, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease | Nov 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An AI-driven multimodal smart home platform for continuous monitoring and intelligent assistance in post-stroke patients | Nov 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Marconi: Prefix Caching for the Era of Hybrid LLMs | Nov 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EzSQL: An SQL intermediate representation for improving SQL-to-text Generation | Nov 28, 2024 | Graph-to-SequenceLanguage Modeling | —Unverified | 0 |
| Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising | Nov 28, 2024 | DenoisingLanguage Modeling | —Unverified | 0 |
| Human Evaluation of Procedural Knowledge Graph Extraction from Text with Large Language Models | Nov 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mixture of Cache-Conditional Experts for Efficient Mobile Device Inference | Nov 27, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models | Nov 27, 2024 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Verbalized Representation Learning for Interpretable Few-Shot Generalization | Nov 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving | Nov 27, 2024 | FairnessGPU | CodeCode Available | 7 |
| SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment | Nov 27, 2024 | ClassificationDecision Making | —Unverified | 0 |
| CoVis: A Collaborative Framework for Fine-grained Graphic Visual Understanding | Nov 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Aligning Knowledge Concepts to Whole Slide Images for Precise Histopathology Image Analysis | Nov 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| JPPO: Joint Power and Prompt Optimization for Accelerated Large Language Model Services | Nov 27, 2024 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| NewsEdits 2.0: Learning the Intentions Behind Updating News | Nov 27, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| Can bidirectional encoder become the ultimate winner for downstream applications of foundation models? | Nov 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students | Nov 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Wearable intelligent throat enables natural speech in stroke patients with dysarthria | Nov 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| R-MTLLMF: Resilient Multi-Task Large Language Model Fusion at the Wireless Edge | Nov 27, 2024 | DisentanglementLanguage Modeling | —Unverified | 0 |
| Automated Literature Review Using NLP Techniques and LLM-Based Retrieval-Augmented Generation | Nov 27, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis | Nov 27, 2024 | Human-Object Interaction DetectionImage-text matching | —Unverified | 0 |
| Diffusion Self-Distillation for Zero-Shot Customized Image Generation | Nov 27, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Large Language Model-Brained GUI Agents: A Survey | Nov 27, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 3 |
| The Context of Crash Occurrence: A Complexity-Infused Approach Integrating Semantic, Contextual, and Kinematic Features | Nov 26, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |