| Induced Model Matching: Restricted Models Help Train Full-Featured Models | Jan 15, 2025 | Knowledge DistillationLanguage Modeling | CodeCode Available | 0 |
| WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher Learning | Jan 15, 2025 | cross-modal alignmentLanguage Modeling | CodeCode Available | 1 |
| LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model | Jan 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design | Jan 15, 2025 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 2 |
| Leveraging LLM Agents for Translating Network Configurations | Jan 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities | Jan 15, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment | Jan 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CityLoc: 6DoF Pose Distributional Localization for Text Descriptions in Large-Scale Scenes with Gaussian Representation | Jan 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Applying General Turn-taking Models to Conversational Human-Robot Interaction | Jan 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models For Text Classification: Case Study And Comprehensive Review | Jan 14, 2025 | ArticlesBinary Classification | —Unverified | 0 |
| Gandalf the Red: Adaptive Security for LLMs | Jan 14, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 |
| Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT | Jan 14, 2025 | ClusteringDimensionality Reduction | —Unverified | 0 |
| In-situ graph reasoning and knowledge expansion using Graph-PReFLexOR | Jan 14, 2025 | Knowledge GraphsLanguage Modeling | CodeCode Available | 3 |
| Large Language Model Interface for Home Energy Management Systems | Jan 14, 2025 | energy managementLanguage Modeling | —Unverified | 0 |
| Hierarchical Autoscaling for Large Language Model Serving with Chiron | Jan 14, 2025 | GPULanguage Modeling | —Unverified | 0 |
| A Driver Advisory System Based on Large Language Model for High-speed Train | Jan 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations | Jan 14, 2025 | Alzheimer's DetectionBinary Classification | —Unverified | 0 |
| Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks | Jan 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding | Jan 14, 2025 | Embodied Question AnsweringHallucination | CodeCode Available | 4 |
| LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding | Jan 14, 2025 | Feature CompressionLanguage Modeling | CodeCode Available | 2 |
| Real-time Verification and Refinement of Language Model Text Generation | Jan 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Jan 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch | Jan 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLMic: Romanian Foundation Language Model | Jan 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-megabase scale genome interpretation with genetic language models | Jan 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model | Jan 13, 2025 | Audio captioningInstruction Following | —Unverified | 0 |
| Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing | Jan 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Lifelong Learning of Large Language Model based Agents: A Roadmap | Jan 13, 2025 | Incremental LearningLanguage Modeling | CodeCode Available | 3 |
| Enhancing Image Generation Fidelity via Progressive Prompts | Jan 13, 2025 | DiversityImage Generation | CodeCode Available | 0 |
| TiEBe: Tracking Language Model Recall of Notable Worldwide Events Through Time | Jan 13, 2025 | Continual LearningLanguage Modeling | CodeCode Available | 0 |
| TempoGPT: Enhancing Temporal Reasoning via Quantizing Embedding | Jan 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| A Proposed Large Language Model-Based Smart Search for Archive System | Jan 13, 2025 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Integrating Pause Information with Word Embeddings in Language Models for Alzheimer's Disease Detection from Spontaneous Speech | Jan 12, 2025 | Alzheimer's Disease DetectionLanguage Modeling | —Unverified | 0 |
| Better Prompt Compression Without Multi-Layer Perceptrons | Jan 12, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Modeling Neural Networks with Privacy Using Neural Stochastic Differential Equations | Jan 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving | Jan 12, 2025 | Autonomous DrivingDecision Making | —Unverified | 0 |
| An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering | Jan 12, 2025 | Few-Shot LearningIn-Context Learning | —Unverified | 0 |
| GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing | Jan 12, 2025 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT | Jan 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators | Jan 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models | Jan 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Magnitude of Categories of Texts Enriched by Language Models | Jan 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification | Jan 11, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation | Jan 11, 2025 | Chart UnderstandingCode Generation | CodeCode Available | 2 |
| AlgoPilot: Fully Autonomous Program Synthesis Without Human-Written Programs | Jan 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tensor Product Attention Is All You Need | Jan 11, 2025 | AllLanguage Modeling | CodeCode Available | 0 |
| Environmental large language model Evaluation (ELLE) dataset: A Benchmark for Evaluating Generative AI applications in Eco-environment Domain | Jan 10, 2025 | Language Model EvaluationLanguage Modeling | CodeCode Available | 0 |
| MinMo: A Multimodal Large Language Model for Seamless Voice Interaction | Jan 10, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Scalable Vision Language Model Training via High Quality Data Curation | Jan 10, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |