| A Two-Stage Proactive Dialogue Generator for Efficient Clinical Information Collection Using Large Language Model | Oct 2, 2024 | Diagnostic, Dialogue Generation | Unverified | 0 |
| Generate then Refine: Data Augmentation for Zero-shot Intent Detection | Oct 2, 2024 | Data Augmentation, Diversity | Code Available | 0 |
| Racing Thoughts: Explaining Contextualization Errors in Large Language Models | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| LS-HAR: Language Supervised Human Action Recognition with Salient Fusion, Construction Sites as a Use-Case | Oct 2, 2024 | Action Recognition, Language Modeling | Unverified | 0 |
| LLM-Augmented Symbolic Reinforcement Learning with Landmark-Based Task Decomposition | Oct 2, 2024 | Common Sense Reasoning, Inductive logic programming | Unverified | 0 |
| FARM: Functional Group-Aware Representations for Small Molecules | Oct 2, 2024 | Contrastive Learning, Drug Discovery | Unverified | 0 |
| OCC-MLLM-Alpha: Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning | Oct 2, 2024 | 3D Generation, Language Modeling | Unverified | 0 |
| EMMA: Efficient Visual Alignment in Multi-Modal LLMs | Oct 2, 2024 | Language Modeling | Code Available | 1 |
| TypedThinker: Typed Thinking Improves Large Language Model Reasoning | Oct 2, 2024 | Language Modeling | Code Available | 0 |
| Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics | Oct 2, 2024 | Classification, Language Modeling | Code Available | 0 |
| Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| Automatic deductive coding in discourse analysis: an application of large language models in learning analytics | Oct 2, 2024 | Feature Engineering, Language Modeling | Code Available | 0 |
| Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Oct 2, 2024 | GPU, Language Modeling | Code Available | 1 |
| Agent-Driven Large Language Models for Mandarin Lyric Generation | Oct 2, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| Spoken Grammar Assessment Using LLM | Oct 2, 2024 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks | Oct 2, 2024 | Language Modeling | Code Available | 2 |
| Elaborative Subtopic Query Reformulation for Broad and Indirect Queries in Travel Destination Recommendation | Oct 2, 2024 | Language Modeling | Code Available | 0 |
| OCC-MLLM: Empowering Multimodal Large Language Model For the Understanding of Occluded Objects | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition | Oct 2, 2024 | Language Modeling | Code Available | 1 |
| ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving | Oct 2, 2024 | Benchmarking, Document Summarization | Unverified | 0 |
| Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Oct 2, 2024 | Language Modeling | Code Available | 0 |
| Investigating on RLHF methodology | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling | Oct 2, 2024 | Language Modeling | Code Available | 1 |
| From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models | Oct 2, 2024 | Language Modeling | Unverified | 0 |
| Rethinking Misalignment in Vision-Language Model Adaptation from a Causal Perspective | Oct 1, 2024 | Language Modeling | Unverified | 0 |
| End-to-End Speech Recognition with Pre-trained Masked Language Model | Oct 1, 2024 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| ERASMO: Leveraging Large Language Models for Enhanced Clustering Segmentation | Oct 1, 2024 | Clustering, Language Modeling | Code Available | 0 |
| Khattat: Enhancing Readability and Concept Representation of Semantic Typography | Oct 1, 2024 | Language Modeling | Unverified | 0 |
| Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model | Oct 1, 2024 | All, Language Modeling | Unverified | 0 |
| Optimizing Token Usage on Large Language Model Conversations Using the Design Structure Matrix | Oct 1, 2024 | Language Modeling | Unverified | 0 |
| PclGPT: A Large Language Model for Patronizing and Condescending Language Detection | Oct 1, 2024 | Language Modeling | Code Available | 0 |
| Quantifying reliance on external information over parametric knowledge during Retrieval Augmented Generation (RAG) using mechanistic analysis | Oct 1, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting | Oct 1, 2024 | Language Modeling | Code Available | 0 |
| Removing Distributional Discrepancies in Captions Improves Image-Text Alignment | Oct 1, 2024 | Language Modeling | Unverified | 0 |
| ScVLM: Enhancing Vision-Language Model for Safety-Critical Event Understanding | Oct 1, 2024 | Contrastive Learning, Hallucination | Code Available | 0 |
| Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation | Oct 1, 2024 | Descriptive, Inductive Bias | Unverified | 0 |
| Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces | Oct 1, 2024 | Language Modeling | Unverified | 0 |
| ReXplain: Translating Radiology into Patient-Friendly Video Reports | Oct 1, 2024 | Anatomy, Image Segmentation | Unverified | 0 |
| ViDAS: Vision-based Danger Assessment and Scoring | Oct 1, 2024 | Fixed Few Shot Prompting, Fixed Few Shot Prompting Danger Assessment | Unverified | 0 |
| Preserving Generalization of Language models in Few-shot Continual Relation Extraction | Oct 1, 2024 | Continual Relation Extraction, Language Modeling | Code Available | 0 |
| RisingBALLER: A player is a token, a match is a sentence, A path towards a foundational model for football players data analytics | Oct 1, 2024 | Language Modeling | Code Available | 1 |
| LASMP: Language Aided Subset Sampling Based Motion Planner | Oct 1, 2024 | Language Modeling | Code Available | 0 |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Oct 1, 2024 | Continual Learning, Language Modeling | Code Available | 1 |
| Exploring Empty Spaces: Human-in-the-Loop Data Augmentation | Oct 1, 2024 | Data Augmentation, Diversity | Code Available | 1 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPU, Language Modeling | Code Available | 3 |
| Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training | Oct 1, 2024 | Decoder, Language Modeling | Unverified | 0 |