Morphological evaluation of subwords vocabulary used by BETO language model Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0Neutral residues: revisiting adapters for model extension Oct 3, 2024 Domain Adaptation Language Modelling
— Unverified 0Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration Oct 3, 2024 Diversity Language Modeling
Code Code Available 4Learning the Latent Rules of a Game from Data: A Chess Story Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0FAN: Fourier Analysis Networks Oct 3, 2024 Language Modeling Language Modelling
Code Code Available 3NL-Eye: Abductive NLI for Images Oct 3, 2024 Language Modeling Language Modelling
— Unverified 0Leveraging Large Language Models to Enhance Personalized Recommendations in E-commerce Oct 2, 2024 Diversity Language Modeling
— Unverified 0Long-range gene expression prediction with token alignment of large language model Oct 2, 2024 In-Context Learning Language Modeling
— Unverified 0Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1A Two-Stage Proactive Dialogue Generator for Efficient Clinical Information Collection Using Large Language Model Oct 2, 2024 Diagnostic Dialogue Generation
— Unverified 0EMMA: Efficient Visual Alignment in Multi-Modal LLMs Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0LLM-Augmented Symbolic Reinforcement Learning with Landmark-Based Task Decomposition Oct 2, 2024 Common Sense Reasoning Inductive logic programming
— Unverified 0Generate then Refine: Data Augmentation for Zero-shot Intent Detection Oct 2, 2024 Data Augmentation Diversity
Code Code Available 0Racing Thoughts: Explaining Contextualization Errors in Large Language Models Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0FARM: Functional Group-Aware Representations for Small Molecules Oct 2, 2024 Contrastive Learning Drug Discovery
— Unverified 0LS-HAR: Language Supervised Human Action Recognition with Salient Fusion, Construction Sites as a Use-Case Oct 2, 2024 Action Recognition Language Modeling
— Unverified 0OCC-MLLM-Alpha:Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning Oct 2, 2024 3D Generation Language Modeling
— Unverified 0TypedThinker: Typed Thinking Improves Large Language Model Reasoning Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 0SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics Oct 2, 2024 Classification Language Modeling
Code Code Available 0Automatic deductive coding in discourse analysis: an application of large language models in learning analytics Oct 2, 2024 Feature Engineering Language Modeling
Code Code Available 0Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices Oct 2, 2024 GPU Language Modeling
Code Code Available 1Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 2ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving Oct 2, 2024 Benchmarking Document Summarization
— Unverified 0Unveiling Language Skills via Path-Level Circuit Discovery Oct 2, 2024 Disentanglement In-Context Learning
Code Code Available 0Elaborative Subtopic Query Reformulation for Broad and Indirect Queries in Travel Destination Recommendation Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 0Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 0Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling Oct 2, 2024 Language Modeling Language Modelling
Code Code Available 1Spoken Grammar Assessment Using LLM Oct 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Circuit Compositions: Exploring Modular Structures in Transformer-Based Language Models Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1 Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0OCC-MLLM:Empowering Multimodal Large Language Model For the Understanding of Occluded Objects Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Agent-Driven Large Language Models for Mandarin Lyric Generation Oct 2, 2024 In-Context Learning Language Modeling
— Unverified 0Integrating Protein Sequence and Expression Level to Analysis Molecular Characterization of Breast Cancer Subtypes Oct 2, 2024 Clustering Feature Importance
— Unverified 0Investigating on RLHF methodology Oct 2, 2024 Language Modeling Language Modelling
— Unverified 0Rethinking Misalignment in Vision-Language Model Adaptation from a Causal Perspective Oct 1, 2024 Language Modeling Language Modelling
— Unverified 0End-to-End Speech Recognition with Pre-trained Masked Language Model Oct 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ERASMO: Leveraging Large Language Models for Enhanced Clustering Segmentation Oct 1, 2024 Clustering Language Modeling
Code Code Available 0Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model Oct 1, 2024 All Language Modeling
— Unverified 0Khattat: Enhancing Readability and Concept Representation of Semantic Typography Oct 1, 2024 Language Modeling Language Modelling
— Unverified 0PclGPT: A Large Language Model for Patronizing and Condescending Language Detection Oct 1, 2024 Language Modeling Language Modelling
Code Code Available 0Optimizing Token Usage on Large Language Model Conversations Using the Design Structure Matrix Oct 1, 2024 Language Modeling Language Modelling
— Unverified 0Quantifying reliance on external information over parametric knowledge during Retrieval Augmented Generation (RAG) using mechanistic analysis Oct 1, 2024 Information Retrieval Language Modeling
— Unverified 0Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting Oct 1, 2024 Language Modeling Language Modelling
Code Code Available 0Removing Distributional Discrepancies in Captions Improves Image-Text Alignment Oct 1, 2024 Language Modeling Language Modelling
— Unverified 0ReXplain: Translating Radiology into Patient-Friendly Video Reports Oct 1, 2024 Anatomy Image Segmentation
— Unverified 0