Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing Jul 26, 2024 Attribute Language Modelling
Code Code Available 1Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation Jul 26, 2024 Diversity Language Modeling
Code Code Available 1Effective Large Language Model Debugging with Best-first Tree Search Jul 26, 2024 Code Generation Language Modeling
— Unverified 0A Role-specific Guided Large Language Model for Ophthalmic Consultation Based on Stylistic Differentiation Jul 26, 2024 Language Modeling Language Modelling
Code Code Available 0REAPER: Reasoning based Retrieval Planning for Complex RAG Systems Jul 26, 2024 Language Modelling Large Language Model
— Unverified 0Multi-turn Response Selection with Commonsense-enhanced Language Models Jul 26, 2024 Common Sense Reasoning Graph Neural Network
— Unverified 0Blockchain for Large Language Model Security and Safety: A Holistic Survey Jul 26, 2024 Data Poisoning Language Modeling
— Unverified 0MistralBSM: Leveraging Mistral-7B for Vehicular Networks Misbehavior Detection Jul 26, 2024 Cloud Detection Language Modeling
— Unverified 0Understanding the Interplay of Scale, Data, and Bias in Language Models: A Case Study with BERT Jul 25, 2024 Language Modeling Language Modelling
— Unverified 0Multi-group Uncertainty Quantification for Long-form Text Generation Jul 25, 2024 Conformal Prediction Form
— Unverified 0Large Language Model Integrated Healthcare Cyber-Physical Systems Architecture Jul 25, 2024 Decision Making Language Modeling
— Unverified 0Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic Jul 25, 2024 Image to text Language Modeling
— Unverified 0Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions Jul 25, 2024 Automatic Speech Recognition Decoder
— Unverified 0Cost-effective Instruction Learning for Pathology Vision and Language Analysis Jul 25, 2024 Few-Shot Learning Language Modelling
Code Code Available 1TwIPS: A Large Language Model Powered Texting Application to Simplify Conversational Nuances for Autistic Users Jul 25, 2024 Language Modeling Language Modelling
— Unverified 0Recursive Introspection: Teaching Language Model Agents How to Self-Improve Jul 25, 2024 Imitation Learning Language Modeling
— Unverified 0Examining the Influence of Political Bias on Large Language Model Performance in Stance Classification Jul 25, 2024 Classification Language Modeling
— Unverified 0Text-Driven Neural Collaborative Filtering Model for Paper Source Tracing Jul 25, 2024 Articles Collaborative Filtering
Code Code Available 0Unified Lexical Representation for Interpretable Visual-Language Alignment Jul 25, 2024 Cross-Modal Retrieval Language Modelling
Code Code Available 0Scaling Trends in Language Model Robustness Jul 25, 2024 Adversarial Robustness Language Modeling
Code Code Available 0Describe Where You Are: Improving Noise-Robustness for Speech Emotion Recognition with Text Description of the Environment Jul 25, 2024 Emotion Recognition Language Modeling
— Unverified 0Demystifying Verbatim Memorization in Large Language Models Jul 25, 2024 Language Modeling Language Modelling
Code Code Available 0Building a Domain-specific Guardrail Model in Production Jul 24, 2024 Benchmarking Language Modelling
— Unverified 0SimCT: A Simple Consistency Test Protocol in LLMs Development Lifecycle Jul 24, 2024 Language Modeling Language Modelling
— Unverified 0Exploring Domain Robust Lightweight Reward Models based on Router Mechanism Jul 24, 2024 Language Modeling Language Modelling
— Unverified 0Can Language Models Evaluate Human Written Text? Case Study on Korean Student Writing for Education Jul 24, 2024 Language Modeling Language Modelling
Code Code Available 0Label Alignment and Reassignment with Generalist Large Language Model for Enhanced Cross-Domain Named Entity Recognition Jul 24, 2024 Cross-Domain Named Entity Recognition Language Modeling
— Unverified 0Time Matters: Examine Temporal Effects on Biomedical Language Models Jul 24, 2024 Language Modeling Language Modelling
Code Code Available 0Train-Attention: Meta-Learning Where to Focus in Continual Knowledge Learning Jul 24, 2024 Language Modeling Language Modelling
Code Code Available 0ViPer: Visual Personalization of Generative Models via Individual Preference Learning Jul 24, 2024 Image Generation Language Modeling
— Unverified 0SDoH-GPT: Using Large Language Models to Extract Social Determinants of Health (SDoH) Jul 24, 2024 Computational Efficiency Language Modeling
— Unverified 0A Voter-Based Stochastic Rejection-Method Framework for Asymptotically Safe Language Model Outputs Jul 24, 2024 Language Modeling Language Modelling
— Unverified 0Towards Aligning Language Models with Textual Feedback Jul 24, 2024 Language Modeling Language Modelling
Code Code Available 1Early screening of potential breakthrough technologies with enhanced interpretability: A patent-specific hierarchical attention network model Jul 24, 2024 Interpretable Machine Learning Language Modeling
— Unverified 0Gradient-based inference of abstract task representations for generalization in neural networks Jul 24, 2024 Language Modelling Variational Inference
— Unverified 0MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models Jul 24, 2024 Language Modelling
Code Code Available 1Wonderful Matrices: More Efficient and Effective Architecture for Language Modeling Tasks Jul 24, 2024 Language Modeling Language Modelling
— Unverified 0A Comprehensive Approach to Misspelling Correction with BERT and Levenshtein Distance Jul 24, 2024 Language Modeling Language Modelling
— Unverified 0A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks Jul 24, 2024 Active Learning Domain Adaptation
— Unverified 0Towards Transfer Unlearning: Empirical Evidence of Cross-Domain Bias Mitigation Jul 24, 2024 Language Modeling Language Modelling
— Unverified 0Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models Jul 24, 2024 ARC Inductive Bias
Code Code Available 1DenseTrack: Drone-based Crowd Tracking via Density-aware Motion-appearance Synergy Jul 24, 2024 Crowd Counting Language Modeling
Code Code Available 0TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback Jul 23, 2024 Language Modeling Language Modelling
Code Code Available 1AMONGAGENTS: Evaluating Large Language Models in the Interactive Text-Based Social Deduction Game Jul 23, 2024 Language Modeling Language Modelling
Code Code Available 0How to Leverage Personal Textual Knowledge for Personalized Conversational Information Retrieval Jul 23, 2024 Information Retrieval Language Modeling
Code Code Available 0INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model Jul 23, 2024 Language Modeling Language Modelling
Code Code Available 1Do LLMs Know When to NOT Answer? Investigating Abstention Abilities of Large Language Models Jul 23, 2024 Language Modelling Large Language Model
— Unverified 0Quantifying the Role of Textual Predictability in Automatic Speech Recognition Jul 23, 2024 Attribute Automatic Speech Recognition
— Unverified 0Graph-Structured Speculative Decoding Jul 23, 2024 Language Modelling Small Language Model
— Unverified 0Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design Jul 23, 2024 Formal Logic Language Modelling
— Unverified 0