Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Nov 15, 2023 | Constituency Parsing, Knowledge Distillation | Code Available
Toucan: Token-Aware Character Level Language Modeling | Nov 15, 2023 | Language Modeling, Language Modelling | Unverified
Improving Deep Learning Optimization through Constrained Parameter Regularization | Nov 15, 2023 | Deep Learning, Image Classification | Code Available
PsyEval: A Suite of Mental Health Related Tasks for Evaluating Large Language Models | Nov 15, 2023 | Language Modelling, Large Language Model | Code Available
Pearl: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers | Nov 15, 2023 | Language Modeling, Language Modelling | Unverified
Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder | Nov 15, 2023 | Decoder, Image Captioning | Unverified
User Persona Identification and New Service Adaptation Recommendation | Nov 15, 2023 | Collaborative Filtering, Language Modeling | Unverified
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects | Nov 15, 2023 | Dialogue Generation, Language Modelling | Unverified
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages | Nov 15, 2023 | Language Modeling, Language Modelling | Code Available
Summarization-Based Document IDs for Generative Retrieval with Language Models | Nov 14, 2023 | Articles, Language Modeling | Code Available
Automated title and abstract screening for scoping reviews using the GPT-4 Large Language Model | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios | Nov 14, 2023 | All, Decoder | Unverified
Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders | Nov 14, 2023 | Language Modelling, Multi-Task Learning | Code Available
How good are Large Language Models on African Languages? | Nov 14, 2023 | In-Context Learning, Language Modelling | Unverified
A Survey of Confidence Estimation and Calibration in Large Language Models | Nov 14, 2023 | Language Modelling | Unverified
Anti-LM Decoding for Zero-shot In-context Machine Translation | Nov 14, 2023 | In-Context Learning, Language Modeling | Code Available
Large Language Model-Driven Classroom Flipping: Empowering Student-Centric Peer Questioning with Flipped Interaction | Nov 14, 2023 | Chatbot, Language Modeling | Unverified
Semi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model Reasoning | Nov 14, 2023 | Knowledge Graphs, Language Modeling | Unverified
Memory-efficient Stochastic methods for Memory-based Transformers | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available
Text Retrieval with Multi-Stage Re-Ranking Models | Nov 14, 2023 | Language Modeling, Language Modelling | Code Available
On the Discussion of Large Language Models: Symmetry of Agents and Interplay with Prompts | Nov 13, 2023 | Language Modeling, Language Modelling | Unverified
To Tell The Truth: Language of Deception and Language Models | Nov 13, 2023 | Language Modeling, Language Modelling | Code Available
Language Model-In-The-Loop: Data Optimal Approach to Learn-To-Recommend Actions in Text Games | Nov 13, 2023 | Language Modeling, Language Modelling | Unverified
On Elastic Language Models | Nov 13, 2023 | Information Retrieval, Knowledge Distillation | Unverified
On The Truthfulness of 'Surprisingly Likely' Responses of Large Language Models | Nov 13, 2023 | Language Modeling, Language Modelling | Unverified
Teach me with a Whisper: Enhancing Large Language Models for Analyzing Spoken Transcripts using Speech Embeddings | Nov 13, 2023 | Knowledge Distillation, Language Modeling | Unverified
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 | Nov 13, 2023 | Computational chemistry, Drug Discovery | Unverified
Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval | Nov 13, 2023 | Contrastive Learning, Image Retrieval | Code Available
Reducing the Need for Backpropagation and Discovering Better Optima With Explicit Optimizations of Neural Networks | Nov 13, 2023 | Language Modelling | Unverified
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models | Nov 13, 2023 | counterfactual, Language Modeling | Unverified
Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions | Nov 13, 2023 | Classification, Language Modeling | Unverified
Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning | Nov 13, 2023 | In-Context Learning, Language Modeling | Unverified
Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor | Nov 13, 2023 | Language Modeling, Language Modelling | Code Available
In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search | Nov 13, 2023 | Language Modelling, Natural Language Inference | Code Available
AuthentiGPT: Detecting Machine-Generated Text via Black-Box Language Models Denoising | Nov 13, 2023 | Denoising, Language Modeling | Unverified
Activity Sparsity Complements Weight Sparsity for Efficient RNN Inference | Nov 13, 2023 | Deep Learning, Language Modeling | Unverified
Detecting and Correcting Hate Speech in Multimodal Memes with Large Visual Language Model | Nov 12, 2023 | Language Modeling, Language Modelling | Unverified
Are LLMs Rigorous Logical Reasoner? Empowering Natural Language Proof Generation with Contrastive Stepwise Decoding | Nov 12, 2023 | Language Modeling, Language Modelling | Unverified
GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect | Nov 12, 2023 | Event Extraction, Language Modeling | Unverified
From Complex to Simple: Unraveling the Cognitive Tree for Reasoning with Small Language Models | Nov 12, 2023 | Language Modelling, Logical Reasoning | Unverified
Tunable Soft Prompts are Messengers in Federated Learning | Nov 12, 2023 | Federated Learning, Language Modelling | Unverified
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System | Nov 11, 2023 | Decision Making, Language Modelling | Unverified
Separating the Wheat from the Chaff with BREAD: An open-source benchmark and metrics to detect redundancy in text | Nov 11, 2023 | Language Modeling, Language Modelling | Code Available
L3 Ensembles: Lifelong Learning Approach for Ensemble of Foundational Language Models | Nov 11, 2023 | Language Modeling, Language Modelling | Unverified
Intentional Biases in LLM Responses | Nov 11, 2023 | Language Modeling, Language Modelling | Unverified
Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study | Nov 11, 2023 | Cross-Lingual Sentiment Classification, Cross-Lingual Transfer | Unverified
Language Models can be Logical Solvers | Nov 10, 2023 | Decision Making, Language Modeling | Unverified
Model-as-a-Service (MaaS): A Survey | Nov 10, 2023 | Cloud Computing, Language Modelling | Unverified
Schema Graph-Guided Prompt for Multi-Domain Dialogue State Tracking | Nov 10, 2023 | Dialogue State Tracking, Graph Neural Network | Unverified
Autoregressive Language Models For Estimating the Entropy of Epic EHR Audit Logs | Nov 10, 2023 | Language Modeling, Language Modelling | Code Available