How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench | May 24, 2023 | Diversity, Language Modeling | Code Available | 0
Dynamic Masking Rate Schedules for MLM Pretraining | May 24, 2023 | Language Modeling, Language Modelling | Unverified | 0
Estimating class separability of text embeddings with persistent homology | May 24, 2023 | Language Modelling, Multi Class Text Classification | Unverified | 0
An Efficient Multilingual Language Model Compression through Vocabulary Trimming | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 1
C-STS: Conditional Semantic Textual Similarity | May 24, 2023 | Information Retrieval, Language Model Evaluation | Code Available | 1
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems | May 24, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 0
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model | May 24, 2023 | All, Language Modeling | Code Available | 0
Trade-Offs Between Fairness and Privacy in Language Modeling | May 24, 2023 | Bias Detection, Fairness | Code Available | 0
Textless Speech-to-Speech Translation With Limited Parallel Data | May 24, 2023 | Automatic Speech Recognition, Denoising | Code Available | 0
Focus Your Attention (with Adaptive IIR Filters) | May 24, 2023 | Language Modelling, Long-range modeling | Unverified | 0
Mitigating Test-Time Bias for Fair Image Retrieval | May 23, 2023 | Image Retrieval, Language Modeling | Code Available | 0
QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | Chatbot, GPU | Code Available | 6
Regex-augmented Domain Transfer Topic Classification based on a Pre-trained Language Model: An application in Financial Domain | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0
Natural Language Decompositions of Implicit Content Enable Better Text Representations | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 0
RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning | May 23, 2023 | In-Context Learning, Language Modelling | Code Available | 1
Language Model Self-improvement by Reinforcement Learning Contemplation | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0
Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings | May 23, 2023 | Active Learning, Language Modeling | Code Available | 0
MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems | May 23, 2023 | Language Modelling, Large Language Model | Code Available | 1
On Robustness of Finetuned Transformer-based NLP Models | May 23, 2023 | Decoder, Language Modelling | Code Available | 0
From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0
Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding | May 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Cascaded Beam Search: Plug-and-Play Terminology-Forcing For Neural Machine Translation | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0
FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions | May 23, 2023 | Data Augmentation, Language Modeling | Code Available | 0
Acquiring Frame Element Knowledge with Deep Metric Learning for Semantic Frame Induction | May 23, 2023 | Clustering, Language Modeling | Unverified | 0
Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks | May 23, 2023 | Few-Shot Learning, Language Modeling | Code Available | 0
Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models | May 23, 2023 | Data Poisoning, Language Modelling | Code Available | 0
Domain Private Transformers for Multi-Domain Dialog Systems | May 23, 2023 | domain classification, Language Modeling | Code Available | 0
Goal-Driven Explainable Clustering via Language Descriptions | May 23, 2023 | Clustering, Language Modelling | Code Available | 1
Error Detection for Text-to-SQL Semantic Parsing | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 0
ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings | May 23, 2023 | Community Detection, Contrastive Learning | Code Available | 1
AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 0
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization | May 23, 2023 | Informativeness, Language Modelling | Code Available | 0
Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation | May 23, 2023 | Data Augmentation, Few-Shot Text Classification | Unverified | 0
Aligning Large Language Models through Synthetic Feedback | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | Decoder, Language Modeling | Code Available | 1
When your Cousin has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages | May 23, 2023 | Bilingual Lexicon Induction, Language Modeling | Code Available | 0
Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker | May 23, 2023 | Information Retrieval, Language Modeling | Code Available | 0
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models | May 23, 2023 | All, Fairness | Unverified | 0
GenSpectrum Chat: Data Exploration in Public Health Using Large Language Models | May 23, 2023 | Chatbot, Language Modelling | Unverified | 0
Query Rewriting for Retrieval-Augmented Large Language Models | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0
Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event Detection, Language Modeling | Code Available | 0
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 2
Learning from Mistakes via Cooperative Study Assistant for Large Language Models | May 23, 2023 | Imitation Learning, Language Modeling | Code Available | 0
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests | May 23, 2023 | Benchmarking, Language Modeling | Unverified | 0
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language Models | May 23, 2023 | Hallucination, Language Modeling | Code Available | 0
Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings | May 23, 2023 | Language Modeling, Language Modelling | Unverified | 0
Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model | May 23, 2023 | Avg, Language Modeling | Unverified | 0
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models | May 23, 2023 | Common Sense Reasoning, Image Generation | Code Available | 2