| Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation | Jan 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLIPTER: Looking at the Bigger Picture in Scene Text Recognition | Jan 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Prompting Large Language Model for Machine Translation: A Case Study | Jan 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Syllable Subword Tokens for Open Vocabulary Speech Recognition in Malayalam | Jan 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Ankh: Optimized Protein Language Model Unlocks General-Purpose Modelling | Jan 16, 2023 | DiversityLanguage Modeling | CodeCode Available | 0 |
| A Case Study in Engineering a Conversational Programming Assistant's Persona | Jan 13, 2023 | ChatbotLanguage Modeling | —Unverified | 0 |
| In BLOOM: Creativity and Affinity in Artificial Lyrics and Art | Jan 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution | Jan 12, 2023 | Entity ResolutionGeneral Knowledge | —Unverified | 0 |
| A Cohesive Distillation Architecture for Neural Language Models | Jan 12, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Topics in Contextualised Attention Embeddings | Jan 11, 2023 | ClusteringLanguage Modeling | —Unverified | 0 |
| NarrowBERT: Accelerating Masked Language Model Pretraining and Inference | Jan 11, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Memory Augmented Large Language Models are Computationally Universal | Jan 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chatbots in a Honeypot World | Jan 10, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching | Jan 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generative Antibody Design for Complementary Chain Pairing Sequences through Encoder-Decoder Language Model | Jan 6, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Towards Table-to-Text Generation with Pretrained Language Model: A Table Structure Understanding and Text Deliberating Approach | Jan 5, 2023 | DecoderDescriptive | CodeCode Available | 0 |
| PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora | Jan 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ClusTop: An unsupervised and integrated text clustering and topic extraction framework | Jan 3, 2023 | ClusteringDimensionality Reduction | —Unverified | 0 |
| Understanding Political Polarisation using Language Models: A dataset and method | Jan 2, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Open-Category Human-Object Interaction Pre-Training via Language Modeling Framework | Jan 1, 2023 | Human-Object Interaction DetectionLanguage Modeling | —Unverified | 0 |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Jan 1, 2023 | Cross-Modal RetrievalImage Captioning | —Unverified | 0 |
| Logic Mill -- A Knowledge Navigation System | Dec 31, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports | Dec 30, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Black-box language model explanation by context length probing | Dec 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition | Dec 30, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Complex Knowledge Base Question Answering via Question-to-Action and Question-to-Question Alignment | Dec 26, 2022 | Knowledge Base Question AnsweringLanguage Modeling | CodeCode Available | 0 |
| HMM-based data augmentation for E2E systems for building conversational speech synthesis systems | Dec 22, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise | Dec 22, 2022 | DecoderDenoising | —Unverified | 0 |
| ImPaKT: A Dataset for Open-Schema Knowledge Base Construction | Dec 21, 2022 | AttributeKnowledge Base Construction | —Unverified | 0 |
| Crowd Score: A Method for the Evaluation of Jokes using Large Language Model AI Voters as Judges | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What do LLMs Know about Financial Markets? A Case Study on Reddit Market Sentiment Analysis | Dec 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning | Dec 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models | Dec 21, 2022 | Extractive Question-AnsweringLanguage Modeling | —Unverified | 0 |
| Prompt-Augmented Linear Probing: Scaling beyond the Limit of Few-shot In-Context Learners | Dec 21, 2022 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| SERENGETI: Massively Multilingual Language Models for Africa | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Resolving Indirect Referring Expressions for Entity Selection | Dec 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Parameter-efficient Zero-shot Transfer for Cross-Language Dense Retrieval with Adapters | Dec 20, 2022 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Language Modeling with Latent Situations | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| In-context Learning Distillation: Transferring Few-shot Learning Ability of Pre-trained Language Models | Dec 20, 2022 | Few-Shot LearningIn-Context Learning | —Unverified | 0 |
| Controllable Text Generation with Language Constraints | Dec 20, 2022 | AttributeLanguage Modeling | —Unverified | 0 |
| EIT: Enhanced Interactive Transformer | Dec 20, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| Identifying and Manipulating the Personality Traits of Language Models | Dec 20, 2022 | DiagnosticLanguage Modeling | —Unverified | 0 |
| KronA: Parameter Efficient Tuning with Kronecker Adapter | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Measure-Theoretic Characterization of Tight Language Models | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Is GPT-3 a Good Data Annotator? | Dec 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AnyTOD: A Programmable Task-Oriented Dialog System | Dec 20, 2022 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Can Current Task-oriented Dialogue Models Automate Real-world Scenarios in the Wild? | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improved Long-Form Spoken Language Translation with Large Language Models | Dec 19, 2022 | FormLanguage Modeling | —Unverified | 0 |