| Knowledge Graph Generation From Text | Nov 18, 2022 | Graph GenerationJoint Entity and Relation Extraction | CodeCode Available | 1 |
| AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages | Nov 7, 2022 | Active LearningLanguage Modeling | CodeCode Available | 1 |
| Investigating Fairness Disparities in Peer Review: A Language Model Enhanced Approach | Nov 7, 2022 | FairnessLanguage Modeling | CodeCode Available | 1 |
| KGLM: Integrating Knowledge Graph Structure in Language Models for Link Prediction | Nov 4, 2022 | Fraud DetectionKnowledge Graph Completion | CodeCode Available | 1 |
| Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model | Nov 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively | Nov 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Contextual information integration for stance detection via cross-attention | Nov 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Fine-Tuning Language Models via Epistemic Neural Networks | Nov 3, 2022 | Active LearningLanguage Modeling | CodeCode Available | 1 |
| LMentry: A Language Model Benchmark of Elementary Language Tasks | Nov 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR)Language Modeling | CodeCode Available | 1 |
| T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 | Nov 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change | Oct 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control | Oct 31, 2022 | DiversityLanguage Modeling | CodeCode Available | 1 |
| L-GreCo: Layerwise-Adaptive Gradient Compression for Efficient and Accurate Deep Learning | Oct 31, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| Differentiable Data Augmentation for Contrastive Sentence Representation Learning | Oct 29, 2022 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| RoChBert: Towards Robust BERT Fine-tuning for Chinese | Oct 28, 2022 | Data AugmentationLanguage Modeling | CodeCode Available | 1 |
| Leveraging Label Correlations in a Multi-label Setting: A Case Study in Emotion | Oct 28, 2022 | Emotion RecognitionLanguage Modeling | CodeCode Available | 1 |
| COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Truncation Sampling as Language Model Desmoothing | Oct 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning | Oct 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Will we run out of data? Limits of LLM scaling based on human-generated data | Oct 26, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| N-gram Is Back: Residual Learning of Neural Text Generation with n-gram Language Model | Oct 26, 2022 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe | Oct 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A single-cell gene expression language model | Oct 25, 2022 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Help me write a poem: Instruction Tuning as a Vehicle for Collaborative Poetry Writing | Oct 25, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR Prediction | Oct 25, 2022 | AllClick-Through Rate Prediction | CodeCode Available | 1 |
| ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation | Oct 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Code4Struct: Code Generation for Few-Shot Event Structure Prediction | Oct 23, 2022 | Code GenerationEvent Argument Extraction | CodeCode Available | 1 |
| Language Model Pre-Training with Sparse Latent Typing | Oct 23, 2022 | Few-shot NERLanguage Modeling | CodeCode Available | 1 |
| Generative Prompt Tuning for Relation Classification | Oct 22, 2022 | ClassificationLanguage Modeling | CodeCode Available | 1 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 |
| Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long Sequences | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs | Oct 21, 2022 | Automated Theorem ProvingLanguage Modeling | CodeCode Available | 1 |
| Tele-Knowledge Pre-training for Fault Analysis | Oct 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Aspect Sentiment Quad Prediction via Template-Order Data Augmentation | Oct 19, 2022 | Aspect-Based Sentiment Analysis (ABSA)Data Augmentation | CodeCode Available | 1 |
| The Devil in Linear Transformer | Oct 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Continued Pretraining for Better Zero- and Few-Shot Promptability | Oct 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models | Oct 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis | Oct 18, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 |
| Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding | Oct 16, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Construction Repetition Reduces Information Rate in Dialogue | Oct 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Oct 14, 2022 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation | Oct 14, 2022 | FairnessLanguage Modeling | CodeCode Available | 1 |
| Extracting Cultural Commonsense Knowledge at Scale | Oct 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| M2D2: A Massively Multi-domain Language Modeling Dataset | Oct 13, 2022 | Domain AdaptationDomain Generalization | CodeCode Available | 1 |
| Language Model Decoding as Likelihood-Utility Alignment | Oct 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ImaginaryNet: Learning Object Detectors without Real Images and Annotations | Oct 13, 2022 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Oct 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |