| Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification | Dec 8, 2023 | ClassificationFew-Shot Text Classification | CodeCode Available | 0 |
| LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models | Dec 1, 2023 | image-classificationImage Classification | —Unverified | 0 |
| BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling | Nov 28, 2023 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| User Persona Identification and New Service Adaptation Recommendation | Nov 15, 2023 | Collaborative FilteringLanguage Modeling | —Unverified | 0 |
| CLIMB: Curriculum Learning for Infant-inspired Model Building | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision | Nov 4, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text | Oct 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Counterfactually Probing Language Identity in Multilingual Models | Oct 29, 2023 | counterfactualLanguage Modeling | CodeCode Available | 0 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Oct 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding | Oct 23, 2023 | ArticlesContrastive Learning | CodeCode Available | 1 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| FATA-Trans: Field And Time-Aware Transformer for Sequential Tabular Data | Oct 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FiLM: Fill-in Language Models for Any-Order Generation | Oct 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enhancing BERT-Based Visual Question Answering through Keyword-Driven Sentence Selection | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction | Oct 13, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| PepMLM: Target Sequence-Conditioned Generation of Therapeutic Peptide Binders via Span Masked Language Modeling | Oct 5, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Structural Self-Supervised Objectives for Transformers | Sep 15, 2023 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| PerPLM: Personalized Fine-tuning of Pretrained Language Models via Writer-specific Intermediate Learning and Prompts | Sep 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification | Sep 9, 2023 | Image to textLanguage Modeling | —Unverified | 0 |
| ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation | Aug 31, 2023 | Image-text matchingLanguage Modeling | —Unverified | 0 |
| Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection | Aug 30, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| A Multi-Task Semantic Decomposition Framework with Task-specific Pre-training for Few-Shot NER | Aug 28, 2023 | Contrastive Learningfew-shot-ner | CodeCode Available | 1 |
| Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning | Aug 23, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval | Aug 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Latent State Models of Training Dynamics | Aug 18, 2023 | image-classificationImage Classification | CodeCode Available | 0 |
| RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models | Aug 15, 2023 | DecoderIn-Context Learning | CodeCode Available | 0 |
| Pairing interacting protein sequences using masked language modeling | Aug 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Stochastic positional embeddings improve masked image modeling | Jul 31, 2023 | Language ModellingMasked Language Modeling | CodeCode Available | 1 |
| GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning | Jul 29, 2023 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving the Reusability of Pre-trained Language Models in Real-world Applications | Jul 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving BERT with Hybrid Pooling Network and Drop Mask | Jul 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity LinkingLanguage Modeling | CodeCode Available | 0 |
| S2SNet: A Pretrained Neural Network for Superconductivity Discovery | Jun 28, 2023 | Electrical EngineeringLanguage Modeling | CodeCode Available | 0 |
| Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling | Jun 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Investigating Masking-based Data Generation in Language Models | Jun 16, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Personalized Image Enhancement Featuring Masked Style Modeling | Jun 15, 2023 | Image EnhancementLanguage Modeling | CodeCode Available | 0 |
| Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation | Jun 15, 2023 | Automatic Speech RecognitionClustering | CodeCode Available | 1 |
| Generate to Understand for Representation | Jun 14, 2023 | Contrastive LearningGPU | CodeCode Available | 1 |
| Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models | Jun 14, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Global and Local Semantic Completion Learning for Vision-Language Pre-training | Jun 12, 2023 | cross-modal alignmentImage-text Retrieval | CodeCode Available | 1 |
| Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization | Jun 7, 2023 | Abstractive Text SummarizationDecoder | —Unverified | 0 |
| Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems | Jun 7, 2023 | Conversational Response SelectionDecoder | CodeCode Available | 0 |
| Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction | Jun 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On the Difference of BERT-style and CLIP-style Text Encoders | Jun 6, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| Fair multilingual vandalism detection system for Wikipedia | Jun 2, 2023 | Feature EngineeringLanguage Modeling | CodeCode Available | 0 |
| Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression | Jun 1, 2023 | Contrastive LearningData Augmentation | —Unverified | 0 |