| Emerging Property of Masked Token for Effective Pre-training | Apr 12, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| OPSD: an Offensive Persian Social media Dataset and its baseline evaluations | Apr 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining | Apr 1, 2024 | Contrastive LearningImage-text matching | —Unverified | 0 |
| Effectively Prompting Small-sized Language Models for Cross-lingual Tasks via Winning Tickets | Apr 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Developing Healthcare Language Model Embedding Spaces | Mar 28, 2024 | Contrastive LearningDocument Classification | —Unverified | 0 |
| Fingerprinting web servers through Transformer-encoded HTTP response headers | Mar 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Detecting Bias in Large Language Models: Fine-tuned KcBERT | Mar 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Punctuation Restoration Improves Structure Understanding Without Supervision | Feb 13, 2024 | ChunkingLanguage Modeling | CodeCode Available | 0 |
| Arabic Synonym BERT-based Adversarial Examples for Text Classification | Feb 5, 2024 | Adversarial TextLanguage Modeling | CodeCode Available | 0 |
| VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation | Feb 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation? | Jan 31, 2024 | ClassificationDomain Adaptation | —Unverified | 0 |
| BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining | Jan 29, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling | Jan 25, 2024 | Causal Language ModelingDecoder | —Unverified | 0 |
| Automated Scoring of Clinical Patient Notes using Advanced NLP and Pseudo Labeling | Jan 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A character-based steganography using masked language modeling | Jan 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentationfill-mask | —Unverified | 0 |
| Efficient Parallel Audio Generation using Group Masked Language Modeling | Jan 2, 2024 | Audio GenerationComputational Efficiency | —Unverified | 0 |
| HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments | Dec 20, 2023 | Hate Speech DetectionLanguage Modeling | —Unverified | 0 |
| Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification | Dec 8, 2023 | ClassificationFew-Shot Text Classification | CodeCode Available | 0 |
| LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models | Dec 1, 2023 | image-classificationImage Classification | —Unverified | 0 |
| BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling | Nov 28, 2023 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| CLIMB: Curriculum Learning for Infant-inspired Model Building | Nov 15, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| User Persona Identification and New Service Adaptation Recommendation | Nov 15, 2023 | Collaborative FilteringLanguage Modeling | —Unverified | 0 |
| Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision | Nov 4, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text | Oct 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Counterfactually Probing Language Identity in Multilingual Models | Oct 29, 2023 | counterfactualLanguage Modeling | CodeCode Available | 0 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Oct 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction | Oct 13, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| Enhancing BERT-Based Visual Question Answering through Keyword-Driven Sentence Selection | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Structural Self-Supervised Objectives for Transformers | Sep 15, 2023 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| PerPLM: Personalized Fine-tuning of Pretrained Language Models via Writer-specific Intermediate Learning and Prompts | Sep 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification | Sep 9, 2023 | Image to textLanguage Modeling | —Unverified | 0 |
| ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation | Aug 31, 2023 | Image-text matchingLanguage Modeling | —Unverified | 0 |
| Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection | Aug 30, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval | Aug 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Latent State Models of Training Dynamics | Aug 18, 2023 | image-classificationImage Classification | CodeCode Available | 0 |
| RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models | Aug 15, 2023 | DecoderIn-Context Learning | CodeCode Available | 0 |
| GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning | Jul 29, 2023 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving the Reusability of Pre-trained Language Models in Real-world Applications | Jul 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving BERT with Hybrid Pooling Network and Drop Mask | Jul 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity LinkingLanguage Modeling | CodeCode Available | 0 |
| S2SNet: A Pretrained Neural Network for Superconductivity Discovery | Jun 28, 2023 | Electrical EngineeringLanguage Modeling | CodeCode Available | 0 |
| Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling | Jun 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |