| Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision | Nov 4, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text | Oct 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Counterfactually Probing Language Identity in Multilingual Models | Oct 29, 2023 | counterfactualLanguage Modeling | CodeCode Available | 0 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Oct 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction | Oct 13, 2023 | Click-Through Rate PredictionLanguage Modeling | —Unverified | 0 |
| Enhancing BERT-Based Visual Question Answering through Keyword-Driven Sentence Selection | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Structural Self-Supervised Objectives for Transformers | Sep 15, 2023 | Fact VerificationLanguage Modeling | CodeCode Available | 0 |
| PerPLM: Personalized Fine-tuning of Pretrained Language Models via Writer-specific Intermediate Learning and Prompts | Sep 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification | Sep 9, 2023 | Image to textLanguage Modeling | —Unverified | 0 |
| ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation | Aug 31, 2023 | Image-text matchingLanguage Modeling | —Unverified | 0 |
| Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection | Aug 30, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval | Aug 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Latent State Models of Training Dynamics | Aug 18, 2023 | image-classificationImage Classification | CodeCode Available | 0 |
| RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models | Aug 15, 2023 | DecoderIn-Context Learning | CodeCode Available | 0 |
| GeneMask: Fast Pretraining of Gene Sequences to Enable Few-Shot Learning | Jul 29, 2023 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving the Reusability of Pre-trained Language Models in Real-world Applications | Jul 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving BERT with Hybrid Pooling Network and Drop Mask | Jul 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity LinkingLanguage Modeling | CodeCode Available | 0 |
| S2SNet: A Pretrained Neural Network for Superconductivity Discovery | Jun 28, 2023 | Electrical EngineeringLanguage Modeling | CodeCode Available | 0 |
| Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling | Jun 21, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |