| Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection | Dec 9, 2024 | Alzheimer's Disease Detection, Automatic Speech Recognition | Unverified | 0 |
| Small Languages, Big Models: A Study of Continual Training on Languages of Norway | Dec 9, 2024 | Language Modeling | Unverified | 0 |
| AntLM: Bridging Causal and Masked Language Models | Dec 4, 2024 | Causal Language Modeling, Decoder | Unverified | 0 |
| Mitigating Gender Bias in Contextual Word Embeddings | Nov 18, 2024 | Language Modeling | Unverified | 0 |
| CamemBERT 2.0: A Smarter French Language Model Aged to Perfection | Nov 13, 2024 | Language Modeling | Unverified | 0 |
| GPT or BERT: why not both? | Oct 31, 2024 | Causal Language Modeling, Language Modeling | Code Available | 2 |
| Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies | Oct 30, 2024 | Language Acquisition, Masked Language Modeling | Code Available | 0 |
| Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers | Oct 29, 2024 | Drug Design, Language Modeling | Code Available | 1 |
| Abrupt Learning in Transformers: A Case Study on Matrix Completion | Oct 29, 2024 | Language Modeling | Unverified | 0 |
| Distributionally robust self-supervised learning for tabular data | Oct 11, 2024 | Decoder, Language Modeling | Code Available | 0 |
| LecPrompt: A Prompt-based Approach for Logical Error Correction with CodeBERT | Oct 10, 2024 | Language Modeling | Unverified | 0 |
| DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Oct 10, 2024 | Image Generation, Language Modeling | Unverified | 0 |
| Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training | Oct 8, 2024 | Graph Question Answering, Language Modeling | Code Available | 0 |
| FARM: Functional Group-Aware Representations for Small Molecules | Oct 2, 2024 | Contrastive Learning, Drug Discovery | Unverified | 0 |
| SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics | Oct 2, 2024 | Classification, Language Modeling | Code Available | 0 |
| Generating Synthetic Free-text Medical Records with Low Re-identification Risk using Masked Language Modeling | Sep 15, 2024 | Causal Language Modeling, De-identification | Code Available | 0 |
| DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and Classification | Sep 13, 2024 | Language Modeling | Code Available | 1 |
| VidLPRO: A Video-Language Pre-training Framework for Robotic and Laparoscopic Surgery | Sep 7, 2024 | Computational Efficiency, Contrastive Learning | Unverified | 0 |
| N-gram Prediction and Word Difference Representations for Language Modeling | Sep 5, 2024 | Causal Language Modeling, Language Modeling | Unverified | 0 |
| Dynamic Motion Synthesis: Masked Audio-Text Conditioned Spatio-Temporal Transformers | Sep 3, 2024 | Language Modeling | Unverified | 0 |
| How transformers learn structured data: insights from hierarchical filtering | Aug 27, 2024 | Language Modeling | Code Available | 0 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | Decoder, Language Modeling | Code Available | 0 |
| Unlocking Efficiency: Adaptive Masking for Gene Transformer Models | Aug 13, 2024 | Language Modeling | Code Available | 0 |
| MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling | Aug 9, 2024 | Decoder, Language Modeling | Unverified | 0 |
| AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs | Jul 29, 2024 | Bilevel Optimization, Language Modeling | Code Available | 1 |