| LakotaBERT: A Transformer-based Model for Low Resource Lakota Language | Mar 23, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Shushing! Let's Imagine an Authentic Speech from the Silent Video | Mar 19, 2025 | cross-modal alignment, Language Modeling | Unverified | 0 |
| ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning | Mar 14, 2025 | Code Generation, Decoder | Code Available | 0 |
| Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text | Feb 18, 2025 | Authorship Attribution, Language Modeling | Code Available | 0 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | Decoder, Information Retrieval | Code Available | 0 |
| Enabling Autoregressive Models to Fill In Masked Tokens | Feb 9, 2025 | Decoder, Language Modeling | Unverified | 0 |
| SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling | Jan 22, 2025 | Audio Compression, Language Modeling | Unverified | 0 |
| Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search | Dec 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach | Dec 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| A Progressive Transformer for Unifying Binary Code Embedding and Knowledge Transfer | Dec 15, 2024 | Feature Engineering, Language Modeling | Unverified | 0 |
| Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection | Dec 9, 2024 | Alzheimer's Disease Detection, Automatic Speech Recognition | Unverified | 0 |
| Small Languages, Big Models: A Study of Continual Training on Languages of Norway | Dec 9, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| AntLM: Bridging Causal and Masked Language Models | Dec 4, 2024 | Causal Language Modeling, Decoder | Unverified | 0 |
| Mitigating Gender Bias in Contextual Word Embeddings | Nov 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| CamemBERT 2.0: A Smarter French Language Model Aged to Perfection | Nov 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies | Oct 30, 2024 | Language Acquisition, Masked Language Modeling | Code Available | 0 |
| Abrupt Learning in Transformers: A Case Study on Matrix Completion | Oct 29, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Distributionally robust self-supervised learning for tabular data | Oct 11, 2024 | Decoder, Language Modeling | Code Available | 0 |
| DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models | Oct 10, 2024 | Image Generation, Language Modeling | Unverified | 0 |
| LecPrompt: A Prompt-based Approach for Logical Error Correction with CodeBERT | Oct 10, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training | Oct 8, 2024 | Graph Question Answering, Language Modeling | Code Available | 0 |
| SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics | Oct 2, 2024 | Classification, Language Modeling | Code Available | 0 |
| FARM: Functional Group-Aware Representations for Small Molecules | Oct 2, 2024 | Contrastive Learning, Drug Discovery | Unverified | 0 |
| Generating Synthetic Free-text Medical Records with Low Re-identification Risk using Masked Language Modeling | Sep 15, 2024 | Causal Language Modeling, De-identification | Code Available | 0 |
| VidLPRO: A Video-Language Pre-training Framework for Robotic and Laparoscopic Surgery | Sep 7, 2024 | Computational Efficiency, Contrastive Learning | Unverified | 0 |
| N-gram Prediction and Word Difference Representations for Language Modeling | Sep 5, 2024 | Causal Language Modeling, Language Modeling | Unverified | 0 |
| Dynamic Motion Synthesis: Masked Audio-Text Conditioned Spatio-Temporal Transformers | Sep 3, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| How transformers learn structured data: insights from hierarchical filtering | Aug 27, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | Decoder, Language Modeling | Code Available | 0 |
| Unlocking Efficiency: Adaptive Masking for Gene Transformer Models | Aug 13, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling | Aug 9, 2024 | Decoder, Language Modeling | Unverified | 0 |
| MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training | Jul 28, 2024 | Contrastive Learning, Language Modeling | Code Available | 0 |
| A Novel Two-Step Fine-Tuning Pipeline for Cold-Start Active Learning in Text Classification Tasks | Jul 24, 2024 | Active Learning, Domain Adaptation | Unverified | 0 |
| Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines | Jul 22, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs | Jul 22, 2024 | Few-Shot Learning, Graph Neural Network | Unverified | 0 |
| Pseudo-perplexity in One Fell Swoop for Protein Fitness Estimation | Jul 9, 2024 | Computational Efficiency, Language Modeling | Unverified | 0 |
| Historical Ink: Semantic Shift Detection for 19th Century Spanish | Jul 8, 2024 | Masked Language Modeling, Semantic Shift Detection | Code Available | 0 |
| LLMcap: Large Language Model for Unsupervised PCAP Failure Detection | Jul 3, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Jul 1, 2024 | Knowledge Graphs, Language Modeling | Code Available | 0 |
| ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization | Jul 1, 2024 | Code Summarization, Decoder | Unverified | 0 |
| TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems | Jun 21, 2024 | Contrastive Learning, Language Modeling | Unverified | 0 |
| QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Seventeenth-Century Spanish American Notary Records for Fine-Tuning Spanish Large Language Models | Jun 9, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models | Jun 4, 2024 | Document Dating, Language Modeling | Unverified | 0 |
| Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis | May 31, 2024 | Density Estimation, Imputation | Unverified | 0 |
| Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic | May 20, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Transformer based neural networks for emotion recognition in conversations | May 18, 2024 | Causal Language Modeling, Emotion Classification | Code Available | 0 |
| Self-Distillation Improves DNA Sequence Inference | May 14, 2024 | Contrastive Learning, Language Modeling | Code Available | 0 |
| Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget | Apr 30, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning | Apr 27, 2024 | Contrastive Learning, Language Modeling | Code Available | 0 |