| BabyLM Turns 3: Call for papers for the 2025 BabyLM workshop | Feb 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Back from the future: bidirectional CTC decoding using future information in speech recognition | Oct 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity | Feb 24, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema | Apr 16, 2021 | Artifact DetectionBias Detection | —Unverified | 0 | 0 |
| Backtracking Improves Generation Safety | Sep 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification | Jul 22, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Backward and Forward Language Modeling for Constrained Sentence Generation | Dec 21, 2015 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Backward Lens: Projecting Language Model Gradients into the Vocabulary Space | Feb 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT | Feb 21, 2023 | Backdoor AttackLanguage Modeling | —Unverified | 0 | 0 |
| BadRobot: Jailbreaking Embodied LLMs in the Physical World | Jul 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BAGEL: Bootstrapping Agents by Guiding Exploration with Language | Mar 12, 2024 | In-Context Learning | —Unverified | 0 | 0 |
| Bag of Tricks for Effective Language Model Pretraining and Downstream Adaptation: A Case Study on GLUE | Feb 18, 2023 | Contrastive LearningDenoising | —Unverified | 0 | 0 |
| BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline | Aug 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference | Feb 18, 2025 | GPULanguage Modeling | —Unverified | 0 | 0 |
| Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Balancing Average and Worst-case Accuracy in Multitask Learning | Oct 12, 2021 | image-classificationImage Classification | —Unverified | 0 | 0 |
| Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Sparsely-gated Mixture-of-Expert Layers for CNN Interpretability | Apr 22, 2022 | image-classificationImage Classification | —Unverified | 0 | 0 |
| Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction | Sep 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | Feb 24, 2025 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 | 0 |
| BAMBI: Developing Baby Language Models for Italian | Mar 12, 2025 | Language AcquisitionLanguage Modeling | —Unverified | 0 | 0 |
| BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BanglaHateBERT: BERT for Abusive Language Detection in Bengali | Jun 1, 2022 | Abusive LanguageLanguage Modeling | —Unverified | 0 | 0 |
| Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models | Sep 13, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Bangla Word Clustering Based on Tri-gram, 4-gram and 5-gram Language Model | Jan 27, 2017 | ClusteringLanguage Modeling | —Unverified | 0 | 0 |
| BART based semantic correction for Mandarin automatic speech recognition system | Mar 26, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| BART for Post-Correction of OCR Newspaper Text | Nov 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BART-light: One Decoder Layer Is Enough | Sep 17, 2021 | DecoderLanguage Modeling | —Unverified | 0 | 0 |
| BAS: An Answer Selection Method Using BERT Language Model | Nov 4, 2019 | Answer SelectionLanguage Modeling | —Unverified | 0 | 0 |
| BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | Mar 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BASES: Large-scale Web Search User Simulation with Large Language Model based Agents | Feb 27, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 | 0 |
| BayesFormer: Transformer with Uncertainty Estimation | Jun 2, 2022 | Active LearningLanguage Modeling | —Unverified | 0 | 0 |
| Bayesian Language Model based on Mixture of Segmental Contexts for Spontaneous Utterances with Unexpected Words | Dec 1, 2016 | Automatic Speech Recognition (ASR)Language Modeling | —Unverified | 0 | 0 |
| Bayesian Neural Networks with Variance Propagation for Uncertainty Evaluation | Jan 1, 2021 | Bayesian InferenceComputational Efficiency | —Unverified | 0 | 0 |
| Bayesian Reward Models for LLM Alignment | Feb 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| β-calibration of Language Model Confidence Scores for Generative QA | Oct 9, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 | 0 |
| BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | Jan 29, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| BEA-Base: A Benchmark for ASR of Spontaneous Hungarian | Feb 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| BEA-Base: A Benchmark for ASR of Spontaneous Hungarian | Jun 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Beam-Guided Knowledge Replay for Knowledge-Rich Image Captioning using Vision-Language Model | May 29, 2025 | Image CaptioningLanguage Modeling | —Unverified | 0 | 0 |
| Allies: Prompting Large Language Model with Beam Search | May 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Beam Search with Bidirectional Strategies for Neural Response Generation | Oct 7, 2021 | DecoderLanguage Modeling | —Unverified | 0 | 0 |
| BeamSeg: A Joint Model for Multi-Document Segmentation and Topic Identification | Nov 1, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BeanCounter: A low-toxicity, large-scale, and open dataset of business-oriented text | Sep 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Behavior Gated Language Models | Aug 31, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Behind the Scenes of an Evolving Event Cloze Test | Apr 1, 2017 | Cloze TestCommon Sense Reasoning | —Unverified | 0 | 0 |
| belabBERT: a Dutch RoBERTa-based language model applied to psychiatric classification | Jun 2, 2021 | Audio ClassificationClassification | —Unverified | 0 | 0 |
| BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief | Sep 29, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| BenchDirect: A Directed Language Model for Compiler Benchmarks | Mar 2, 2023 | Active LearningCPU | —Unverified | 0 | 0 |
| BenchmarkCards: Large Language Model and Risk Reporting | Oct 16, 2024 | FairnessLanguage Modeling | —Unverified | 0 | 0 |