| AVSS: Layer Importance Evaluation in Large Language Models via Activation Variance-Sparsity Analysis | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Awes, Laws, and Flaws From Today's LLM Research | Aug 27, 2024 | EthicsLanguage Modeling | —Unverified | 0 |
| AXOLOTL: Fairness through Assisted Self-Debiasing of Large Language Model Outputs | Mar 1, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| Aya 23: Open Weight Releases to Further Multilingual Progress | May 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BadRobot: Jailbreaking Embodied LLMs in the Physical World | Jul 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline | Aug 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BaKlaVa -- Budgeted Allocation of KV cache for Long-context Inference | Feb 18, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems | Jun 16, 2025 | Large Language Model | —Unverified | 0 |
| Balancing Performance and Efficiency: A Multimodal Large Language Model Pruning Method based Image Text Interaction | Sep 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BAMBI: Developing Baby Language Models for Italian | Mar 12, 2025 | Language AcquisitionLanguage Modeling | —Unverified | 0 |
| BasedAI: A decentralized P2P network for Zero Knowledge Large Language Models (ZK-LLMs) | Mar 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BASES: Large-scale Web Search User Simulation with Large Language Model based Agents | Feb 27, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction | Aug 19, 2024 | Drug DiscoveryLanguage Modelling | —Unverified | 0 |
| BAT: Learning to Reason about Spatial Sounds with Large Language Models | Feb 2, 2024 | Event DetectionLanguage Modelling | —Unverified | 0 |
| Bayesian inference to improve quality of Retrieval Augmented Generation | Aug 12, 2024 | Bayesian InferenceLarge Language Model | —Unverified | 0 |
| Bayesian Reward Models for LLM Alignment | Feb 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Jul 18, 2024 | HallucinationLanguage Modelling | —Unverified | 0 |
| Allies: Prompting Large Language Model with Beam Search | May 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Behaviour Space Analysis of LLM-driven Meta-heuristic Discovery | Jul 4, 2025 | Large Language Model | —Unverified | 0 |
| BenchmarkCards: Large Language Model and Risk Reporting | Oct 16, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Benchmarking Large Language Model Capabilities for Conditional Generation | Jun 29, 2023 | BenchmarkingFew-Shot Learning | —Unverified | 0 |
| Benchmarking Large Language Models with Integer Sequence Generation Tasks | Nov 7, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Benchmarking Large Language Model Volatility | Nov 26, 2023 | BenchmarkingDecision Making | —Unverified | 0 |
| Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 | Apr 22, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design | Apr 14, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| bert2BERT: Towards Reusable Pretrained Language Models | Oct 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BeSimulator: A Large Language Model Powered Text-based Behavior Simulator | Sep 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Better Process Supervision with Bi-directional Rewarding Signals | Mar 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension | Dec 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving | Jan 2, 2024 | Autonomous DrivingCaption Generation | —Unverified | 0 |
| Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph | Jun 11, 2024 | 3D Object Retrieval3D Semantic Segmentation | —Unverified | 0 |
| Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses | Dec 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws | Dec 31, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | May 22, 2025 | Causal InferenceDrug Discovery | —Unverified | 0 |
| Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks | Jun 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling | May 21, 2025 | DiagnosticLarge Language Model | —Unverified | 0 |
| Beyond Exponential Decay: Rethinking Error Accumulation in Large Language Models | May 30, 2025 | Large Language Model | —Unverified | 0 |
| Beyond Forecasting: Compositional Time Series Reasoning for End-to-End Task Execution | Oct 5, 2024 | Anomaly DetectionDecision Making | —Unverified | 0 |
| Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems | May 18, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models | May 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Keywords: A Context-based Hybrid Approach to Mining Ethical Concern-related App Reviews | Nov 11, 2024 | EthicsLarge Language Model | —Unverified | 0 |
| Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts | Apr 10, 2025 | Graph GenerationLanguage Modeling | —Unverified | 0 |
| Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks | Jul 29, 2024 | BenchmarkingLanguage Model Evaluation | —Unverified | 0 |
| Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey | May 30, 2023 | ChatbotLanguage Modelling | —Unverified | 0 |
| Beyond Relevant Documents: A Knowledge-Intensive Approach for Query-Focused Summarization using Large Language Models | Aug 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering | May 17, 2025 | Document RankingLarge Language Model | —Unverified | 0 |
| Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing | Oct 14, 2024 | AllBinary Classification | —Unverified | 0 |
| Beyond Segmentation: Road Network Generation with Multi-Modal LLMs | Oct 15, 2023 | Autonomous NavigationLanguage Modeling | —Unverified | 0 |
| Beyond Self-Consistency: Ensemble Reasoning Boosts Consistency and Accuracy of LLMs in Cancer Staging | Apr 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |