| Adaptively profiling models with task elicitation | Mar 3, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| Forgetting Transformer: Softmax Attention with a Forget Gate | Mar 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| KurTail : Kurtosis-based LLM Quantization | Mar 3, 2025 | GPULanguage Modeling | —Unverified | 0 |
| LLMs as Educational Analysts: Transforming Multimodal Data Traces into Actionable Reading Assessment Reports | Mar 3, 2025 | FairnessLanguage Modeling | CodeCode Available | 0 |
| SHADE-AD: An LLM-Based Framework for Synthesizing Activity Data of Alzheimer's Patients | Mar 3, 2025 | Activity RecognitionHuman Activity Recognition | —Unverified | 0 |
| WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language Models | Mar 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Syntactic Learnability of Echo State Neural Language Models at Scale | Mar 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ReaderLM-v2: Small Language Model for HTML to Markdown and JSON | Mar 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can (A)I Change Your Mind? | Mar 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Jailbreaking Safeguarded Text-to-Image Models via Large Language Models | Mar 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Generate Long-term Future Narrations Describing Activities of Daily Living | Mar 3, 2025 | Action AnticipationDecision Making | —Unverified | 0 |
| Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh | Mar 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFD | Mar 3, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model | Mar 2, 2025 | AnatomyDenoising | CodeCode Available | 0 |
| Waste Not, Want Not; Recycled Gumbel Noise Improves Consistency in Natural Language Generation | Mar 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transformer Meets Twicing: Harnessing Unattended Residual Information | Mar 2, 2025 | Adversarial Robustnessimage-classification | CodeCode Available | 0 |
| FunBench: Benchmarking Fundus Reading Skills of MLLMs | Mar 2, 2025 | AnatomyBenchmarking | —Unverified | 0 |
| Enhancing Monocular 3D Scene Completion with Diffusion Model | Mar 2, 2025 | 3D Reconstruction3D Scene Reconstruction | CodeCode Available | 1 |
| CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering | Mar 1, 2025 | Continual LearningLanguage Modeling | —Unverified | 0 |
| NeuroSymAD: A Neuro-Symbolic Framework for Interpretable Alzheimer's Disease Diagnosis | Mar 1, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extreme Sea Conditions | Mar 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PinLanding: Content-First Keyword Landing Page Generation via Multi-Modal AI for Web-Scale Discovery | Mar 1, 2025 | AttributeAttribute Extraction | —Unverified | 0 |
| Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable | Mar 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Challenges in Testing Large Language Model Based Software: A Faceted Taxonomy | Mar 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Mapping in Multimodal Music Learning: A Grand Challenge Proposal | Mar 1, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| Leveraging Compute-in-Memory for Efficient Generative Model Inference in TPUs | Mar 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement | Mar 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Reducing Large Language Model Safety Risks in Women's Health using Semantic Entropy | Mar 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards General Visual-Linguistic Face Forgery Detection(V2) | Feb 28, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Feb 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization | Feb 28, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Invariant Tokenization of Crystalline Materials for Language Model Enabled Generation | Feb 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Llamarine: Open-source Maritime Industry-specific Large Language Model | Feb 28, 2025 | Collision AvoidanceDecision Making | —Unverified | 0 |
| MAMUT: A Novel Framework for Modifying Mathematical Formulas for the Generation of Specialized Datasets for Language Model Training | Feb 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Can LLM Assist in the Evaluation of the Quality of Machine Learning Explanations? | Feb 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Protein Structure Tokenization: Benchmarking and New Recipe | Feb 28, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Feb 28, 2025 | Audio GenerationForm | CodeCode Available | 5 |
| Chronologically Consistent Large Language Models | Feb 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication | Feb 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NANOGPT: A Query-Driven Large Language Model Retrieval-Augmented Generation System for Nanotechnology Research | Feb 27, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Protecting multimodal large language models against misleading visualizations | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models | Feb 27, 2025 | GPUKnowledge Distillation | —Unverified | 0 |
| Large Language Model Strategic Reasoning Evaluation through Behavioral Game Theory | Feb 27, 2025 | Decision MakingFairness | —Unverified | 0 |
| From Retrieval to Generation: Comparing Different Approaches | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Conformal Tail Risk Control for Large Language Model Alignment | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Collaborative Stance Detection via Small-Large Language Model Consistency Verification | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |