| Llamarine: Open-source Maritime Industry-specific Large Language Model | Feb 28, 2025 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Invariant Tokenization of Crystalline Materials for Language Model Enabled Generation | Feb 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can LLM Assist in the Evaluation of the Quality of Machine Learning Explanations? | Feb 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Chronologically Consistent Large Language Models | Feb 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization | Feb 28, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Conformal Tail Risk Control for Large Language Model Alignment | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Do Sparse Autoencoders Generalize? A Case Study of Answerability | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| From Retrieval to Generation: Comparing Different Approaches | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models | Feb 27, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model | Feb 27, 2025 | Bayesian OptimizationDrug Discovery | —Unverified | 0 |
| Large Language Model Strategic Reasoning Evaluation through Behavioral Game Theory | Feb 27, 2025 | Decision MakingFairness | —Unverified | 0 |
| KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model | Feb 27, 2025 | Drug DiscoveryKnowledge Graphs | —Unverified | 0 |
| GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Collaborative Stance Detection via Small-Large Language Model Consistency Verification | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| M-LLM Based Video Frame Selection for Efficient Video Understanding | Feb 27, 2025 | EgoSchemaLanguage Modeling | —Unverified | 0 |
| Protecting multimodal large language models against misleading visualizations | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| NANOGPT: A Query-Driven Large Language Model Retrieval-Augmented Generation System for Nanotechnology Research | Feb 27, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models | Feb 27, 2025 | counterfactualLanguage Modeling | —Unverified | 0 |
| SEKI: Self-Evolution and Knowledge Inspiration based Neural Architecture Search via Large Language Models | Feb 27, 2025 | GPUKnowledge Distillation | —Unverified | 0 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Feb 26, 2025 | Cross-Modal RetrievalHallucination | —Unverified | 0 |
| Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision | Feb 26, 2025 | Audio SynthesisAutomatic Speech Recognition | —Unverified | 0 |
| The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Revealing Treatment Non-Adherence Bias in Clinical Machine Learning Using Large Language Models | Feb 26, 2025 | Causal InferenceLanguage Modeling | —Unverified | 0 |
| Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions | Feb 26, 2025 | Cross-Modal RetrievalLanguage Modeling | —Unverified | 0 |
| TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency | Feb 26, 2025 | intent-classificationIntent Classification | CodeCode Available | 0 |
| A City of Millions: Mapping Literary Social Networks At Scale | Feb 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning | Feb 26, 2025 | Decoderimage-classification | —Unverified | 0 |
| Conformal Linguistic Calibration: Trading-off between Factuality and Specificity | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ANPMI: Assessing the True Comprehension Capabilities of LLMs for Multiple Choice Questions | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Kanana: Compute-efficient Bilingual Language Models | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Gender Bias in German Machine Translation | Feb 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Improving Representation Learning of Complex Critical Care Data with ICU-BERT | Feb 26, 2025 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Large Language Model Driven Agents for Simulating Echo Chamber Formation | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems | Feb 25, 2025 | Bayesian OptimizationHyperparameter Optimization | —Unverified | 0 |
| AMPO: Active Multi-Preference Optimization | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | Feb 25, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation | Feb 25, 2025 | Automated Theorem ProvingLanguage Modeling | —Unverified | 0 |
| AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | Feb 25, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Can LLMs Explain Themselves Counterfactually? | Feb 25, 2025 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Independent Mobility GPT (IDM-GPT): A Self-Supervised Multi-Agent Large Language Model Framework for Customized Traffic Mobility Analysis Using Machine Learning Models | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Iterative Counterfactual Data Augmentation | Feb 25, 2025 | counterfactualData Augmentation | CodeCode Available | 0 |
| Broadening Discovery through Structural Models: Multimodal Combination of Local and Structural Properties for Predicting Chemical Features | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VALUE: Value-Aware Large Language Model for Query Rewriting via Weighted Trie in Sponsored Search | Feb 25, 2025 | AttributeLanguage Modeling | —Unverified | 0 |
| Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training | Feb 25, 2025 | Arithmetic ReasoningData Augmentation | —Unverified | 0 |
| PyEvalAI: AI-assisted evaluation of Jupyter Notebooks for immediate personalized feedback | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning | Feb 25, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 |