| Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| M-LLM Based Video Frame Selection for Efficient Video Understanding | Feb 27, 2025 | EgoSchemaLanguage Modeling | —Unverified | 0 |
| Collaborative Stance Detection via Small-Large Language Model Consistency Verification | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models | Feb 27, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| Do Sparse Autoencoders Generalize? A Case Study of Answerability | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration | Feb 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model | Feb 27, 2025 | Bayesian OptimizationDrug Discovery | —Unverified | 0 |
| SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model | Feb 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Conformal Linguistic Calibration: Trading-off between Factuality and Specificity | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning | Feb 26, 2025 | Decoderimage-classification | —Unverified | 0 |
| TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency | Feb 26, 2025 | intent-classificationIntent Classification | CodeCode Available | 0 |
| Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision | Feb 26, 2025 | Audio SynthesisAutomatic Speech Recognition | —Unverified | 0 |
| Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions | Feb 26, 2025 | Cross-Modal RetrievalLanguage Modeling | —Unverified | 0 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | Feb 26, 2025 | Cross-Modal RetrievalHallucination | —Unverified | 0 |
| AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Feb 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Kanana: Compute-efficient Bilingual Language Models | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ANPMI: Assessing the True Comprehension Capabilities of LLMs for Multiple Choice Questions | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A City of Millions: Mapping Literary Social Networks At Scale | Feb 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training | Feb 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Revealing Treatment Non-Adherence Bias in Clinical Machine Learning Using Large Language Models | Feb 26, 2025 | Causal InferenceLanguage Modeling | —Unverified | 0 |
| Improving Representation Learning of Complex Critical Care Data with ICU-BERT | Feb 26, 2025 | Feature EngineeringLanguage Modeling | —Unverified | 0 |
| Evaluating Gender Bias in German Machine Translation | Feb 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems | Feb 25, 2025 | Bayesian OptimizationHyperparameter Optimization | —Unverified | 0 |
| VALUE: Value-Aware Large Language Model for Query Rewriting via Weighted Trie in Sponsored Search | Feb 25, 2025 | AttributeLanguage Modeling | —Unverified | 0 |
| from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | Feb 25, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Independent Mobility GPT (IDM-GPT): A Self-Supervised Multi-Agent Large Language Model Framework for Customized Traffic Mobility Analysis Using Machine Learning Models | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | Feb 25, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Feb 25, 2025 | Decision MakingDiagnostic | CodeCode Available | 2 |
| Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training | Feb 25, 2025 | Arithmetic ReasoningData Augmentation | —Unverified | 0 |
| Can LLMs Explain Themselves Counterfactually? | Feb 25, 2025 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| AMPO: Active Multi-Preference Optimization | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PyEvalAI: AI-assisted evaluation of Jupyter Notebooks for immediate personalized feedback | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rank1: Test-Time Compute for Reranking in Information Retrieval | Feb 25, 2025 | Information RetrievalInstruction Following | CodeCode Available | 2 |
| TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning | Feb 25, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 0 |
| olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models | Feb 25, 2025 | DiversityLanguage Modeling | CodeCode Available | 11 |
| SPECTRE: An FFT-Based Efficient Drop-In Replacement to Self-Attention for Long Contexts | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Broadening Discovery through Structural Models: Multimodal Combination of Local and Structural Properties for Predicting Chemical Features | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation | Feb 25, 2025 | Automated Theorem ProvingLanguage Modeling | —Unverified | 0 |
| Large Language Model Driven Agents for Simulating Echo Chamber Formation | Feb 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Iterative Counterfactual Data Augmentation | Feb 25, 2025 | counterfactualData Augmentation | CodeCode Available | 0 |
| NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought | Feb 25, 2025 | Emotion RecognitionLanguage Modeling | CodeCode Available | 1 |
| Inverse Materials Design by Large Language Model-Assisted Generative Framework | Feb 25, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Improving Interactive Diagnostic Ability of a Large Language Model Agent Through Clinical Experience Learning | Feb 24, 2025 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Knowledge Distillation with Training Wheels | Feb 24, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |