| Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions | Feb 26, 2025 | Cross-Modal Retrieval, Language Modeling | Unverified | 0 |
| TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency | Feb 26, 2025 | Intent Classification | Code Available | 0 |
| A City of Millions: Mapping Literary Social Networks At Scale | Feb 26, 2025 | Language Modeling | Code Available | 0 |
| I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning | Feb 26, 2025 | Decoder, Image Classification | Unverified | 0 |
| Conformal Linguistic Calibration: Trading-off between Factuality and Specificity | Feb 26, 2025 | Language Modeling | Unverified | 0 |
| ANPMI: Assessing the True Comprehension Capabilities of LLMs for Multiple Choice Questions | Feb 26, 2025 | Language Modeling | Unverified | 0 |
| Kanana: Compute-efficient Bilingual Language Models | Feb 26, 2025 | Language Modeling | Unverified | 0 |
| Evaluating Gender Bias in German Machine Translation | Feb 26, 2025 | Language Modeling | Code Available | 0 |
| Improving Representation Learning of Complex Critical Care Data with ICU-BERT | Feb 26, 2025 | Feature Engineering, Language Modeling | Unverified | 0 |
| Large Language Model Driven Agents for Simulating Echo Chamber Formation | Feb 25, 2025 | Language Modeling | Unverified | 0 |
| from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors | Feb 25, 2025 | Language Modeling | Unverified | 0 |
| Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Feb 25, 2025 | Language Modeling | Code Available | 0 |
| Faster, Cheaper, Better: Multi-Objective Hyperparameter Optimization for LLM and RAG Systems | Feb 25, 2025 | Bayesian Optimization, Hyperparameter Optimization | Unverified | 0 |
| AMPO: Active Multi-Preference Optimization | Feb 25, 2025 | Language Modeling | Unverified | 0 |
| LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | Feb 25, 2025 | Image Generation, Language Modeling | Unverified | 0 |
| A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation | Feb 25, 2025 | Automated Theorem Proving, Language Modeling | Unverified | 0 |
| AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages | Feb 25, 2025 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Can LLMs Explain Themselves Counterfactually? | Feb 25, 2025 | Counterfactual, Counterfactual Reasoning | Unverified | 0 |
| Independent Mobility GPT (IDM-GPT): A Self-Supervised Multi-Agent Large Language Model Framework for Customized Traffic Mobility Analysis Using Machine Learning Models | Feb 25, 2025 | Language Modeling | Unverified | 0 |
| Iterative Counterfactual Data Augmentation | Feb 25, 2025 | Counterfactual, Data Augmentation | Code Available | 0 |
| Broadening Discovery through Structural Models: Multimodal Combination of Local and Structural Properties for Predicting Chemical Features | Feb 25, 2025 | Language Modeling | Unverified | 0 |
| VALUE: Value-Aware Large Language Model for Query Rewriting via Weighted Trie in Sponsored Search | Feb 25, 2025 | Attribute, Language Modeling | Unverified | 0 |
| Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training | Feb 25, 2025 | Arithmetic Reasoning, Data Augmentation | Unverified | 0 |
| PyEvalAI: AI-assisted evaluation of Jupyter Notebooks for immediate personalized feedback | Feb 25, 2025 | Language Modeling | Unverified | 0 |
| TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning | Feb 25, 2025 | Instruction Following, Language Modeling | Code Available | 0 |