| Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios? | Oct 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model Evaluation via Matrix Nuclear-Norm | Oct 14, 2024 | Computational Efficiency, Data Compression | Code Available | 0 |
| ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws | Oct 13, 2024 | Large Language Model | Unverified | 0 |
| MisinfoEval: Generative AI in the Era of "Alternative Facts" | Oct 13, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Adaptive Reasoning and Acting in Medical Language Agents | Oct 13, 2024 | Decision Making, Diagnostic | Unverified | 0 |
| LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering | Oct 13, 2024 | Answer Generation, Language Modeling | Unverified | 0 |
| Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles | Oct 13, 2024 | Autonomous Vehicles, Code Generation | Unverified | 0 |
| Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization | Oct 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | Oct 13, 2024 | GPU, Language Modeling | Unverified | 0 |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | Oct 12, 2024 | Code Generation, Language Modeling | Unverified | 0 |
| Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement | Oct 12, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning | Oct 12, 2024 | Audio captioning, Large Language Model | Unverified | 0 |
| LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning | Oct 12, 2024 | Knowledge Graphs, Language Modeling | Code Available | 0 |
| Debiasing Vison-Language Models with Text-Only Training | Oct 12, 2024 | Large Language Model | Unverified | 0 |
| ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation | Oct 11, 2024 | Diagnostic, Language Modeling | Unverified | 0 |
| Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | Oct 11, 2024 | Large Language Model, Logical Reasoning | Unverified | 0 |
| LLMD: A Large Language Model for Interpreting Longitudinal Medical Records | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Preferential Normalizing Flows | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Can a large language model be a gaslighter? | Oct 11, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Emergent social conventions and collective bias in LLM populations | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Enterprise Benchmarks for Large Language Model Evaluation | Oct 11, 2024 | Benchmarking, Language Model Evaluation | Code Available | 0 |
| Hypothesis-only Biases in Large Language Model-Elicited Natural Language Inference | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ∀uto∃∨∧L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks | Oct 11, 2024 | Benchmarking, Language Modeling | Unverified | 0 |