| Privacy Leakage Overshadowed by Views of AI: A Study on Human Oversight of Privacy in Language Model Agent | Nov 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Leveraging Large Language Models for Code-Mixed Data Augmentation in Sentiment Analysis | Nov 1, 2024 | Data Augmentation, Language Modeling | Code Available | 0 |
| Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations | Nov 1, 2024 | Informativeness, Language Modeling | Unverified | 0 |
| Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models | Nov 1, 2024 | Decision Making, Informativeness | Code Available | 1 |
| Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers | Nov 1, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Improving Few-Shot Cross-Domain Named Entity Recognition by Instruction Tuning a Word-Embedding based Retrieval Augmented Large Language Model | Nov 1, 2024 | Benchmarking, Cross-Domain Named Entity Recognition | Unverified | 0 |
| Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback | Nov 1, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement | Nov 1, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents | Nov 1, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| RadFlag: A Black-Box Hallucination Detection Method for Medical Vision Language Models | Nov 1, 2024 | Hallucination, Language Modeling | Unverified | 0 |