| debiaSAE: Benchmarking and Mitigating Vision-Language Model Bias | Oct 17, 2024 | BenchmarkingBias Detection | CodeCode Available | 0 |
| Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | AllLanguage Modeling | CodeCode Available | 0 |
| MedINST: Meta Dataset of Biomedical Instructions | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems | Oct 17, 2024 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| Proof Flow: Preliminary Study on Generative Flow Network Language Model Tuning for Formal Reasoning | Oct 17, 2024 | Automated Theorem ProvingLanguage Modeling | —Unverified | 0 |
| SLM-Mod: Small Language Models Surpass LLMs at Content Moderation | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Mitigating Biases to Embrace Diversity: A Comprehensive Annotation Benchmark for Toxic Language | Oct 17, 2024 | DescriptiveDiversity | —Unverified | 0 |
| Retrieval-Enhanced Named Entity Recognition | Oct 17, 2024 | In-Context LearningInformation Retrieval | —Unverified | 0 |
| Instruction-Driven Game Engine: A Poker Case Study | Oct 17, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Oct 17, 2024 | AttributeCode Completion | CodeCode Available | 7 |