| A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models | Feb 21, 2024 | Benchmarking, Image to text | Unverified | 0 |
| CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models | Feb 21, 2024 | Benchmarking | Unverified | 0 |
| MM-Soc: Benchmarking Multimodal Large Language Models in Social Media Platforms | Feb 21, 2024 | Benchmarking, Hate Speech Detection | Code Available | 0 |
| KetGPT -- Dataset Augmentation of Quantum Circuits using Transformers | Feb 20, 2024 | Benchmarking | Unverified | 0 |
| Synthetic location trajectory generation using categorical diffusion models | Feb 19, 2024 | Benchmarking, Decision Making | Code Available | 0 |
| FeB4RAG: Evaluating Federated Search in the Context of Retrieval Augmented Generation | Feb 19, 2024 | Benchmarking, Chatbot | Unverified | 0 |
| AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Feb 19, 2024 | Benchmarking | Code Available | 0 |
| Learning Disentangled Audio Representations through Controlled Synthesis | Feb 16, 2024 | Benchmarking, Disentanglement | Unverified | 0 |
| VATr++: Choose Your Words Wisely for Handwritten Text Generation | Feb 16, 2024 | Benchmarking, Text Generation | Unverified | 0 |
| The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models Collapse | Feb 15, 2024 | Benchmarking, Model Editing | Code Available | 0 |