| ChatSUMO: Large Language Model for Automating Traffic Scenario Generation in Simulation of Urban MObility | Aug 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks | Aug 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| VLM-KD: Knowledge Distillation from VLM for Long-Tail Visual Recognition | Aug 29, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Plausible-Parrots @ MSP2023: Enhancing Semantic Plausibility Modeling using Entity and Event Knowledge | Aug 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling | Aug 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models | Aug 29, 2024 | Data AugmentationImage Retrieval | —Unverified | 0 |
| Law of Vision Representation in MLLMs | Aug 29, 2024 | cross-modal alignmentLanguage Modeling | CodeCode Available | 2 |
| DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving | Aug 29, 2024 | Autonomous DrivingDenoising | —Unverified | 0 |
| Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction | Aug 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |