| DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data Visualization | Aug 14, 2024 | Data VisualizationLanguage Modeling | CodeCode Available | 0 |
| Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Aug 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Abstract Operations Research Modeling Using Natural Language Inputs | Aug 14, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area | Aug 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Cropper: Vision-Language Model for Image Cropping through In-Context Learning | Aug 14, 2024 | Image CroppingIn-Context Learning | —Unverified | 0 |
| Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Aug 14, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach | Aug 14, 2024 | Cross-Modal RetrievalLanguage Modeling | —Unverified | 0 |
| Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems | Aug 14, 2024 | GPULanguage Modeling | —Unverified | 0 |
| MGH Radiology Llama: A Llama 3 70B Model for Radiology | Aug 13, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation | Aug 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |