| Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement | Mar 11, 2024 | Clinical KnowledgeDescriptive | CodeCode Available | 2 |
| An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control | Mar 7, 2024 | Descriptive | CodeCode Available | 2 |
| Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation | Jan 1, 2024 | DescriptiveObject | CodeCode Available | 2 |
| Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models | Dec 14, 2023 | DescriptiveImage Quality Assessment | CodeCode Available | 2 |
| Customization Assistant for Text-to-image Generation | Dec 5, 2023 | DescriptiveImage Generation | CodeCode Available | 2 |
| TeCH: Text-guided Reconstruction of Lifelike Clothed Humans | Aug 16, 2023 | DescriptiveQuestion Answering | CodeCode Available | 2 |
| Solving Data Quality Problems with Desbordante: a Demo | Jul 27, 2023 | Anomaly DetectionDescriptive | CodeCode Available | 2 |
| AmadeusGPT: a natural language interface for interactive animal behavioral analysis | Jul 10, 2023 | Descriptive | CodeCode Available | 2 |
| Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language | Jun 28, 2023 | DescriptiveLanguage Modeling | CodeCode Available | 2 |
| Scalable 3D Captioning with Pretrained Models | Jun 12, 2023 | DescriptiveImage Captioning | CodeCode Available | 2 |