| How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning | Jan 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations | Jan 26, 2025 | Cross-Modal RetrievalImage Retrieval | —Unverified | 0 |
| Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets | Jan 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ASRank: Zero-Shot Re-Ranking with Answer Scent for Document Retrieval | Jan 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding | Jan 25, 2025 | Action UnderstandingEmotion Recognition | —Unverified | 0 |
| PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures | Jan 25, 2025 | Large Language ModelMultimodal Large Language Model | CodeCode Available | 1 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FADLanguage Modeling | —Unverified | 0 |
| FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration | Jan 24, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 5 |
| A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education | Jan 24, 2025 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval | Jan 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |