| FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation | Jul 9, 2025 | Descriptive, Text Generation | Unverified | 0 |
| Beyond Accuracy: Metrics that Uncover What Makes a 'Good' Visual Descriptor | Jul 4, 2025 | Descriptive, image-classification | Code Available | 0 |
| Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization | Jul 3, 2025 | Descriptive, Disentanglement | Unverified | 0 |
| Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization | Jun 25, 2025 | Dense Video Captioning, Descriptive | Unverified | 0 |
| Experiential marketing strategy and tourism demand in the contribution of the positioning of the floating islands Los Uros, Puno | Jun 22, 2025 | Descriptive, Marketing | Unverified | 0 |
| A Simple Contrastive Framework Of Item Tokenization For Generative Recommendation | Jun 20, 2025 | Contrastive Learning, Descriptive | Unverified | 0 |
| Uncovering Intention through LLM-Driven Code Snippet Description Generation | Jun 18, 2025 | Descriptive | Unverified | 0 |
| Evolvable Conditional Diffusion | Jun 16, 2025 | Denoising, Descriptive | Unverified | 0 |
| A Semantically-Aware Relevance Measure for Content-Based Medical Image Retrieval Evaluation | Jun 16, 2025 | Content-Based Image Retrieval, Descriptive | Unverified | 0 |
| Rethinking Optimization: A Systems-Based Approach to Social Externalities | Jun 15, 2025 | Descriptive | Unverified | 0 |
| Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables | Jun 13, 2025 | Benchmarking, Descriptive | Unverified | 0 |
| CoLMbo: Speaker Language Model for Descriptive Profiling | Jun 11, 2025 | Descriptive, Language Modeling | Code Available | 0 |
| Alice and the Caterpillar: A more descriptive null model for assessing data mining results | Jun 11, 2025 | Descriptive | Code Available | 0 |
| ARGUS: Hallucination and Omission Evaluation in Video-LLMs | Jun 9, 2025 | Descriptive, Form | Unverified | 0 |
| ArchiLense: A Framework for Quantitative Analysis of Architectural Styles Based on Vision Large Language Models | Jun 9, 2025 | Descriptive | Unverified | 0 |
| The Influence of Tourist Experience on Revisit Decisions with the Mediation of Tourist Satisfaction | Jun 6, 2025 | Descriptive, Marketing | Unverified | 0 |
| PRJ: Perception-Retrieval-Judgement for Generated Images | Jun 4, 2025 | Descriptive, Retrieval | Unverified | 0 |
| Protein folding classes -- High-dimensional geometry of amino acid composition space revisited | Jun 2, 2025 | Descriptive, Protein Folding | Unverified | 0 |
| Effect of Insecurity on Agricultural Output in Benue State, Nigeria | Jun 2, 2025 | Descriptive | Unverified | 0 |
| NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization | May 30, 2025 | Descriptive, Form | Unverified | 0 |
| LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization | May 29, 2025 | Descriptive, Vector Graphics | Unverified | 0 |
| Comparative analysis of privacy-preserving open-source LLMs regarding extraction of diagnostic information from clinical CMR imaging reports | May 29, 2025 | Descriptive, Diagnostic | Unverified | 0 |
| NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-ID | May 26, 2025 | Attribute, Caption Generation | Unverified | 0 |
| BiomechGPT: Towards a Biomechanically Fluent Multimodal Foundation Model for Clinically Relevant Motion Tasks | May 24, 2025 | Activity Recognition, Descriptive | Unverified | 0 |
| Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition | May 23, 2025 | Descriptive, Emotion Recognition | Code Available | 0 |