| PRJ: Perception-Retrieval-Judgement for Generated Images | Jun 4, 2025 | DescriptiveRetrieval | —Unverified | 0 |
| Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability | Jun 2, 2025 | DescriptiveSynthetic Data Generation | CodeCode Available | 1 |
| Protein folding classes -- High-dimensional geometry of amino acid composition space revisited | Jun 2, 2025 | DescriptiveProtein Folding | —Unverified | 0 |
| Effect of Insecurity on Agricultural Output in Benue State, Nigeria | Jun 2, 2025 | Descriptive | —Unverified | 0 |
| Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Jun 2, 2025 | 4kDescriptive | CodeCode Available | 3 |
| NexusSum: Hierarchical LLM Agents for Long-Form Narrative Summarization | May 30, 2025 | DescriptiveForm | —Unverified | 0 |
| Comparative analysis of privacy-preserving open-source LLMs regarding extraction of diagnostic information from clinical CMR imaging reports | May 29, 2025 | DescriptiveDiagnostic | —Unverified | 0 |
| VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning | May 29, 2025 | Anomaly DetectionDescriptive | CodeCode Available | 2 |
| LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization | May 29, 2025 | DescriptiveVector Graphics | —Unverified | 0 |
| NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-ID | May 26, 2025 | AttributeCaption Generation | —Unverified | 0 |
| BiomechGPT: Towards a Biomechanically Fluent Multimodal Foundation Model for Clinically Relevant Motion Tasks | May 24, 2025 | Activity RecognitionDescriptive | —Unverified | 0 |
| Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition | May 23, 2025 | DescriptiveEmotion Recognition | CodeCode Available | 0 |
| Creatively Upscaling Images with Global-Regional Priors | May 22, 2025 | DenoisingDescriptive | —Unverified | 0 |
| CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation | May 22, 2025 | AttributeDescriptive | —Unverified | 0 |
| GitHub Repository Complexity Leads to Diminished Web Archive Availability | May 21, 2025 | Descriptive | —Unverified | 0 |
| Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets | May 21, 2025 | Dataset GenerationDescriptive | —Unverified | 0 |
| Multimodal RAG-driven Anomaly Detection and Classification in Laser Powder Bed Fusion using Large Language Models | May 20, 2025 | Anomaly DetectionDescriptive | —Unverified | 0 |
| Descriptive Image-Text Matching with Graded Contextual Similarity | May 15, 2025 | DescriptiveImage-text matching | —Unverified | 0 |
| The Human-Data-Model Interaction Canvas for Visual Analytics | May 12, 2025 | Descriptive | —Unverified | 0 |
| Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models | May 11, 2025 | DescriptiveDiagnostic | CodeCode Available | 1 |
| Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration | May 11, 2025 | BenchmarkingDescriptive | —Unverified | 0 |
| Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding | May 10, 2025 | DescriptiveEmotion Recognition | CodeCode Available | 1 |
| KCluster: An LLM-based Clustering Approach to Knowledge Component Discovery | May 9, 2025 | ClusteringDescriptive | CodeCode Available | 0 |
| SweRank: Software Issue Localization with Code Ranking | May 7, 2025 | Descriptive | —Unverified | 0 |
| Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model | May 7, 2025 | Data AugmentationDescriptive | —Unverified | 0 |