| ReGraP-LLaVA: Reasoning enabled Graph-based Personalized Large Language and Vision Assistant | May 6, 2025 | DescriptiveMultiple-choice | CodeCode Available | 0 |
| Dual-Forecaster: A Multimodal Time Series Model Integrating Descriptive and Predictive Texts | May 2, 2025 | DescriptiveTime Series | —Unverified | 0 |
| A flexible Bayesian non-parametric mixture model reveals multiple dependencies of swap errors in visual working memory | May 2, 2025 | Descriptive | —Unverified | 0 |
| A Unifying Framework for Robust and Efficient Inference with Unstructured Data | May 1, 2025 | Causal InferenceDescriptive | —Unverified | 0 |
| Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors | Apr 30, 2025 | DescriptiveObject | —Unverified | 0 |
| Enhancing Health Mention Classification Performance: A Study on Advancements in Parameter Efficient Tuning | Apr 30, 2025 | DescriptivePOS | —Unverified | 0 |
| RuleKit 2: Faster and simpler rule learning | Apr 29, 2025 | Descriptive | CodeCode Available | 2 |
| Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization | Apr 24, 2025 | DescriptiveSegmentation | —Unverified | 0 |
| Demand for LLMs: Descriptive Evidence on Substitution, Market Expansion, and Multihoming | Apr 21, 2025 | Descriptive | —Unverified | 0 |
| The Influence of Establishing Belt and Road Node Cities on the Development of Digital Inclusive Finance Across Chinese Provinces | Apr 20, 2025 | Descriptive | —Unverified | 0 |
| NVSMask3D: Hard Visual Prompting with Camera Pose Interpolation for 3D Open Vocabulary Instance Segmentation | Apr 20, 2025 | 3D Instance Segmentation3D Open-Vocabulary Instance Segmentation | —Unverified | 0 |
| FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models | Apr 20, 2025 | DescriptiveEthics | —Unverified | 0 |
| Visualization Tasks for Unlabelled Graphs | Apr 19, 2025 | Descriptive | —Unverified | 0 |
| Contextualizing Spotify's Audiobook List Recommendations with Descriptive Shelves | Apr 18, 2025 | Descriptive | —Unverified | 0 |
| Position Uncertainty in a Prisoner's Dilemma Game : An Experiment | Apr 14, 2025 | DescriptivePosition | —Unverified | 0 |
| Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models | Apr 14, 2025 | BenchmarkingDescriptive | —Unverified | 0 |
| Integrating Large Language Models for Automated Structural Analysis | Apr 13, 2025 | DescriptiveIn-Context Learning | —Unverified | 0 |
| 3D CoCa: Contrastive Learners are 3D Captioners | Apr 13, 2025 | 3D dense captioningCaption Generation | CodeCode Available | 0 |
| Big Meaning: Qualitative Analysis on Large Bodies of Data Using AI | Apr 11, 2025 | ArticlesDescriptive | —Unverified | 0 |
| JEPA4Rec: Learning Effective Language Representations for Sequential Recommendation via Joint Embedding Predictive Architecture | Apr 10, 2025 | Common Sense ReasoningDescriptive | —Unverified | 0 |
| Overcoming the Identity Mapping Problem in Self-Supervised Hyperspectral Anomaly Detection | Apr 5, 2025 | Anomaly DetectionDescriptive | CodeCode Available | 0 |
| Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities | Apr 2, 2025 | DescriptiveLarge Language Model | CodeCode Available | 0 |
| UAKNN: Label Distribution Learning via Uncertainty-Aware KNN | Apr 2, 2025 | Descriptive | —Unverified | 0 |
| MolGround: A Benchmark for Molecular Grounding | Mar 31, 2025 | Descriptive | —Unverified | 0 |
| Enhancing Learnable Descriptive Convolutional Vision Transformer for Face Anti-Spoofing | Mar 29, 2025 | DescriptiveDomain Generalization | CodeCode Available | 0 |
| Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Mar 28, 2025 | DescriptiveImage Quality Assessment | CodeCode Available | 2 |
| Concept Map Assessment Through Structure Classification | Mar 26, 2025 | ClassificationDescriptive | —Unverified | 0 |
| Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations | Mar 26, 2025 | DescriptiveText-to-Video Generation | CodeCode Available | 0 |
| BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation | Mar 26, 2025 | DescriptiveImage Generation | —Unverified | 0 |
| Cross-Modal Prototype Allocation: Unsupervised Slide Representation Learning via Patch-Text Contrast in Computational Pathology | Mar 26, 2025 | DescriptiveLarge Language Model | —Unverified | 0 |
| MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion | Mar 26, 2025 | DescriptiveRetrieval | —Unverified | 0 |
| Why Representation Engineering Works: A Theoretical and Empirical Study in Vision-Language Models | Mar 25, 2025 | DescriptiveFairness | —Unverified | 0 |
| GOAL: Global-local Object Alignment Learning | Mar 22, 2025 | DescriptiveObject | CodeCode Available | 1 |
| Clearing Sections of Lattice Liability Networks | Mar 22, 2025 | Descriptive | —Unverified | 0 |
| Income Inequality, Food Aid, and 'Zero Hunger': Evaluating Effectiveness During Lula's Administration | Mar 20, 2025 | Descriptive | —Unverified | 0 |
| LLM-Aided Customizable Profiling of Code Data Based On Programming Language Concepts | Mar 19, 2025 | Code GenerationData Valuation | —Unverified | 0 |
| A Language Vision Model Approach for Automated Tumor Contouring in Radiation Oncology | Mar 19, 2025 | Descriptive | —Unverified | 0 |
| Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and Tracking | Mar 18, 2025 | DescriptiveInstance Segmentation | CodeCode Available | 0 |
| Organ-aware Multi-scale Medical Image Segmentation Using Text Prompt Engineering | Mar 18, 2025 | BenchmarkingDescriptive | —Unverified | 0 |
| Longitudinal Impact of Tobacco Use and Social Determinants on Respiratory Health Disparities Among Louisiana Medicaid Enrollees | Mar 15, 2025 | Descriptive | —Unverified | 0 |
| The Status Quo and Future of AI-TPACK for Mathematics Teacher Education Students: A Case Study in Chinese Universities | Mar 15, 2025 | Descriptive | —Unverified | 0 |
| Visual Polarization Measurement Using Counterfactual Image Generation | Mar 13, 2025 | counterfactualDescriptive | —Unverified | 0 |
| MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis | Mar 13, 2025 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Semantic Latent Motion for Portrait Video Generation | Mar 13, 2025 | DescriptiveVideo Generation | —Unverified | 0 |
| Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion | Mar 12, 2025 | DescriptiveImage Generation | —Unverified | 0 |
| Teaching LMMs for Image Quality Scoring and Interpreting | Mar 12, 2025 | DescriptiveImage Quality Assessment | CodeCode Available | 2 |
| Power Spectrum Signatures of Graphs | Mar 12, 2025 | DescriptiveGraph Regression | —Unverified | 0 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | DenoisingDescriptive | CodeCode Available | 1 |
| Generative AI in Transportation Planning: A Survey | Mar 10, 2025 | Demand ForecastingDescriptive | —Unverified | 0 |
| Towards Understanding the Use of MLLM-Enabled Applications for Visual Interpretation by Blind and Low Vision People | Mar 7, 2025 | Descriptive | —Unverified | 0 |