| MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis | Mar 13, 2025 | Descriptive, Language Modeling | Unverified | 0 |
| Semantic Latent Motion for Portrait Video Generation | Mar 13, 2025 | Descriptive, Video Generation | Unverified | 0 |
| Power Spectrum Signatures of Graphs | Mar 12, 2025 | Descriptive, Graph Regression | Unverified | 0 |
| Zero-Shot Subject-Centric Generation for Creative Application Using Entropy Fusion | Mar 12, 2025 | Descriptive, Image Generation | Unverified | 0 |
| Generative AI in Transportation Planning: A Survey | Mar 10, 2025 | Demand Forecasting, Descriptive | Unverified | 0 |
| Global graph features unveiled by unsupervised geometric deep learning | Mar 7, 2025 | Deep Learning, Descriptive | Unverified | 0 |
| Towards Understanding the Use of MLLM-Enabled Applications for Visual Interpretation by Blind and Low Vision People | Mar 7, 2025 | Descriptive | Unverified | 0 |
| A Benchmark for Multi-Lingual Vision-Language Learning in Remote Sensing Image Captioning | Mar 6, 2025 | Descriptive, Image Captioning | Code Available | 0 |
| Text2Scenario: Text-Driven Scenario Generation for Autonomous Driving Test | Mar 4, 2025 | Autonomous Driving, Descriptive | Unverified | 0 |
| Assessing Large Language Models in Agentic Multilingual National Bias | Feb 25, 2025 | Decision Making, Descriptive | Unverified | 0 |
| Software implemented fault diagnosis of natural gas pumping unit based on feedforward neural network | Feb 25, 2025 | Descriptive, Diagnostic | Unverified | 0 |
| Dataset Featurization: Uncovering Natural Language Features through Unsupervised Data Reconstruction | Feb 24, 2025 | Descriptive | Unverified | 0 |
| CLIP-SENet: CLIP-based Semantic Enhancement Network for Vehicle Re-identification | Feb 24, 2025 | Descriptive, Vehicle Re-Identification | Unverified | 0 |
| Ranking Joint Policies in Dynamic Games using Evolutionary Dynamics | Feb 20, 2025 | Descriptive | Code Available | 0 |
| ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models | Feb 17, 2025 | Code Generation, Descriptive | Code Available | 0 |
| ChordFormer: A Conformer-Based Architecture for Large-Vocabulary Audio Chord Recognition | Feb 17, 2025 | Chord Recognition, Descriptive | Unverified | 0 |
| FE-LWS: Refined Image-Text Representations via Decoder Stacking and Fused Encodings for Remote Sensing Image Captioning | Feb 13, 2025 | Caption Generation, Decoder | Unverified | 0 |
| PathFinder: A Multi-Modal Multi-Agent System for Medical Diagnostic Decision-Making Applied to Histopathology | Feb 13, 2025 | Decision Making, Descriptive | Unverified | 0 |
| A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions | Feb 9, 2025 | Descriptive, Multimodal Deep Learning | Code Available | 0 |
| Augmented Conditioning Is Enough For Effective Training Image Generation | Feb 6, 2025 | Conditional Image Generation, Descriptive | Unverified | 0 |
| Combining physics-based and data-driven models: advancing the frontiers of research with Scientific Machine Learning | Jan 30, 2025 | Descriptive | Unverified | 0 |
| Towards Recommender Systems LLMs Playground (RecSysLLMsP): Exploring Polarization and Engagement in Simulated Social Networks | Jan 29, 2025 | Descriptive, Recommendation Systems | Unverified | 0 |
| Audio Large Language Models Can Be Descriptive Speech Quality Evaluators | Jan 27, 2025 | Descriptive | Code Available | 0 |
| Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge | Jan 27, 2025 | Anomaly Detection, Autonomous Driving | Code Available | 0 |
| Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM | Jan 27, 2025 | Descriptive, Event Detection | Code Available | 0 |