| SS-GEN: A Social Story Generation Framework with Large Language Models | Jun 22, 2024 | DescriptiveStory Generation | CodeCode Available | 0 |
| Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics | Jun 20, 2024 | 8kDescriptive | —Unverified | 0 |
| From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment | Jun 20, 2024 | DescriptiveHallucination | —Unverified | 0 |
| Learning telic-controllable state representations | Jun 20, 2024 | DescriptiveRepresentation Learning | —Unverified | 0 |
| Mining United Nations General Assembly Debates | Jun 19, 2024 | DescriptiveSentiment Analysis | CodeCode Available | 0 |
| Automatically Generating Narrative-Style Radiology Reports from Volumetric CT Images; a Proof of Concept | Jun 18, 2024 | Descriptive | CodeCode Available | 0 |
| Navigating Knowledge Management Implementation Success in Government Organizations: A type-2 fuzzy approach | Jun 18, 2024 | DescriptiveManagement | CodeCode Available | 1 |
| MedCalc-Bench: Evaluating Large Language Models for Medical Calculations | Jun 17, 2024 | DescriptiveMedical Diagnosis | CodeCode Available | 2 |
| Investigating Annotator Bias in Large Language Models for Hate Speech Detection | Jun 17, 2024 | DescriptiveHate Speech Detection | CodeCode Available | 0 |
| Selecting Interpretability Techniques for Healthcare Machine Learning models | Jun 14, 2024 | DescriptiveInterpretable Machine Learning | —Unverified | 0 |
| Neural Concept Binder | Jun 14, 2024 | DescriptiveRetrieval | CodeCode Available | 1 |
| LaMOT: Language-Guided Multi-Object Tracking | Jun 12, 2024 | DescriptiveMulti-Object Tracking | CodeCode Available | 1 |
| Research Trends for the Interplay between Large Language Models and Knowledge Graphs | Jun 12, 2024 | DescriptiveKnowledge Graphs | —Unverified | 0 |
| From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition | Jun 12, 2024 | DescriptiveVisual Social Relationship Recognition | —Unverified | 0 |
| RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Jun 11, 2024 | AI AgentDescriptive | CodeCode Available | 2 |
| A Fine-tuning Dataset and Benchmark for Large Language Models for Protein Understanding | Jun 8, 2024 | DescriptiveLanguage Modelling | CodeCode Available | 1 |
| Evaluating and Mitigating IP Infringement in Visual Generative AI | Jun 7, 2024 | Descriptive | CodeCode Available | 0 |
| Multiple-input, multiple-output modal testing of a Hawk T1A aircraft: A new full-scale dataset for structural health monitoring | Jun 7, 2024 | DescriptiveStructural Health Monitoring | —Unverified | 0 |
| Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data | May 31, 2024 | DenoisingDescriptive | —Unverified | 0 |
| What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights | May 31, 2024 | DescriptiveSelf-Supervised Learning | CodeCode Available | 1 |
| Soft Partitioning of Latent Space for Semantic Channel Equalization | May 30, 2024 | DecoderDescriptive | —Unverified | 0 |
| VAAD: Visual Attention Analysis Dashboard applied to e-Learning | May 30, 2024 | Descriptive | —Unverified | 0 |
| A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation | May 29, 2024 | Autonomous DrivingBoundary Detection | CodeCode Available | 1 |
| Descriptive Image Quality Assessment in the Wild | May 29, 2024 | DescriptiveImage Quality Assessment | CodeCode Available | 3 |
| LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence | May 27, 2024 | Decision MakingDescriptive | —Unverified | 0 |
| User-Friendly Customized Generation with Multi-Modal Prompts | May 26, 2024 | DescriptiveImage Generation | CodeCode Available | 1 |
| Benchmarking Hierarchical Image Pyramid Transformer for the classification of colon biopsies and polyps in histopathology images | May 24, 2024 | BenchmarkingClassification | —Unverified | 0 |
| Composed Image Retrieval for Remote Sensing | May 24, 2024 | Composed Image Retrieval (CoIR)Descriptive | CodeCode Available | 2 |
| Boosting Medical Image-based Cancer Detection via Text-guided Supervision from Reports | May 23, 2024 | Clinical KnowledgeDescriptive | —Unverified | 0 |
| Accelerated Evaluation of Ollivier-Ricci Curvature Lower Bounds: Bridging Theory and Computation | May 22, 2024 | Descriptive | —Unverified | 0 |
| Peripheral Nervous System Responses to Food Stimuli: Analysis Using Data Science Approaches | May 21, 2024 | DescriptiveSubgroup Discovery | —Unverified | 0 |
| Could a Computer Architect Understand our Brain? | May 21, 2024 | DescriptiveERP | —Unverified | 0 |
| Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence | May 17, 2024 | Descriptive | —Unverified | 0 |
| A Deep Learning Approach to Heterogeneous Consumer Aesthetics in Retail Fashion | May 17, 2024 | Deep LearningDescriptive | —Unverified | 0 |
| Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots | May 13, 2024 | Code GenerationDescriptive | —Unverified | 0 |
| Analysis and prevention of AI-based phishing email attacks | May 8, 2024 | Descriptive | —Unverified | 0 |
| Remote Diffusion | May 7, 2024 | DescriptiveRAG | —Unverified | 0 |
| Time Series Stock Price Forecasting Based on Genetic Algorithm (GA)-Long Short-Term Memory Network (LSTM) Optimization | May 6, 2024 | DescriptiveStock Price Prediction | —Unverified | 0 |
| Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models | May 5, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| SkelCap: Automated Generation of Descriptive Text from Skeleton Keypoint Sequences | May 5, 2024 | Descriptive | —Unverified | 0 |
| FITA: Fine-grained Image-Text Aligner for Radiology Report Generation | May 2, 2024 | DescriptiveTriplet | —Unverified | 0 |
| CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions | May 1, 2024 | DescriptiveLanguage Modeling | —Unverified | 0 |
| Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Apr 30, 2024 | DescriptiveGesture Generation | —Unverified | 0 |
| Análise de ambiguidade linguística em modelos de linguagem de grande escala (LLMs) | Apr 25, 2024 | Descriptive | —Unverified | 0 |
| Aligning LLM Agents by Learning Latent Preference from User Edits | Apr 23, 2024 | DescriptiveLanguage Modelling | CodeCode Available | 1 |
| A Survey of Decomposition-Based Evolutionary Multi-Objective Optimization: Part II -- A Data Science Perspective | Apr 22, 2024 | AnatomyDescriptive | —Unverified | 0 |
| Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images | Apr 21, 2024 | Descriptive | —Unverified | 0 |
| ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis | Apr 15, 2024 | DescriptiveImage Captioning | CodeCode Available | 0 |
| Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation | Apr 15, 2024 | Contrastive LearningDescriptive | CodeCode Available | 3 |
| TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning | Apr 14, 2024 | Dense Video CaptioningDescriptive | CodeCode Available | 2 |