SOTAVerified

Descriptive

Papers

Showing 326350 of 1477 papers

TitleStatusHype
User-Friendly Customized Generation with Multi-Modal PromptsCode1
Benchmarking Hierarchical Image Pyramid Transformer for the classification of colon biopsies and polyps in histopathology images0
Composed Image Retrieval for Remote SensingCode2
Boosting Medical Image-based Cancer Detection via Text-guided Supervision from Reports0
Accelerated Evaluation of Ollivier-Ricci Curvature Lower Bounds: Bridging Theory and Computation0
Peripheral Nervous System Responses to Food Stimuli: Analysis Using Data Science Approaches0
Could a Computer Architect Understand our Brain?0
Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence0
A Deep Learning Approach to Heterogeneous Consumer Aesthetics in Retail Fashion0
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots0
Analysis and prevention of AI-based phishing email attacks0
Remote Diffusion0
Time Series Stock Price Forecasting Based on Genetic Algorithm (GA)-Long Short-Term Memory Network (LSTM) Optimization0
Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large ModelsCode1
SkelCap: Automated Generation of Descriptive Text from Skeleton Keypoint Sequences0
FITA: Fine-grained Image-Text Aligner for Radiology Report Generation0
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions0
Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model0
Análise de ambiguidade linguística em modelos de linguagem de grande escala (LLMs)0
Aligning LLM Agents by Learning Latent Preference from User EditsCode1
A Survey of Decomposition-Based Evolutionary Multi-Objective Optimization: Part II -- A Data Science Perspective0
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images0
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image SynthesisCode0
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity RepresentationCode3
TrafficVLM: A Controllable Visual Language Model for Traffic Video CaptioningCode2
Show:102550
← PrevPage 14 of 60Next →

No leaderboard results yet.