SOTAVerified

Descriptive

Papers

Showing 301350 of 1477 papers

TitleStatusHype
Large-scale Multi-granular Concept Extraction Based on Machine Reading ComprehensionCode0
Language-Driven Interactive Shadow DetectionCode0
KCluster: An LLM-based Clustering Approach to Knowledge Component DiscoveryCode0
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language ContextsCode0
Journalistic Guidelines Aware News Image CaptioningCode0
Audio Large Language Models Can Be Descriptive Speech Quality EvaluatorsCode0
A Multimodal PDE Foundation Model for Prediction and Scientific Text DescriptionsCode0
A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT SegmentationCode0
Attribute-based Visual Reprogramming for Image Classification with CLIPCode0
Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor SystemCode0
Investigating Annotator Bias in Large Language Models for Hate Speech DetectionCode0
Less Descriptive yet Discriminative: Quantifying the Properties of Multimodal Referring Utterances via CLIPCode0
Inferencing Based on Unsupervised Learning of Disentangled RepresentationsCode0
Incorporating Figure Captions and Descriptive Text in MeSH Term IndexingCode0
Attend to You: Personalized Image Captioning with Context Sequence Memory NetworksCode0
Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion RecognitionCode0
Improving LSTM-based Video Description with Linguistic Knowledge Mined from TextCode0
In-Contextual Gender Bias Suppression for Large Language ModelsCode0
Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video CaptioningCode0
Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part ICode0
Contextualized Scene Imagination for Generative Commonsense ReasoningCode0
Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for MarketingCode0
Improve Machine Learning carbon footprint using Parquet dataset format and Mixed Precision training for regression models -- Part IICode0
Integrating Statistical Significance and Discriminative Power in Pattern DiscoveryCode0
Hierarchical Context-aware Network for Dense Video Event CaptioningCode0
Harnessing the Power of Prompt-based Techniques for Generating School-Level Questions using Large Language ModelsCode0
HICEScore: A Hierarchical Metric for Image Captioning EvaluationCode0
A Machine Learning Approach to Classifying Construction Cost Documents into the International Construction Measurement StandardCode0
GRIF-DM: Generation of Rich Impression Fonts using Diffusion ModelsCode0
Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic ScenariosCode0
How Contentious Terms About People and Cultures are Used in Linked Open DataCode0
Conditional Image Generation with PixelCNN DecodersCode0
Graph Representation Learning for Road Type ClassificationCode0
Hierarchical Deep Multi-modal Network for Medical Visual Question AnsweringCode0
Graphite: GRAPH-Induced feaTure Extraction for Point Cloud RegistrationCode0
Glocal Explanations of Expected Goal Models in SoccerCode0
Good News, Everyone! Context driven entity-aware captioning for news imagesCode0
Greedy Search for Descriptive Spatial Face FeaturesCode0
H-RANSAC, an algorithmic variant for Homography image transform from featureless point sets: application to video-based football analyticsCode0
Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based ChatbotsCode0
IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-trainingCode0
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-ThoughtCode0
Fusing Interpretable Knowledge of Neural Network Learning Agents For Swarm-GuidanceCode0
Assay2Mol: large language model-based drug design using BioAssay contextCode0
A Semi-Synthetic Dataset Generation Framework for Causal Inference in Recommender SystemsCode0
Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person SearchCode0
From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive GrammarsCode0
Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLMCode0
Incremental Residual Concept Bottleneck ModelsCode0
CoLMbo: Speaker Language Model for Descriptive ProfilingCode0
Show:102550
← PrevPage 7 of 30Next →

No leaderboard results yet.