SOTAVerified

Descriptive

Papers

Showing 13011350 of 1477 papers

TitleStatusHype
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted CaptionsCode0
DYAD: A Descriptive Yet Abjuring Density efficient approximation to linear neural network layersCode0
Hierarchical Context-aware Network for Dense Video Event CaptioningCode0
HICEScore: A Hierarchical Metric for Image Captioning EvaluationCode0
Uninformed Students: Student-Teacher Anomaly Detection with Discriminative Latent EmbeddingsCode0
Harnessing the Power of Prompt-based Techniques for Generating School-Level Questions using Large Language ModelsCode0
SEKE: Specialised Experts for Keyword ExtractionCode0
Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic ScenariosCode0
Language-Driven Interactive Shadow DetectionCode0
GRIF-DM: Generation of Rich Impression Fonts using Diffusion ModelsCode0
A Hierarchical Approach for Generating Descriptive Image ParagraphsCode0
Self-attention on Multi-Shifted Windows for Scene SegmentationCode0
DuoRC: Towards Complex Language Understanding with Paraphrased Reading ComprehensionCode0
Open Digital Rights Enforcement Framework (ODRE): from descriptive to enforceable policiesCode0
Large-scale Multi-granular Concept Extraction Based on Machine Reading ComprehensionCode0
Self-optimizing Feature Generation via Categorical Hashing Representation and Hierarchical Reinforcement CrossingCode0
A Graph Theoretic Approach for Object Shape Representation in Compositional Hierarchies Using a Hybrid Generative-Descriptive ModelCode0
Self-supervised Product Quantization for Deep Unsupervised Image RetrievalCode0
Collaborative Auto-encoding for Blind Image Quality AssessmentCode0
Bounding and Approximating Intersectional Fairness through Marginal FairnessCode0
VREN: Volleyball Rally Dataset with Expression Notation LanguageCode0
Greedy Search for Descriptive Spatial Face FeaturesCode0
Dropout Concrete Autoencoder for Band Selection on HSI ScenesCode0
Overcoming the Identity Mapping Problem in Self-Supervised Hyperspectral Anomaly DetectionCode0
Learning Deep Features for One-Class ClassificationCode0
Boosting Audio-visual Zero-shot Learning with Large Language ModelsCode0
Learning Efficient Representations of Neutrino Telescope EventsCode0
Learning English with Peppa PigCode0
Overview of PicTropes, a film trope datasetCode0
A Neural Topical Expansion Framework for Unstructured Persona-oriented Dialogue GenerationCode0
Temporal and Semantic Evaluation Metrics for Foundation Models in Post-Hoc Analysis of Robotic Sub-tasksCode0
Automated Image Captioning with CNNs and TransformersCode0
Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL ChallengeCode0
Low-Rank Subspace Override for Unsupervised Domain AdaptationCode0
ANEA: Automated (Named) Entity Annotation for German Domain-Specific TextsCode0
Audio Large Language Models Can Be Descriptive Speech Quality EvaluatorsCode0
Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature AlignmentCode0
Graph Representation Learning for Road Type ClassificationCode0
Semi-supervised multimodal coreference resolution in image narrationsCode0
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMsCode0
Attribute-based Visual Reprogramming for Image Classification with CLIPCode0
Graphite: GRAPH-Induced feaTure Extraction for Point Cloud RegistrationCode0
SemStyle: Learning to Generate Stylised Image Captions using Unaligned TextCode0
Less Descriptive yet Discriminative: Quantifying the Properties of Multimodal Referring Utterances via CLIPCode0
Attend to You: Personalized Image Captioning with Context Sequence Memory NetworksCode0
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-ThoughtCode0
CoinMath: Harnessing the Power of Coding Instruction for Math LLMsCode0
Good News, Everyone! Context driven entity-aware captioning for news imagesCode0
Picture It In Your Mind: Generating High Level Visual Representations From Textual DescriptionsCode0
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and TrackingCode0
Show:102550
← PrevPage 27 of 30Next →

No leaderboard results yet.