SOTAVerified

Descriptive

Papers

Showing 151–200 of 1477 papers

Title | Status | Hype
Can Machines Learn Morality? The Delphi Experiment | Code | 1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval | Code | 1
A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) | Code | 1
A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties | Code | 1
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions | Code | 1
Field Convolutions for Surface CNNs | Code | 1
Controlling Latent Diffusion Using Latent CLIP | Code | 1
Finetune like you pretrain: Improved finetuning of zero-shot vision models | Code | 1
CTRLsum: Towards Generic Controllable Text Summarization | Code | 1
FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Code | 1
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering | Code | 1
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings | Code | 1
Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search | Code | 1
Generating Parametric BRDFs from Natural Language Descriptions | Code | 1
GOAL: Global-local Object Alignment Learning | Code | 1
Graph Backdoor | Code | 1
Contrastive Audio-Language Learning for Music | Code | 1
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation | Code | 1
Contrastive Learning of Medical Visual Representations from Paired Images and Text | Code | 1
HDCC: A Hyperdimensional Computing compiler for classification on embedded systems and high-performance computing | Code | 1
Human-like Controllable Image Captioning with Verb-specific Semantic Roles | Code | 1
Hybrid Symbolic-Numeric Library for Power System Modeling and Analysis | Code | 1
IDAS: Intent Discovery with Abstractive Summarization | Code | 1
TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Code | 1
Conditional Generative Adversarial Nets | Code | 1
Comprehensive Information Integration Modeling Framework for Video Titling | Code | 1
Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding | Code | 1
LaMOT: Language-Guided Multi-Object Tracking | Code | 1
CiteTracker: Correlating Image and Text for Visual Tracking | Code | 1
Learning to Color from Language | Code | 1
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Code | 1
Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning | Code | 1
Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training | Code | 1
Mixture of Low-rank Experts for Transferable AI-Generated Image Detection | Code | 1
MMPD: Multi-Domain Mobile Video Physiology Dataset | Code | 1
Möbius Convolutions for Spherical CNNs | Code | 1
Beyond Co-occurrence: Multi-modal Session-based Recommendation | Code | 1
Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models | Code | 1
MultiFace: A Generic Training Mechanism for Boosting Face Recognition Performance | Code | 1
Multi-Grained Multimodal Interaction Network for Entity Linking | Code | 1
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving | Code | 1
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax | Code | 1
NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation | Code | 1
NLQuAD: A Non-Factoid Long Question Answering Data Set | Code | 1
Can Knowledge Graphs Simplify Text? | Code | 1
OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Code | 1
A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D Faces | Code | 1
Predicting emotion from music videos: exploring the relative contribution of visual and auditory information to affective responses | Code | 1
Causal Modeling of Twitter Activity During COVID-19 | Code | 1
A Variational Algorithm for Quantum Neural Networks | Code | 1

No leaderboard results yet.