SOTAVerified

Descriptive

Papers

Showing 51100 of 1477 papers

TitleStatusHype
Hybrid Symbolic-Numeric Library for Power System Modeling and AnalysisCode1
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language ModelsCode1
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of PneumothoraxCode1
Conditional Generative Adversarial NetsCode1
Contrastive Learning and Mixture of Experts Enables Precise Vector EmbeddingsCode1
HDCC: A Hyperdimensional Computing compiler for classification on embedded systems and high-performance computingCode1
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language NavigationCode1
Confidence-aware Pseudo-label Learning for Weakly Supervised Visual GroundingCode1
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic SegmentationCode1
Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person SearchCode1
Contrastive Audio-Language Learning for MusicCode1
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language GenerationCode1
HYDRA: A multimodal deep learning framework for malware classificationCode1
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text SupervisionCode1
Graph BackdoorCode1
GraphLIME: Local Interpretable Model Explanations for Graph Neural NetworksCode1
A Bi-directional Transformer for Musical Chord RecognitionCode1
A Fine-tuning Dataset and Benchmark for Large Language Models for Protein UnderstandingCode1
GraphXAIN: Narratives to Explain Graph Neural NetworksCode1
CiteTracker: Correlating Image and Text for Visual TrackingCode1
Generating Parametric BRDFs from Natural Language DescriptionsCode1
GL-RG: Global-Local Representation Granularity for Video CaptioningCode1
Contrastive Learning of Medical Visual Representations from Paired Images and TextCode1
Causal Modeling of Twitter Activity During COVID-19Code1
A Convolutional Attention Network for Extreme Summarization of Source CodeCode1
What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable InsightsCode1
GOAL: Global-local Object Alignment LearningCode1
IDAS: Intent Discovery with Abstractive SummarizationCode1
FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font ApplicationsCode1
Can Knowledge Graphs Simplify Text?Code1
From Artificially Real to Real: Leveraging Pseudo Data from Large Language Models for Low-Resource Molecule DiscoveryCode1
First Steps of an Approach to the ARC Challenge based on Descriptive Grid Models and the Minimum Description Length PrincipleCode1
Finetune like you pretrain: Improved finetuning of zero-shot vision modelsCode1
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-AnsweringCode1
A Visual Analytics Framework for Explaining and Diagnosing Transfer Learning ProcessesCode1
A Variational Algorithm for Quantum Neural NetworksCode1
Zero-Shot Compositional Policy Learning via Language GroundingCode1
Automatic Generation of Topic LabelsCode1
Beyond Co-occurrence: Multi-modal Session-based RecommendationCode1
Bias Loss for Mobile Neural NetworksCode1
Natural scene reconstruction from fMRI signals using generative latent diffusionCode1
A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D FacesCode1
FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language ModelsCode1
FlexConv: Continuous Kernel Convolutions with Differentiable Kernel SizesCode1
FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis AssistantCode1
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as PromptsCode1
Comprehensive Information Integration Modeling Framework for Video TitlingCode1
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous DrivingCode1
Generating images from caption and vice versa via CLIP-Guided Generative Latent Space SearchCode1
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language UnderstandingCode1
Show:102550
← PrevPage 2 of 30Next →

No leaderboard results yet.