SOTAVerified

Descriptive

Papers

Showing 151200 of 1477 papers

TitleStatusHype
Comprehensive Information Integration Modeling Framework for Video TitlingCode1
ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of PneumothoraxCode1
Hybrid Symbolic-Numeric Library for Power System Modeling and AnalysisCode1
A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive PropertiesCode1
IDAS: Intent Discovery with Abstractive SummarizationCode1
Conditional Generative Adversarial NetsCode1
A Sketch-Based Neural Model for Generating Commit Messages from DiffsCode1
Confidence-aware Pseudo-label Learning for Weakly Supervised Visual GroundingCode1
Enhancing Monocular 3D Scene Completion with Diffusion ModelCode1
Contrastive Learning and Mixture of Experts Enables Precise Vector EmbeddingsCode1
Contrastive Learning of Medical Visual Representations from Paired Images and TextCode1
Controlling Latent Diffusion Using Latent CLIPCode1
MORE: Multi-Order RElation Mining for Dense Captioning in 3D ScenesCode1
Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large ModelsCode1
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractionsCode1
MultiFace: A Generic Training Mechanism for Boosting Face Recognition PerformanceCode1
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech SystemsCode1
High-Fidelity 3D Face Generation from Natural Language DescriptionsCode1
Can Knowledge Graphs Simplify Text?Code1
Dataset Distillation via Vision-Language Category PrototypeCode1
Deep learning based geometric registration for medical images: How accurate can we get without visual features?Code1
Navigating Knowledge Management Implementation Success in Government Organizations: A type-2 fuzzy approachCode1
Deep Graph Matching under Quadratic ConstraintCode1
Deep Implicit Statistical Shape Models for 3D Medical Image DelineationCode1
Can Machines Learn Morality? The Delphi ExperimentCode1
Dual-Level Collaborative Transformer for Image CaptioningCode1
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language GenerationCode1
On the descriptive power of LiDAR intensity images for segment-based loop closing in 3-D SLAMCode1
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language ModelsCode1
Describe Anything Model for Visual Question Answering on Text-rich ImagesCode1
A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D FacesCode1
HDCC: A Hyperdimensional Computing compiler for classification on embedded systems and high-performance computingCode1
DFR: Deep Feature Reconstruction for Unsupervised Anomaly SegmentationCode1
Human-like Controllable Image Captioning with Verb-specific Semantic RolesCode1
IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic RepresentationsCode1
Distilling BlackBox to Interpretable models for Efficient Transfer LearningCode1
GraphLIME: Local Interpretable Model Explanations for Graph Neural NetworksCode1
DOBF: A Deobfuscation Pre-Training Objective for Programming LanguagesCode1
Bias Loss for Mobile Neural NetworksCode1
GraphXAIN: Narratives to Explain Graph Neural NetworksCode1
GOAL: Global-local Object Alignment LearningCode1
Beyond Co-occurrence: Multi-modal Session-based RecommendationCode1
Graph BackdoorCode1
Zero-Shot Compositional Policy Learning via Language GroundingCode1
A Visual Analytics Framework for Explaining and Diagnosing Transfer Learning ProcessesCode1
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as PromptsCode1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music RetrievalCode1
Causal Modeling of Twitter Activity During COVID-19Code1
SdAE: Self-distillated Masked AutoencoderCode1
Automatic Generation of Topic LabelsCode1
Show:102550
← PrevPage 4 of 30Next →

No leaderboard results yet.