Attentive Mask CLIP | Dec 16, 2022 | Contrastive Learning Retrieval | Code | Code Available | 1
HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval | Dec 16, 2022 | Image-text Retrieval Retrieval | — | Unverified | 0
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation | Dec 16, 2022 | Answer Generation Decoder | Code | Code Available | 1
MAViL: Masked Audio-Video Learners | Dec 15, 2022 | Contrastive Learning Retrieval | Code | Code Available | 1
Political and Economic Patterns in COVID-19 News: From Lockdown to Vaccination | Dec 15, 2022 | Articles Information Retrieval | — | Unverified | 0
MASTER: Multi-task Pre-trained Bottlenecked Masked Autoencoders are Better Dense Retrievers | Dec 15, 2022 | Decoder Passage Retrieval | — | Unverified | 0
FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference | Dec 15, 2022 | Decoder Language Modeling | — | Unverified | 0
Visually-augmented pretrained language models for NLP tasks without images | Dec 15, 2022 | Retrieval | Code | Code Available | 0
You were saying? - Spoken Language in the V3C Dataset | Dec 15, 2022 | Retrieval Video Retrieval | Code | Code Available | 0
DeepJoin: Joinable Table Discovery with Pre-trained Language Models | Dec 15, 2022 | Data Augmentation GPU | — | Unverified | 0
Writer Retrieval and Writer Identification in Greek Papyri | Dec 15, 2022 | Binarization Retrieval | — | Unverified | 0
CLIPPO: Image-and-Language Understanding from Pixels Only | Dec 15, 2022 | Contrastive Learning image-classification | — | Unverified | 0
Retrieval-based Disentangled Representation Learning with Natural Language Supervision | Dec 15, 2022 | Cross-Modal Retrieval Disentanglement | — | Unverified | 0
FlexiViT: One Model for All Patch Sizes | Dec 15, 2022 | All Image-text Retrieval | Code | Code Available | 1
Unsupervised Object Localization: Observing the Background to Discover Objects | Dec 15, 2022 | Instance Segmentation Object | Code | Code Available | 1
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift | Dec 15, 2022 | Benchmarking Image Captioning | Code | Code Available | 1
The Infinite Index: Information Retrieval on Generative Text-To-Image Models | Dec 14, 2022 | Active Learning Game Design | — | Unverified | 0
Reproducible scaling laws for contrastive language-image learning | Dec 14, 2022 | Image Classification Open Vocabulary Attribute Detection | Code | Code Available | 1
DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog | Dec 14, 2022 | Retrieval | — | Unverified | 0
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries | Dec 14, 2022 | 3D Reconstruction Object | Code | Code Available | 1
Pre-trained Language Models Can be Fully Zero-Shot Learners | Dec 14, 2022 | Retrieval text-classification | Code | Code Available | 0
Explainability of Text Processing and Retrieval Methods: A Critical Survey | Dec 14, 2022 | Document Ranking Information Retrieval | — | Unverified | 0
NLIP: Noise-robust Language-Image Pre-training | Dec 14, 2022 | Image Captioning Image-text Retrieval | — | Unverified | 0
Attentive Deep Neural Networks for Legal Document Retrieval | Dec 13, 2022 | Articles Question Answering | — | Unverified | 0
Learning to Detect Good Keypoints to Match Non-Rigid Objects in RGB Images | Dec 13, 2022 | Keypoint Detection Retrieval | Code | Code Available | 0
CREPE: Can Vision-Language Foundation Models Reason Compositionally? | Dec 13, 2022 | Image Retrieval Negation | Code | Code Available | 1
Domain Adaptation for Dense Retrieval through Self-Supervision by Pseudo-Relevance Labeling | Dec 13, 2022 | Domain Adaptation Information Retrieval | — | Unverified | 0
Auto-labelling of Bug Report using Natural Language Processing | Dec 13, 2022 | Retrieval | — | Unverified | 0
LidarCLIP or: How I Learned to Talk to Point Clouds | Dec 13, 2022 | Image Generation Retrieval | Code | Code Available | 1
Predicting Knowledge Gain for MOOC Video Consumption | Dec 13, 2022 | Feature Importance Retrieval | Code | Code Available | 0
Contextual Explainable Video Representation: Human Perception-based Understanding | Dec 12, 2022 | Action Detection Action Recognition | Code | Code Available | 0
Scale-Semantic Joint Decoupling Network for Image-text Retrieval in Remote Sensing | Dec 12, 2022 | Cross-Modal Retrieval Image-text Retrieval | — | Unverified | 0
In Defense of Cross-Encoders for Zero-Shot Retrieval | Dec 12, 2022 | Retrieval | Code | Code Available | 1
Changes in Power and Information Flow in Resting-state EEG by Working Memory Process | Dec 12, 2022 | EEG Electroencephalogram (EEG) | — | Unverified | 0
The diagnostic utility of endocytoscopy for the detection of esophageal lesions: a systematic review and meta-analysis | Dec 11, 2022 | Diagnostic Retrieval | — | Unverified | 0
SEPT: Towards Scalable and Efficient Visual Pre-Training | Dec 11, 2022 | Retrieval | — | Unverified | 0
Using Multiple Instance Learning to Build Multimodal Representations | Dec 11, 2022 | Contrastive Learning Cross-Modal Retrieval | — | Unverified | 0
Information retrieval in single cell chromatin analysis using TF-IDF transformation methods | Dec 10, 2022 | Dimensionality Reduction Information Retrieval | — | Unverified | 0
LEAD: Liberal Feature-based Distillation for Dense Retrieval | Dec 10, 2022 | Document Ranking Knowledge Distillation | — | Unverified | 0
Natural Logic-guided Autoregressive Multi-hop Document Retrieval for Fact Verification | Dec 10, 2022 | Fact Verification Retrieval | — | Unverified | 0
REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory | Dec 10, 2022 | Image Captioning Language Modeling | Code | Code Available | 0
A Comparison of Audio Preprocessing Techniques and Deep Learning Algorithms for Raga Recognition | Dec 10, 2022 | Audio Signal Processing Information Retrieval | — | Unverified | 0
VindLU: A Recipe for Effective Video-and-Language Pretraining | Dec 9, 2022 | Question Answering Retrieval | Code | Code Available | 1
VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners | Dec 9, 2022 | Question Answering Retrieval | — | Unverified | 0
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset | Dec 8, 2022 | Diversity Image Description | Code | Code Available | 1
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval | Dec 8, 2022 | Cross-Modal Retrieval Food Recognition | Code | Code Available | 1
Group Generalized Mean Pooling for Vision Transformer | Dec 8, 2022 | Image Retrieval Representation Learning | — | Unverified | 0
Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | Dec 7, 2022 | Image Retrieval Retrieval | — | Unverified | 0
FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation | Dec 7, 2022 | Motion Synthesis Retrieval | Code | Code Available | 1
Text Embeddings by Weakly-Supervised Contrastive Pre-training | Dec 7, 2022 | MTEB Benchmark Only Connect Walls Dataset Task 1 (Grouping) | — | Unverified | 0