Doc2Query--: When Less is More Jan 9, 2023 Hallucination Retrieval
Code Code Available 1Why do Nearest Neighbor Language Models Work? Jan 7, 2023 Retrieval
Code Code Available 1You Truly Understand What I Need: Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona Jan 6, 2023 Hallucination Language Modeling
Code Code Available 1Learning Semantic Relationship Among Instances for Image-Text Matching Jan 1, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 1M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis Jan 1, 2023 Articles Document Layout Analysis
Code Code Available 1Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval Jan 1, 2023 Diversity Object
Code Code Available 1Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval Jan 1, 2023 Knowledge Distillation Language Modelling
Code Code Available 1LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval Jan 1, 2023 image-classification Image Classification
Code Code Available 1Divide&Classify: Fine-Grained Classification for City-Wide Visual Geo-Localization Jan 1, 2023 geo-localization Image Retrieval
Code Code Available 1Unsupervised Feature Representation Learning for Domain-generalized Cross-domain Image Retrieval Jan 1, 2023 Contrastive Learning Image Retrieval
Code Code Available 1Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network Jan 1, 2023 Image-text matching Retrieval
Code Code Available 1R2Former: Unified Retrieval and Reranking Transformer for Place Recognition Jan 1, 2023 Feature Correlation Reranking
Code Code Available 1Towards Modality-Agnostic Person Re-Identification With Descriptive Query Jan 1, 2023 Descriptive Person Re-Identification
Code Code Available 1RONO: Robust Discriminative Learning With Noisy Labels for 2D-3D Cross-Modal Retrieval Jan 1, 2023 Cross-Modal Retrieval Learning with noisy labels
Code Code Available 1Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning Jan 1, 2023 Active Learning Moment Retrieval
Code Code Available 1Modeling Video As Stochastic Processes for Fine-Grained Video Representation Learning Jan 1, 2023 Contrastive Learning Representation Learning
Code Code Available 1Revisiting Self-Similarity: Structural Embedding for Image Retrieval Jan 1, 2023 Image Retrieval Retrieval
Code Code Available 1Rethinking with Retrieval: Faithful Large Language Model Inference Dec 31, 2022 Language Modeling Language Modelling
Code Code Available 1HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images Dec 30, 2022 Image Retrieval Retrieval
Code Code Available 1TempCLR: Temporal Alignment Representation with Contrastive Learning Dec 28, 2022 Action Recognition Contrastive Learning
Code Code Available 1MVTN: Learning Multi-View Transformations for 3D Understanding Dec 27, 2022 3D Classification 3D Shape Classification
Code Code Available 1Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning Dec 27, 2022 Image Captioning Image Retrieval
Code Code Available 1Multi-queue Momentum Contrast for Microvideo-Product Retrieval Dec 22, 2022 Representation Learning Retrieval
Code Code Available 1Multi-hop Evidence Retrieval for Cross-document Relation Extraction Dec 21, 2022 Relation Relation Extraction
Code Code Available 1Parallel Context Windows for Large Language Models Dec 21, 2022 In-Context Learning Playing the Game of 2048
Code Code Available 1Data Curation Alone Can Stabilize In-context Learning Dec 20, 2022 Diversity In-Context Learning
Code Code Available 1When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories Dec 20, 2022 Knowledge Probing Memorization
Code Code Available 1SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes Dec 19, 2022 Dialogue Generation Machine Translation
Code Code Available 1Query-as-context Pre-training for Dense Passage Retrieval Dec 19, 2022 Contrastive Learning Passage Retrieval
Code Code Available 1Position-guided Text Prompt for Vision-Language Pre-training Dec 19, 2022 Cross-Modal Retrieval Image Captioning
Code Code Available 1Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model Dec 18, 2022 Language Modeling Language Modelling
Code Code Available 1Attentive Mask CLIP Dec 16, 2022 Contrastive Learning Retrieval
Code Code Available 1Self-Prompting Large Language Models for Zero-Shot Open-Domain QA Dec 16, 2022 In-Context Learning Open-Domain Question Answering
Code Code Available 1Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation Dec 16, 2022 Answer Generation Decoder
Code Code Available 1MAViL: Masked Audio-Video Learners Dec 15, 2022 Contrastive Learning Retrieval
Code Code Available 1Unsupervised Object Localization: Observing the Background to Discover Objects Dec 15, 2022 Instance Segmentation Object
Code Code Available 1FlexiViT: One Model for All Patch Sizes Dec 15, 2022 All Image-text Retrieval
Code Code Available 1Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift Dec 15, 2022 Benchmarking Image Captioning
Code Code Available 1EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries Dec 14, 2022 3D Reconstruction Object
Code Code Available 1Reproducible scaling laws for contrastive language-image learning Dec 14, 2022 Image Classification Open Vocabulary Attribute Detection
Code Code Available 1LidarCLIP or: How I Learned to Talk to Point Clouds Dec 13, 2022 Image Generation Retrieval
Code Code Available 1CREPE: Can Vision-Language Foundation Models Reason Compositionally? Dec 13, 2022 Image Retrieval Negation
Code Code Available 1In Defense of Cross-Encoders for Zero-Shot Retrieval Dec 12, 2022 Retrieval
Code Code Available 1VindLU: A Recipe for Effective Video-and-Language Pretraining Dec 9, 2022 Question Answering Retrieval
Code Code Available 1Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval Dec 8, 2022 Cross-Modal Retrieval Food Recognition
Code Code Available 1DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset Dec 8, 2022 Diversity Image Description
Code Code Available 1FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation Dec 7, 2022 Motion Synthesis Retrieval
Code Code Available 1A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval Dec 6, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 1Neural Machine Translation with Contrastive Translation Memories Dec 6, 2022 Contrastive Learning Machine Translation
Code Code Available 1Hierarchical Contrast for Unsupervised Skeleton-based Action Representation Learning Dec 5, 2022 Action Recognition Few-Shot Skeleton-Based Action Recognition
Code Code Available 1