| Metric Learning with Progressive Self-Distillation for Audio-Visual Embedding Learning | Jan 16, 2025 | Metric LearningRepresentation Learning | —Unverified | 0 |
| FARE: A Deep Learning-Based Framework for Radar-based Face Recognition and Out-of-distribution Detection | Jan 14, 2025 | ClassificationFace Recognition | —Unverified | 0 |
| ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training | Jan 13, 2025 | Anomaly DetectionKnowledge Graphs | CodeCode Available | 0 |
| SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval | Jan 12, 2025 | Image RetrievalRetrieval | —Unverified | 0 |
| UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation | Jan 10, 2025 | DecoderGraph Generation | —Unverified | 0 |
| Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work | Jan 7, 2025 | Metric Learningobject-detection | —Unverified | 0 |
| Textualize Visual Prompt for Image Editing via Diffusion Bridge | Jan 7, 2025 | Triplet | —Unverified | 0 |
| Siamese Networks for Cat Re-Identification: Exploring Neural Models for Cat Instance Recognition | Jan 3, 2025 | Image AugmentationTriplet | CodeCode Available | 0 |
| Learning with Noisy Triplet Correspondence for Composed Image Retrieval | Jan 1, 2025 | Image RetrievalRetrieval | —Unverified | 0 |
| Hybrid Reciprocal Transformer with Triplet Feature Alignment for Scene Graph Generation | Jan 1, 2025 | Graph GenerationRelation | —Unverified | 0 |
| Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems | Dec 28, 2024 | Anomaly DetectionBinary Classification | —Unverified | 0 |
| Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification | Dec 28, 2024 | Few-Shot Image ClassificationFew-Shot Learning | —Unverified | 0 |
| Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation | Dec 26, 2024 | Graph GenerationLarge Language Model | —Unverified | 0 |
| SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation | Dec 20, 2024 | Auxiliary LearningImage Segmentation | CodeCode Available | 0 |
| T^3-S2S: Training-free Triplet Tuning for Sketch to Scene Generation | Dec 18, 2024 | Scene GenerationTriplet | CodeCode Available | 0 |
| Suppressing Uncertainty in Gaze Estimation | Dec 17, 2024 | Gaze EstimationPseudo Label | —Unverified | 0 |
| A Contextualized BERT model for Knowledge Graph Completion | Dec 15, 2024 | Knowledge Graph CompletionKnowledge Graphs | —Unverified | 0 |
| VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Dec 15, 2024 | 3D ReconstructionAttribute | —Unverified | 0 |
| Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images | Dec 12, 2024 | 3D Point Cloud Reconstruction3D Reconstruction | —Unverified | 0 |
| Position-aware Guided Point Cloud Completion with CLIP Model | Dec 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures | Dec 7, 2024 | DeepFake DetectionFace Swapping | —Unverified | 0 |
| Compositional Image Retrieval via Instruction-Aware Contrastive Learning | Dec 7, 2024 | Contrastive LearningImage Retrieval | CodeCode Available | 0 |
| Distance-Adaptive Quaternion Knowledge Graph Embedding with Bidirectional Rotation | Dec 5, 2024 | Graph EmbeddingKnowledge Graph Completion | CodeCode Available | 0 |
| End-to-end Triple-domain PET Enhancement: A Hybrid Denoising-and-reconstruction Framework for Reconstructing Standard-dose PET Images from Low-dose PET Sinograms | Dec 4, 2024 | DenoisingTriplet | —Unverified | 0 |
| CLERF: Contrastive LEaRning for Full Range Head Pose Estimation | Dec 3, 2024 | Contrastive LearningHead Pose Estimation | —Unverified | 0 |
| Learning Smooth Distance Functions via Queries | Dec 2, 2024 | Triplet | —Unverified | 0 |
| Node Importance Estimation Leveraging LLMs for Semantic Augmentation in Knowledge Graphs | Nov 30, 2024 | Knowledge GraphsTriplet | CodeCode Available | 0 |
| Train Once for All: A Transitional Approach for Efficient Aspect Sentiment Triplet Extraction | Nov 29, 2024 | AllAspect Sentiment Triplet Extraction | CodeCode Available | 0 |
| Cross-Spectral Attention for Unsupervised RGB-IR Face Verification and Person Re-identification | Nov 28, 2024 | Face VerificationPerson Re-Identification | —Unverified | 0 |
| VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis | Nov 27, 2024 | Human-Object Interaction DetectionImage-text matching | —Unverified | 0 |
| Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | Nov 22, 2024 | Image RetrievalReranking | —Unverified | 0 |
| Leveraging large language models for efficient representation learning for entity resolution | Nov 15, 2024 | BlockingContrastive Learning | —Unverified | 0 |
| Marker-free Human Gait Analysis using a Smart Edge Sensor System | Nov 14, 2024 | Triplet | —Unverified | 0 |
| Energy Score-based Pseudo-Label Filtering and Adaptive Loss for Imbalanced Semi-supervised SAR target recognition | Nov 6, 2024 | Pseudo LabelPseudo Label Filtering | —Unverified | 0 |
| Graph-DPEP: Decomposed Plug and Ensemble Play for Few-Shot Document Relation Extraction with Graph-of-Thoughts Reasoning | Nov 5, 2024 | Document-level Relation ExtractionFew-Shot Learning | —Unverified | 0 |
| TriG-NER: Triplet-Grid Framework for Discontinuous Named Entity Recognition | Nov 4, 2024 | Boundary Detectionnamed-entity-recognition | CodeCode Available | 0 |
| Deep Learning for Leopard Individual Identification: An Adaptive Angular Margin Approach | Nov 4, 2024 | Deep LearningEdge Detection | CodeCode Available | 0 |
| Confidence Aware Learning for Reliable Face Anti-spoofing | Nov 2, 2024 | Face Anti-SpoofingPrediction | —Unverified | 0 |
| MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval | Oct 31, 2024 | Image RetrievalPrompt Learning | —Unverified | 0 |
| Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models | Oct 30, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| Prototypical Extreme Multi-label Classification with a Dynamic Margin Loss | Oct 27, 2024 | Contrastive LearningExtreme Multi-Label Classification | —Unverified | 0 |
| GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword Spotting | Oct 22, 2024 | Keyword SpottingTriplet | —Unverified | 0 |
| Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval | Oct 22, 2024 | AttributeDenoising | CodeCode Available | 0 |
| Diversified and Adaptive Negative Sampling on Knowledge Graphs | Oct 10, 2024 | Graph EmbeddingInformativeness | —Unverified | 0 |
| Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training | Oct 8, 2024 | Graph Question AnsweringLanguage Modeling | CodeCode Available | 0 |
| LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model | Oct 3, 2024 | image-classificationImage Classification | —Unverified | 0 |
| NL-Eye: Abductive NLI for Images | Oct 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections | Oct 2, 2024 | AttributeImage Retrieval | CodeCode Available | 0 |
| ENTP: Encoder-only Next Token Prediction | Oct 2, 2024 | DecoderIn-Context Learning | —Unverified | 0 |
| Intelligent Repetition Counting for Unseen Exercises: A Few-Shot Learning Approach with Sensor Signals | Oct 1, 2024 | Few-Shot LearningTriplet | —Unverified | 0 |