| Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work | Jan 7, 2025 | Metric Learningobject-detection | —Unverified | 0 |
| Textualize Visual Prompt for Image Editing via Diffusion Bridge | Jan 7, 2025 | Triplet | —Unverified | 0 |
| Siamese Networks for Cat Re-Identification: Exploring Neural Models for Cat Instance Recognition | Jan 3, 2025 | Image AugmentationTriplet | CodeCode Available | 0 |
| Hybrid Reciprocal Transformer with Triplet Feature Alignment for Scene Graph Generation | Jan 1, 2025 | Graph GenerationRelation | —Unverified | 0 |
| Learning with Noisy Triplet Correspondence for Composed Image Retrieval | Jan 1, 2025 | Image RetrievalRetrieval | —Unverified | 0 |
| Cross-Modal Mapping: Mitigating the Modality Gap for Few-Shot Image Classification | Dec 28, 2024 | Few-Shot Image ClassificationFew-Shot Learning | —Unverified | 0 |
| Injecting Explainability and Lightweight Design into Weakly Supervised Video Anomaly Detection Systems | Dec 28, 2024 | Anomaly DetectionBinary Classification | —Unverified | 0 |
| Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation | Dec 26, 2024 | Graph GenerationLarge Language Model | —Unverified | 0 |
| SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation | Dec 20, 2024 | Auxiliary LearningImage Segmentation | CodeCode Available | 0 |
| T^3-S2S: Training-free Triplet Tuning for Sketch to Scene Generation | Dec 18, 2024 | Scene GenerationTriplet | CodeCode Available | 0 |
| Suppressing Uncertainty in Gaze Estimation | Dec 17, 2024 | Gaze EstimationPseudo Label | —Unverified | 0 |
| Relation-Guided Adversarial Learning for Data-free Knowledge Transfer | Dec 16, 2024 | Data-free Knowledge DistillationData Free Quantization | CodeCode Available | 1 |
| A Contextualized BERT model for Knowledge Graph Completion | Dec 15, 2024 | Knowledge Graph CompletionKnowledge Graphs | —Unverified | 0 |
| VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping | Dec 15, 2024 | 3D ReconstructionAttribute | —Unverified | 0 |
| Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images | Dec 12, 2024 | 3D Point Cloud Reconstruction3D Reconstruction | —Unverified | 0 |
| Position-aware Guided Point Cloud Completion with CLIP Model | Dec 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Compositional Image Retrieval via Instruction-Aware Contrastive Learning | Dec 7, 2024 | Contrastive LearningImage Retrieval | CodeCode Available | 0 |
| Securing Social Media Against Deepfakes using Identity, Behavioral, and Geometric Signatures | Dec 7, 2024 | DeepFake DetectionFace Swapping | —Unverified | 0 |
| Distance-Adaptive Quaternion Knowledge Graph Embedding with Bidirectional Rotation | Dec 5, 2024 | Graph EmbeddingKnowledge Graph Completion | CodeCode Available | 0 |
| End-to-end Triple-domain PET Enhancement: A Hybrid Denoising-and-reconstruction Framework for Reconstructing Standard-dose PET Images from Low-dose PET Sinograms | Dec 4, 2024 | DenoisingTriplet | —Unverified | 0 |
| CLERF: Contrastive LEaRning for Full Range Head Pose Estimation | Dec 3, 2024 | Contrastive LearningHead Pose Estimation | —Unverified | 0 |
| Learning Smooth Distance Functions via Queries | Dec 2, 2024 | Triplet | —Unverified | 0 |
| Node Importance Estimation Leveraging LLMs for Semantic Augmentation in Knowledge Graphs | Nov 30, 2024 | Knowledge GraphsTriplet | CodeCode Available | 0 |
| Train Once for All: A Transitional Approach for Efficient Aspect Sentiment Triplet Extraction | Nov 29, 2024 | AllAspect Sentiment Triplet Extraction | CodeCode Available | 0 |
| Cross-Spectral Attention for Unsupervised RGB-IR Face Verification and Person Re-identification | Nov 28, 2024 | Face VerificationPerson Re-Identification | —Unverified | 0 |
| VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis | Nov 27, 2024 | Human-Object Interaction DetectionImage-text matching | —Unverified | 0 |
| Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | Nov 22, 2024 | Image RetrievalReranking | —Unverified | 0 |
| Globally Correlation-Aware Hard Negative Generation | Nov 20, 2024 | Image RetrievalMetric Learning | CodeCode Available | 1 |
| TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition | Nov 16, 2024 | Action RecognitionSkeleton Based Action Recognition | CodeCode Available | 1 |
| Leveraging large language models for efficient representation learning for entity resolution | Nov 15, 2024 | BlockingContrastive Learning | —Unverified | 0 |
| Marker-free Human Gait Analysis using a Smart Edge Sensor System | Nov 14, 2024 | Triplet | —Unverified | 0 |
| Energy Score-based Pseudo-Label Filtering and Adaptive Loss for Imbalanced Semi-supervised SAR target recognition | Nov 6, 2024 | Pseudo LabelPseudo Label Filtering | —Unverified | 0 |
| Graph-DPEP: Decomposed Plug and Ensemble Play for Few-Shot Document Relation Extraction with Graph-of-Thoughts Reasoning | Nov 5, 2024 | Document-level Relation ExtractionFew-Shot Learning | —Unverified | 0 |
| TriG-NER: Triplet-Grid Framework for Discontinuous Named Entity Recognition | Nov 4, 2024 | Boundary Detectionnamed-entity-recognition | CodeCode Available | 0 |
| Deep Learning for Leopard Individual Identification: An Adaptive Angular Margin Approach | Nov 4, 2024 | Deep LearningEdge Detection | CodeCode Available | 0 |
| Polar R-CNN: End-to-End Lane Detection with Fewer Anchors | Nov 3, 2024 | Autonomous DrivingLane Detection | CodeCode Available | 1 |
| Confidence Aware Learning for Reliable Face Anti-spoofing | Nov 2, 2024 | Face Anti-SpoofingPrediction | —Unverified | 0 |
| MoTaDual: Modality-Task Dual Alignment for Enhanced Zero-shot Composed Image Retrieval | Oct 31, 2024 | Image RetrievalPrompt Learning | —Unverified | 0 |
| Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models | Oct 30, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| Prototypical Extreme Multi-label Classification with a Dynamic Margin Loss | Oct 27, 2024 | Contrastive LearningExtreme Multi-Label Classification | —Unverified | 0 |
| Graphusion: A RAG Framework for Knowledge Graph Construction with a Global Perspective | Oct 23, 2024 | graph constructionKnowledge Graphs | CodeCode Available | 1 |
| Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval | Oct 22, 2024 | AttributeDenoising | CodeCode Available | 0 |
| GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword Spotting | Oct 22, 2024 | Keyword SpottingTriplet | —Unverified | 0 |
| Triplet: Triangle Patchlet for Mesh-Based Inverse Rendering and Scene Parameters Approximation | Oct 16, 2024 | Camera CalibrationInverse Rendering | CodeCode Available | 1 |
| Diversified and Adaptive Negative Sampling on Knowledge Graphs | Oct 10, 2024 | Graph EmbeddingInformativeness | —Unverified | 0 |
| TANet: Triplet Attention Network for All-In-One Adverse Weather Image Restoration | Oct 10, 2024 | AllImage Restoration | CodeCode Available | 1 |
| Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training | Oct 8, 2024 | Graph Question AnsweringLanguage Modeling | CodeCode Available | 0 |
| LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model | Oct 3, 2024 | image-classificationImage Classification | —Unverified | 0 |
| NL-Eye: Abductive NLI for Images | Oct 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections | Oct 2, 2024 | AttributeImage Retrieval | CodeCode Available | 0 |