| MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet Paradigm | Jun 5, 2025 | GPU, Relation | Code Available | 9 | 5 |
| Phantom: Subject-consistent video generation via cross-modal alignment | Feb 16, 2025 | cross-modal alignment, Human-Domain Subject-to-Video | Code Available | 5 | 5 |
| Stable-Hair: Real-World Hair Transfer via Diffusion Model | Jul 19, 2024 | Triplet | Code Available | 4 | 5 |
| GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector | May 30, 2022 | Co-Salient Object Detection, Object | Code Available | 4 | 5 |
| TristouNet: Triplet Loss for Speaker Turn Embedding | Sep 14, 2016 | Change Detection, Triplet | Code Available | 3 | 5 |
| Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning | May 13, 2022 | Knowledge Graphs, Relation | Code Available | 3 | 5 |
| Bringing Old Photos Back to Life | Apr 20, 2020 | Image Restoration, Translation | Code Available | 3 | 5 |
| Old Photo Restoration via Deep Latent Space Translation | Sep 14, 2020 | Image Restoration, Translation | Code Available | 3 | 5 |
| An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval | Jun 13, 2024 | Contrastive Learning, Image Retrieval | Code Available | 2 | 5 |
| Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision | Feb 14, 2024 | Language Modelling, Segmentation | Code Available | 2 | 5 |
| RED^FM: a Filtered and Multilingual Relation Extraction Dataset | Jun 16, 2023 | Relation, Relation Extraction | Code Available | 2 | 5 |
| Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences | Mar 8, 2022 | Triplet, Weakly-supervised Learning | Code Available | 2 | 5 |
| StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Sep 20, 2023 | Chart Question Answering, Chart Understanding | Code Available | 2 | 5 |
| KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG | Feb 13, 2025 | Knowledge Graphs, Large Language Model | Code Available | 2 | 5 |
| Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification | May 28, 2024 | Person Re-Identification, Triplet | Code Available | 2 | 5 |
| Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation | Jul 19, 2024 | Data Augmentation, Depth Estimation | Code Available | 2 | 5 |
| Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval | Apr 21, 2022 | Cross-Modal Retrieval, Image Retrieval | Code Available | 2 | 5 |
| DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Apr 20, 2025 | Attribute, Face Swapping | Code Available | 2 | 5 |
| FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | May 28, 2024 | Novel View Synthesis, Triplet | Code Available | 2 | 5 |
| Geometric Transformer for Fast and Robust Point Cloud Registration | Feb 14, 2022 | Metric Learning, Point Cloud Registration | Code Available | 2 | 5 |
| AutoRE: Document-Level Relation Extraction with Large Language Models | Mar 21, 2024 | Document-level Relation Extraction, Relation | Code Available | 2 | 5 |
| Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Apr 9, 2024 | Image Retrieval, Object | Code Available | 2 | 5 |
| CenterNet++ for Object Detection | Apr 18, 2022 | Object, object-detection | Code Available | 2 | 5 |
| FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction | Mar 29, 2025 | 3DGS, Indoor Scene Reconstruction | Code Available | 2 | 5 |
| Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models | Mar 4, 2024 | Knowledge Graph Completion, Knowledge Graphs | Code Available | 2 | 5 |