| MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet Paradigm | Jun 5, 2025 | GPU, Relation | Code Available | 9 |
| Phantom: Subject-consistent video generation via cross-modal alignment | Feb 16, 2025 | cross-modal alignment, Human-Domain Subject-to-Video | Code Available | 5 |
| Stable-Hair: Real-World Hair Transfer via Diffusion Model | Jul 19, 2024 | Triplet | Code Available | 4 |
| GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector | May 30, 2022 | Co-Salient Object Detection, Object | Code Available | 4 |
| Simple and Effective Relation-based Embedding Propagation for Knowledge Representation Learning | May 13, 2022 | Knowledge Graphs, Relation | Code Available | 3 |
| Old Photo Restoration via Deep Latent Space Translation | Sep 14, 2020 | Image Restoration, Translation | Code Available | 3 |
| Bringing Old Photos Back to Life | Apr 20, 2020 | Image Restoration, Translation | Code Available | 3 |
| TristouNet: Triplet Loss for Speaker Turn Embedding | Sep 14, 2016 | Change Detection, Triplet | Code Available | 3 |
| SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing | May 5, 2025 | Triplet | Code Available | 2 |
| DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Apr 20, 2025 | Attribute, Face Swapping | Code Available | 2 |
| FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction | Mar 29, 2025 | 3DGS, Indoor Scene Reconstruction | Code Available | 2 |
| KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG | Feb 13, 2025 | Knowledge Graphs, Large Language Model | Code Available | 2 |
| YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Jan 23, 2025 | Multi-Object Tracking, object-detection | Code Available | 2 |
| Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation | Jul 19, 2024 | Data Augmentation, Depth Estimation | Code Available | 2 |
| An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval | Jun 13, 2024 | Contrastive Learning, Image Retrieval | Code Available | 2 |
| FreeSplat: Generalizable 3D Gaussian Splatting Towards Free-View Synthesis of Indoor Scenes | May 28, 2024 | Novel View Synthesis, Triplet | Code Available | 2 |
| Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification | May 28, 2024 | Person Re-Identification, Triplet | Code Available | 2 |
| Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Apr 9, 2024 | Image Retrieval, Object | Code Available | 2 |
| AutoRE: Document-Level Relation Extraction with Large Language Models | Mar 21, 2024 | Document-level Relation Extraction, Relation | Code Available | 2 |
| Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models | Mar 4, 2024 | Knowledge Graph Completion, Knowledge Graphs | Code Available | 2 |
| Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision | Feb 14, 2024 | Language Modelling, Segmentation | Code Available | 2 |
| UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction | Feb 10, 2024 | graph construction, Knowledge Graph Completion | Code Available | 2 |
| Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers | Feb 7, 2024 | Drug Discovery, Graph Learning | Code Available | 2 |
| StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Sep 20, 2023 | Chart Question Answering, Chart Understanding | Code Available | 2 |
| RED^FM: a Filtered and Multilingual Relation Extraction Dataset | Jun 16, 2023 | Relation, Relation Extraction | Code Available | 2 |
| VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation | Mar 15, 2023 | Optical Flow Estimation, Triplet | Code Available | 2 |
| Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval | Apr 21, 2022 | Cross-Modal Retrieval, Image Retrieval | Code Available | 2 |
| CenterNet++ for Object Detection | Apr 18, 2022 | Object, object-detection | Code Available | 2 |
| Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences | Mar 8, 2022 | Triplet, Weakly-supervised Learning | Code Available | 2 |
| Geometric Transformer for Fast and Robust Point Cloud Registration | Feb 14, 2022 | Metric Learning, Point Cloud Registration | Code Available | 2 |
| Supervised Contrastive Learning | Apr 23, 2020 | Class Incremental Learning, Contrastive Learning | Code Available | 2 |
| DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Apr 28, 2025 | Generative Adversarial Network, parameter-efficient fine-tuning | Code Available | 1 |
| ID-Booth: Identity-consistent Face Generation with Diffusion Models | Apr 10, 2025 | Denoising, Diversity | Code Available | 1 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image Retrieval, Language Modeling | Code Available | 1 |
| LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning | Mar 23, 2025 | Continual Learning, Exemplar-Free | Code Available | 1 |
| REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding | Mar 10, 2025 | Instruction Following, Keypoint Detection | Code Available | 1 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA) | Code Available | 1 |
| Range and Bird's Eye View Fused Cross-Modal Visual Place Recognition | Feb 17, 2025 | Re-Ranking, Triplet | Code Available | 1 |
| Relation-Guided Adversarial Learning for Data-free Knowledge Transfer | Dec 16, 2024 | Data-free Knowledge Distillation, Data Free Quantization | Code Available | 1 |
| Globally Correlation-Aware Hard Negative Generation | Nov 20, 2024 | Image Retrieval, Metric Learning | Code Available | 1 |
| TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition | Nov 16, 2024 | Action Recognition, Skeleton Based Action Recognition | Code Available | 1 |
| Polar R-CNN: End-to-End Lane Detection with Fewer Anchors | Nov 3, 2024 | Autonomous Driving, Lane Detection | Code Available | 1 |
| Graphusion: A RAG Framework for Knowledge Graph Construction with a Global Perspective | Oct 23, 2024 | graph construction, Knowledge Graphs | Code Available | 1 |
| Triplet: Triangle Patchlet for Mesh-Based Inverse Rendering and Scene Parameters Approximation | Oct 16, 2024 | Camera Calibration, Inverse Rendering | Code Available | 1 |
| TANet: Triplet Attention Network for All-In-One Adverse Weather Image Restoration | Oct 10, 2024 | All, Image Restoration | Code Available | 1 |
| Deep-Wide Learning Assistance for Insect Pest Classification | Sep 16, 2024 | Classification, Data Augmentation | Code Available | 1 |
| One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild | Sep 15, 2024 | GPU, Image Generation | Code Available | 1 |
| GLGait: A Global-Local Temporal Receptive Field Network for Gait Recognition in the Wild | Aug 13, 2024 | Gait Recognition, Gait Recognition in the Wild | Code Available | 1 |
| Masked Graph Autoencoders with Contrastive Augmentation for Spatially Resolved Transcriptomics Data | Aug 9, 2024 | Denoising, Triplet | Code Available | 1 |
| Graphusion: Leveraging Large Language Models for Scientific Knowledge Graph Fusion and Construction in NLP Education | Jul 15, 2024 | graph construction, Knowledge Graphs | Code Available | 1 |