| NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining | Jul 18, 2025 | Image EditingText-based Image Editing | —Unverified | 0 |
| SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks | Jul 17, 2025 | DeepFake DetectionFace Swapping | —Unverified | 0 |
| Dark-EvGS: Event Camera as an Eye for Radiance Field in the Dark | Jul 16, 2025 | Triplet | —Unverified | 0 |
| Attributes Shape the Embedding Space of Face Recognition Models | Jul 15, 2025 | AttributeFace Recognition | CodeCode Available | 0 |
| Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval | Jul 8, 2025 | Image RetrievalLarge Language Model | —Unverified | 0 |
| Context-Driven Knowledge Graph Completion with Semantic-Aware Relational Message Passing | Jun 29, 2025 | Knowledge Graph CompletionKnowledge Graphs | —Unverified | 0 |
| Self-supervised Feature Extraction for Enhanced Ball Detection on Soccer Robots | Jun 20, 2025 | ColorizationEdge Detection | —Unverified | 0 |
| Egocentric Human-Object Interaction Detection: A New Benchmark and Method | Jun 17, 2025 | BenchmarkingHuman-Object Interaction Detection | —Unverified | 0 |
| Quantum-Informed Contrastive Learning with Dynamic Mixup Augmentation for Class-Imbalanced Expert Systems | Jun 16, 2025 | Contrastive LearningRobust classification | —Unverified | 0 |
| Improving Large Language Model Safety with Contrastive Representation Learning | Jun 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment | Jun 12, 2025 | Dialogue GenerationTriplet | CodeCode Available | 0 |
| Unsupervised Deep Clustering of MNIST with Triplet-Enhanced Convolutional Autoencoders | Jun 11, 2025 | ClusteringDeep Clustering | —Unverified | 0 |
| Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations | Jun 10, 2025 | In-Context LearningTriplet | —Unverified | 0 |
| Boosting Vulnerability Detection of LLMs via Curriculum Preference Optimization with Synthetic Reasoning Data | Jun 9, 2025 | Learning Semantic RepresentationsTriplet | CodeCode Available | 0 |
| Learning-Augmented Hierarchical Clustering | Jun 5, 2025 | ClusteringTriplet | —Unverified | 0 |
| MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet Paradigm | Jun 5, 2025 | GPURelation | CodeCode Available | 9 |
| Contrast & Compress: Learning Lightweight Embeddings for Short Trajectories | Jun 3, 2025 | Autonomous NavigationContrastive Learning | —Unverified | 0 |
| GATE: General Arabic Text Embedding for Enhanced Semantic Textual Similarity with Matryoshka Representation Learning and Hybrid Loss Training | May 30, 2025 | MTEB BenchmarkNatural Language Inference | —Unverified | 0 |
| A Joint Reconstruction-Triplet Loss Autoencoder Approach Towards Unseen Attack Detection in IoV Networks | May 27, 2025 | Transfer LearningTriplet | —Unverified | 0 |
| MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection | May 27, 2025 | Triplet | —Unverified | 0 |
| StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation | May 26, 2025 | Image GenerationInstruction Following | —Unverified | 0 |
| An Interpretable Representation Learning Approach for Diffusion Tensor Imaging | May 25, 2025 | Contrastive LearningDecoder | —Unverified | 0 |
| A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models | May 25, 2025 | Triplet | CodeCode Available | 0 |
| Exploring Generalized Gait Recognition: Reducing Redundancy and Noise within Indoor and Outdoor Datasets | May 21, 2025 | Dataset DistillationGait Recognition | CodeCode Available | 0 |
| Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment | May 20, 2025 | Representation LearningRetrieval | —Unverified | 0 |
| Any-to-Any Learning in Computational Pathology via Triplet Multimodal Pretraining | May 19, 2025 | Survival PredictionTriplet | —Unverified | 0 |
| GLProtein: Global-and-Local Structure Aware Protein Representation Learning | May 17, 2025 | Representation LearningTriplet | —Unverified | 0 |
| Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation | May 16, 2025 | MathMMLU | —Unverified | 0 |
| EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation | May 13, 2025 | DenoisingImage Reconstruction | —Unverified | 0 |
| Automated Knot Detection and Pairing for Wood Analysis in the Timber Industry | May 9, 2025 | Transfer LearningTriplet | —Unverified | 0 |
| Achieving 3D Attention via Triplet Squeeze and Excitation Block | May 9, 2025 | Facial Expression RecognitionFacial Expression Recognition (FER) | —Unverified | 0 |
| T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction | May 8, 2025 | Aspect Sentiment Triplet ExtractionRelation | —Unverified | 0 |
| SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing | May 5, 2025 | Triplet | CodeCode Available | 2 |
| Robust Misinformation Detection by Visiting Potential Commonsense Conflict | Apr 30, 2025 | ArticlesMisinformation | CodeCode Available | 0 |
| DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Apr 28, 2025 | Generative Adversarial Networkparameter-efficient fine-tuning | CodeCode Available | 1 |
| Uncovering potential effects of spontaneous waves on synaptic development: the visual system as a model | Apr 26, 2025 | Triplet | —Unverified | 0 |
| From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval | Apr 25, 2025 | Image RetrievalRetrieval | —Unverified | 0 |
| Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections | Apr 23, 2025 | Action Triplet RecognitionFederated Learning | —Unverified | 0 |
| DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Apr 20, 2025 | AttributeFace Swapping | CodeCode Available | 2 |
| FACT: Foundation Model for Assessing Cancer Tissue Margins with Mass Spectrometry | Apr 15, 2025 | Triplet | CodeCode Available | 0 |
| Span-level Emotion-Cause-Category Triplet Extraction with Instruction Tuning LLMs and Data Augmentation | Apr 13, 2025 | Data AugmentationEmotion-Cause Pair Extraction | CodeCode Available | 0 |
| ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting | Apr 10, 2025 | 3D GenerationContrastive Learning | CodeCode Available | 0 |
| Multi-Task Learning with Multi-Annotation Triplet Loss for Improved Object Detection | Apr 10, 2025 | Multi-Task Learningobject-detection | CodeCode Available | 0 |
| Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments | Apr 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ID-Booth: Identity-consistent Face Generation with Diffusion Models | Apr 10, 2025 | DenoisingDiversity | CodeCode Available | 1 |
| AsyReC: A Multimodal Graph-based Framework for Spatio-Temporal Asymmetric Dyadic Relationship Classification | Apr 7, 2025 | Graph Neural NetworkTriplet | CodeCode Available | 0 |
| Subjective Visual Quality Assessment for High-Fidelity Learning-Based Image Compression | Apr 7, 2025 | BenchmarkingImage Compression | CodeCode Available | 0 |
| Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data | Apr 1, 2025 | Image RetrievalRetrieval | —Unverified | 0 |
| LANID: LLM-assisted New Intent Discovery | Mar 31, 2025 | Intent DiscoveryTask-Oriented Dialogue Systems | CodeCode Available | 0 |
| FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction | Mar 29, 2025 | 3DGSIndoor Scene Reconstruction | CodeCode Available | 2 |