| Any-to-Any Learning in Computational Pathology via Triplet Multimodal Pretraining | May 19, 2025 | Survival Prediction, Triplet | Unverified | 0 |
| GLProtein: Global-and-Local Structure Aware Protein Representation Learning | May 17, 2025 | Representation Learning, Triplet | Unverified | 0 |
| Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation | May 16, 2025 | Math, MMLU | Unverified | 0 |
| EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation | May 13, 2025 | Denoising, Image Reconstruction | Unverified | 0 |
| Automated Knot Detection and Pairing for Wood Analysis in the Timber Industry | May 9, 2025 | Transfer Learning, Triplet | Unverified | 0 |
| Achieving 3D Attention via Triplet Squeeze and Excitation Block | May 9, 2025 | Facial Expression Recognition (FER) | Unverified | 0 |
| T-T: Table Transformer for Tagging-based Aspect Sentiment Triplet Extraction | May 8, 2025 | Aspect Sentiment Triplet Extraction, Relation | Unverified | 0 |
| SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing | May 5, 2025 | Triplet | Code Available | 2 |
| Robust Misinformation Detection by Visiting Potential Commonsense Conflict | Apr 30, 2025 | Articles, Misinformation | Code Available | 0 |
| DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Apr 28, 2025 | Generative Adversarial Network, Parameter-Efficient Fine-Tuning | Code Available | 1 |
| Uncovering potential effects of spontaneous waves on synaptic development: the visual system as a model | Apr 26, 2025 | Triplet | Unverified | 0 |
| From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval | Apr 25, 2025 | Image Retrieval, Retrieval | Unverified | 0 |
| Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections | Apr 23, 2025 | Action Triplet Recognition, Federated Learning | Unverified | 0 |
| DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning | Apr 20, 2025 | Attribute, Face Swapping | Code Available | 2 |
| FACT: Foundation Model for Assessing Cancer Tissue Margins with Mass Spectrometry | Apr 15, 2025 | Triplet | Code Available | 0 |
| Span-level Emotion-Cause-Category Triplet Extraction with Instruction Tuning LLMs and Data Augmentation | Apr 13, 2025 | Data Augmentation, Emotion-Cause Pair Extraction | Code Available | 0 |
| ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting | Apr 10, 2025 | 3D Generation, Contrastive Learning | Code Available | 0 |
| Multi-Task Learning with Multi-Annotation Triplet Loss for Improved Object Detection | Apr 10, 2025 | Multi-Task Learning, Object Detection | Code Available | 0 |
| Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments | Apr 10, 2025 | Language Modeling | Unverified | 0 |
| ID-Booth: Identity-consistent Face Generation with Diffusion Models | Apr 10, 2025 | Denoising, Diversity | Code Available | 1 |
| AsyReC: A Multimodal Graph-based Framework for Spatio-Temporal Asymmetric Dyadic Relationship Classification | Apr 7, 2025 | Graph Neural Network, Triplet | Code Available | 0 |
| Subjective Visual Quality Assessment for High-Fidelity Learning-Based Image Compression | Apr 7, 2025 | Benchmarking, Image Compression | Code Available | 0 |
| Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data | Apr 1, 2025 | Image Retrieval, Retrieval | Unverified | 0 |
| LANID: LLM-assisted New Intent Discovery | Mar 31, 2025 | Intent Discovery, Task-Oriented Dialogue Systems | Code Available | 0 |
| FreeSplat++: Generalizable 3D Gaussian Splatting for Efficient Indoor Scene Reconstruction | Mar 29, 2025 | 3DGS, Indoor Scene Reconstruction | Code Available | 2 |