| StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation | May 26, 2025 | Image GenerationInstruction Following | —Unverified | 0 |
| An Interpretable Representation Learning Approach for Diffusion Tensor Imaging | May 25, 2025 | Contrastive LearningDecoder | —Unverified | 0 |
| A Graph Perspective to Probe Structural Patterns of Knowledge in Large Language Models | May 25, 2025 | Triplet | CodeCode Available | 0 |
| Exploring Generalized Gait Recognition: Reducing Redundancy and Noise within Indoor and Outdoor Datasets | May 21, 2025 | Dataset DistillationGait Recognition | CodeCode Available | 0 |
| Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment | May 20, 2025 | Representation LearningRetrieval | —Unverified | 0 |
| Any-to-Any Learning in Computational Pathology via Triplet Multimodal Pretraining | May 19, 2025 | Survival PredictionTriplet | —Unverified | 0 |
| GLProtein: Global-and-Local Structure Aware Protein Representation Learning | May 17, 2025 | Representation LearningTriplet | —Unverified | 0 |
| Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation | May 16, 2025 | MathMMLU | —Unverified | 0 |
| EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation | May 13, 2025 | DenoisingImage Reconstruction | —Unverified | 0 |
| Achieving 3D Attention via Triplet Squeeze and Excitation Block | May 9, 2025 | Facial Expression RecognitionFacial Expression Recognition (FER) | —Unverified | 0 |