| Hateful Memes Detection via Complementary Visual and Linguistic Networks | Dec 9, 2020 | Position, Sentence | Code Available | 1 | 5 |
| Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining | Dec 13, 2022 | Position, Privacy Preserving | Code Available | 1 | 5 |
| GNNs as Predictors of Agentic Workflow Performances | Mar 14, 2025 | Benchmarking, Position | Code Available | 1 | 5 |
| ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences | Jun 5, 2022 | Position | Code Available | 1 | 5 |
| Advancing Beyond Identification: Multi-bit Watermark for Large Language Models | Aug 1, 2023 | Language Modeling | Code Available | 1 | 5 |
| Goal-GAN: Multimodal Trajectory Prediction Based on Goal Position Estimation | Oct 2, 2020 | Position, Trajectory Prediction | Code Available | 1 | 5 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image Generation, Position | Code Available | 1 | 5 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | Image Classification | Code Available | 1 | 5 |
| Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images | Nov 26, 2022 | Diabetic Retinopathy Grading, Position | Code Available | 1 | 5 |
| A Skull-Adaptive Framework for AI-Based 3D Transcranial Focused Ultrasound Simulation | May 19, 2025 | Position | Code Available | 1 | 5 |