| Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking | Aug 14, 2023 | Position, Visual Tracking | Code Available | 1 | 5 |
| Entroformer: A Transformer-based Entropy Model for Learned Image Compression | Feb 11, 2022 | Image Classification | Code Available | 1 | 5 |
| Instruction Position Matters in Sequence Generation with Large Language Models | Aug 23, 2023 | Instruction Following, Position | Code Available | 1 | 5 |
| Iterative Geometry Calibration from Distance Estimates for Wireless Acoustic Sensor Networks | Dec 11, 2020 | Position | Code Available | 1 | 5 |
| Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images | Nov 26, 2022 | Diabetic Retinopathy Grading, Position | Code Available | 1 | 5 |
| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image Generation, Position | Code Available | 1 | 5 |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Jul 31, 2021 | Image Classification | Code Available | 1 | 5 |
| Enquire One's Parent and Child Before Decision: Fully Exploit Hierarchical Structure for Self-Supervised Taxonomy Expansion | Jan 27, 2021 | Position, Taxonomy Expansion | Code Available | 1 | 5 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2k, Position | Code Available | 1 | 5 |
| ETC: Encoding Long and Structured Inputs in Transformers | Apr 17, 2020 | Position, Question Answering | Code Available | 1 | 5 |