| EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization | Jun 17, 2025 | Multi-Instance Retrieval, Retrieval | Code Available | 0 |
| ContextRefine-CLIP for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2025 | Jun 12, 2025 | Cross-Modal Retrieval, Ensemble Learning | Code Available | 0 |
| Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning | Mar 2, 2025 | Large Language Model, Multi-Instance Retrieval | Code Available | 1 |
| Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning | Aug 7, 2024 | Multi-Instance Retrieval, Representation Learning | Unverified | 0 |
| EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation | Jun 26, 2024 | Action Anticipation, Action Recognition | Code Available | 2 |
| Symmetric Multi-Similarity Loss for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2024 | Jun 18, 2024 | Ensemble Learning, Multi-Instance Retrieval | Code Available | 0 |
| EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions? | May 28, 2024 | Action Recognition, Attribute | Code Available | 1 |
| Training a Large Video Model on a Single Machine in a Day | Sep 28, 2023 | Action Recognition, CPU | Code Available | 1 |
| EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone | Jul 11, 2023 | Action Recognition, Moment Queries | Code Available | 1 |
| UniUD Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2023 | Jun 27, 2023 | Multi-Instance Retrieval, Retrieval | Unverified | 0 |