Video Retrieval
Video retrieval takes a text query and a pool of candidate videos and selects the video that corresponds to the query. Systems typically return the candidates as a ranked list, which is scored with document retrieval metrics such as recall at rank K (R@K).
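To make the ranking-and-scoring step concrete, here is a minimal sketch of how a text-to-video recall-at-K score could be computed from a query-by-video similarity matrix. It assumes a one-to-one pairing in which the correct video for query `i` is video `i`, and it counts ties in favor of the correct video. The function name `recall_at_k` and the toy scores are illustrative, not taken from any benchmark or library.

```python
def recall_at_k(sim, k):
    """Percentage of queries whose correct video is ranked in the top k.

    sim[i][j] is the similarity of text query i to candidate video j.
    Assumption: the ground-truth video for query i is video i.
    """
    hits = 0
    for i, row in enumerate(sim):
        # 0-based rank of the correct video: how many candidates score strictly higher.
        rank = sum(1 for score in row if score > row[i])
        if rank < k:
            hits += 1
    return 100.0 * hits / len(sim)


# Toy example: 3 queries, 3 candidate videos (made-up scores).
sim = [
    [0.9, 0.1, 0.3],  # query 0: correct video ranked first
    [0.2, 0.4, 0.8],  # query 1: video 2 outranks the correct video 1
    [0.5, 0.6, 0.7],  # query 2: correct video ranked first
]

r_at_1 = recall_at_k(sim, 1)
r_at_2 = recall_at_k(sim, 2)
print(f"R@1 = {r_at_1:.1f}, R@2 = {r_at_2:.1f}")
```

In this toy case two of three queries retrieve their video at rank 1, and all three do within the top 2. Real benchmarks differ in candidate pool size and in how multiple captions per video are handled, so this sketch should not be read as any dataset's official protocol.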
Datasets: MSR-VTT-1kA, DiDeMo, MSR-VTT, LSMDC, ActivityNet, MSVD, YouCook2, FIVR-200K, VATEX, QuerYD, SSv2-label retrieval, SSv2-template retrieval
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternVideo2-6B | text-to-video R@1 | 61.4 | — | Unverified |
| 2 | InternVideo2-6B | text-to-video R@1 | 59.3 | — | Unverified |
| 3 | HunYuan_tvr (huge) | text-to-video R@1 | 59 | — | Unverified |
| 4 | InternVideo | text-to-video R@1 | 58.4 | — | Unverified |
| 5 | HunYuan_tvr | text-to-video R@1 | 58.2 | — | Unverified |
| 6 | vid-TLDR (UMT-L) | text-to-video R@1 | 57.9 | — | Unverified |
| 7 | VLAB | text-to-video R@1 | 57.5 | — | Unverified |
| 8 | MDMMT-2 | text-to-video R@1 | 56.8 | — | Unverified |
| 9 | Side4Video | text-to-video R@1 | 56.1 | — | Unverified |
| 10 | Cap4Video | text-to-video R@1 | 51.8 | — | Unverified |