Long-Context Understanding
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4o | 1 Image, 4*4 Stitching, Exact Accuracy | 83 | — | Unverified |
| 2 | GPT-4V | 1 Image, 4*4 Stitching, Exact Accuracy | 54.72 | — | Unverified |
| 3 | Gemini Pro 1.5 | 1 Image, 4*4 Stitching, Exact Accuracy | 39.85 | — | Unverified |
| 4 | Gemini Pro 1.0 | 1 Image, 4*4 Stitching, Exact Accuracy | 24.78 | — | Unverified |
| 5 | LLaVA-Llama-3 | 1 Image, 4*4 Stitching, Exact Accuracy | 17.5 | — | Unverified |
| 6 | Claude 3 Opus | 1 Image, 4*4 Stitching, Exact Accuracy | 12.3 | — | Unverified |
| 7 | IDEFICS2-8B | 1 Image, 4*4 Stitching, Exact Accuracy | 7.8 | — | Unverified |
| 8 | InstructBLIP-Flan-T5-XXL | 1 Image, 4*4 Stitching, Exact Accuracy | 6.2 | — | Unverified |
| 9 | CogVLM2-Llama-3 | 1 Image, 4*4 Stitching, Exact Accuracy | 0.9 | — | Unverified |
| 10 | mPLUG-Owl-v2 | 1 Image, 4*4 Stitching, Exact Accuracy | 0.3 | — | Unverified |