| LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens | Feb 21, 2024 | 8k | Code Available | 3 |
| Can Separators Improve Chain-of-Thought Prompting? | Feb 16, 2024 | 8k, GSM8K | Unverified | 0 |
| Referring Expression Counting | Jan 1, 2024 | 8k, object-detection | Code Available | 1 |
| Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Dec 28, 2023 | 8k, Feature Splatting | Code Available | 2 |
| SCCA: Shifted Cross Chunk Attention for long contextual semantic expansion | Dec 12, 2023 | 4k, 8k | Unverified | 0 |
| 4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters | Nov 15, 2023 | 4k, 8k | Code Available | 1 |
| LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models | Nov 8, 2023 | 8k, GPU | Code Available | 5 |
| A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture | Oct 30, 2023 | 8k, Object | Code Available | 1 |
| M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models | Oct 30, 2023 | 8k, Semantic Retrieval | Code Available | 1 |
| Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer | Oct 19, 2023 | 8k, Computational Efficiency | Unverified | 0 |
| Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models | Oct 6, 2023 | 8k, Math | Unverified | 0 |
| Transformer-VQ: Linear-Time Transformers via Vector Quantization | Sep 28, 2023 | 8k, Decoder | Code Available | 2 |
| BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model | Sep 20, 2023 | 8k, Language Modeling | Code Available | 3 |
| XGen-7B Technical Report | Sep 7, 2023 | 2k, 8k | Code Available | 2 |
| BatchPrompt: Accomplish more with less | Sep 1, 2023 | 8k, Language Modelling | Code Available | 0 |
| Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models | Aug 29, 2023 | 16k, 8k | Unverified | 0 |
| Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning | Aug 18, 2023 | 8k, Position | Code Available | 1 |
| Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection | Aug 7, 2023 | 2k, 8k | Code Available | 1 |
| VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation | Jul 28, 2023 | 3D Generation, 8k | Code Available | 1 |
| Practical Commercial 5G Standalone (SA) Uplink Throughput Prediction | Jul 23, 2023 | 4k, 8k | Unverified | 0 |
| Neural models for Factual Inconsistency Classification with Explanations | Jun 15, 2023 | 8k, Classification | Code Available | 0 |
| Stable Remaster: Bridging the Gap Between Old Content and New Displays | Jun 11, 2023 | 8k, Key Point Matching | Code Available | 0 |
| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k, 8k | Code Available | 1 |
| MAILEX: Email Event and Argument Extraction | May 22, 2023 | 4k, 8k | Code Available | 1 |
| AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three Weeks | May 16, 2023 | 8k, Active Learning | Code Available | 2 |