| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k, 8k | Code Available | 1 |
| MAILEX: Email Event and Argument Extraction | May 22, 2023 | 4k, 8k | Code Available | 1 |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input | Apr 13, 2023 | 2k, 4k | Code Available | 1 |
| NLUT: Neural-based 3D Lookup Tables for Video Photorealistic Style Transfer | Mar 16, 2023 | 8k, Style Transfer | Code Available | 1 |
| In-Context Learning with Many Demonstration Examples | Feb 9, 2023 | 16k, 8k | Code Available | 1 |
| FacT: Factor-Tuning for Lightweight Adaptation on Vision Transformer | Dec 6, 2022 | 8k, Transfer Learning | Code Available | 1 |
| Simplifying and Understanding State Space Models with Diagonal Linear RNNs | Dec 1, 2022 | 8k, State Space Models | Code Available | 1 |
| Question Answering Classification for Amharic Social Media Community Based Questions | Jun 1, 2022 | 8k, Question Answering | Code Available | 1 |
| Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble | May 1, 2022 | 8k, Grapheme-to-Phoneme Conversion | Code Available | 1 |
| Pyramid Grafting Network for One-Stage High Resolution Saliency Detection | Apr 11, 2022 | 4k, 8k | Code Available | 1 |
| AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath | Mar 15, 2022 | 8k, Audio Classification | Code Available | 1 |
| Transformer Quality in Linear Time | Feb 21, 2022 | 8k, Language Modeling | Code Available | 1 |
| Timbre Transfer with Variational Auto Encoding and Cycle-Consistent Adversarial Networks | Sep 5, 2021 | 8k, FAD | Code Available | 1 |
| Collapsible Linear Blocks for Super-Efficient Super Resolution | Mar 17, 2021 | 4k, 8k | Code Available | 1 |
| ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic | Mar 6, 2021 | 2k, 8k | Code Available | 1 |
| Contextual Residual Aggregation for Ultra High-Resolution Image Inpainting | May 19, 2020 | 2k, 8k | Code Available | 1 |
| Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations | May 18, 2020 | 8k, Few-Shot Learning | Code Available | 1 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8k, Language Modeling | Code Available | 1 |
| Large Batch Training of Convolutional Networks | Aug 13, 2017 | 8k | Code Available | 1 |
| MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent | Jul 3, 2025 | 8k | Unverified | 0 |
| UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | Jun 16, 2025 | 4k, 8k | Unverified | 0 |
| Through the Valley: Path to Effective Long CoT Training for Small Language Models | Jun 9, 2025 | 8k, Reinforcement Learning (RL) | Unverified | 0 |
| InterRVOS: Interaction-aware Referring Video Object Segmentation | Jun 3, 2025 | 8k, Object | Unverified | 0 |
| SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis | Jun 2, 2025 | 8k, Math | Unverified | 0 |
| LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification | Jun 2, 2025 | 8k | Unverified | 0 |