| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k8k | CodeCode Available | 1 |
| MAILEX: Email Event and Argument Extraction | May 22, 2023 | 4k8k | CodeCode Available | 1 |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input | Apr 13, 2023 | 2k4k | CodeCode Available | 1 |
| NLUT: Neural-based 3D Lookup Tables for Video Photorealistic Style Transfer | Mar 16, 2023 | 8kStyle Transfer | CodeCode Available | 1 |
| In-Context Learning with Many Demonstration Examples | Feb 9, 2023 | 16k8k | CodeCode Available | 1 |
| FacT: Factor-Tuning for Lightweight Adaptation on Vision Transformer | Dec 6, 2022 | 8kTransfer Learning | CodeCode Available | 1 |
| Simplifying and Understanding State Space Models with Diagonal Linear RNNs | Dec 1, 2022 | 8kState Space Models | CodeCode Available | 1 |
| Question Answering Classification for Amharic Social Media Community Based Questions | Jun 1, 2022 | 8kQuestion Answering | CodeCode Available | 1 |
| Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble | May 1, 2022 | 8kGrapheme-to-Phoneme Conversion | CodeCode Available | 1 |
| Pyramid Grafting Network for One-Stage High Resolution Saliency Detection | Apr 11, 2022 | 4k8k | CodeCode Available | 1 |
| AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath | Mar 15, 2022 | 8kAudio Classification | CodeCode Available | 1 |
| Transformer Quality in Linear Time | Feb 21, 2022 | 8kLanguage Modeling | CodeCode Available | 1 |
| Timbre Transfer with Variational Auto Encoding and Cycle-Consistent Adversarial Networks | Sep 5, 2021 | 8kFAD | CodeCode Available | 1 |
| Collapsible Linear Blocks for Super-Efficient Super Resolution | Mar 17, 2021 | 4k8k | CodeCode Available | 1 |
| ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic | Mar 6, 2021 | 2k8k | CodeCode Available | 1 |
| Contextual Residual Aggregation for Ultra High-Resolution Image Inpainting | May 19, 2020 | 2k8k | CodeCode Available | 1 |
| Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations | May 18, 2020 | 8kFew-Shot Learning | CodeCode Available | 1 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8kLanguage Modeling | CodeCode Available | 1 |
| Large Batch Training of Convolutional Networks | Aug 13, 2017 | 8k | CodeCode Available | 1 |
| MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent | Jul 3, 2025 | 8k | —Unverified | 0 |
| UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions | Jun 16, 2025 | 4k8k | —Unverified | 0 |
| Through the Valley: Path to Effective Long CoT Training for Small Language Models | Jun 9, 2025 | 8kReinforcement Learning (RL) | —Unverified | 0 |
| InterRVOS: Interaction-aware Referring Video Object Segmentation | Jun 3, 2025 | 8kObject | —Unverified | 0 |
| LLM in the Loop: Creating the PARADEHATE Dataset for Hate Speech Detoxification | Jun 2, 2025 | 8k | —Unverified | 0 |
| SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis | Jun 2, 2025 | 8kMath | —Unverified | 0 |
| Efficient Neural and Numerical Methods for High-Quality Online Speech Spectrogram Inversion via Gradient Theorem | May 30, 2025 | 8k | —Unverified | 0 |
| LoLA: Low-Rank Linear Attention With Sparse Caching | May 29, 2025 | 4k8k | —Unverified | 0 |
| Efficient Correlation Volume Sampling for Ultra-High-Resolution Optical Flow Estimation | May 22, 2025 | 8kOptical Flow Estimation | —Unverified | 0 |
| UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | May 20, 2025 | 4k8k | —Unverified | 0 |
| Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | May 13, 2025 | 16k8k | CodeCode Available | 0 |
| ViCTr: Vital Consistency Transfer for Pathology Aware Image Synthesis | May 8, 2025 | 8kData Augmentation | —Unverified | 0 |
| Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation | Apr 26, 2025 | 8kPosition | —Unverified | 0 |
| KeyDiff: Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments | Apr 21, 2025 | 8k | —Unverified | 0 |
| FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction | Apr 8, 2025 | 8kData Augmentation | CodeCode Available | 0 |
| Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts | Apr 7, 2025 | 8k | —Unverified | 0 |
| Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables | Mar 31, 2025 | 8kComputational Efficiency | —Unverified | 0 |
| Visual Acuity Consistent Foveated Rendering towards Retinal Resolution | Mar 30, 2025 | 8k | —Unverified | 0 |
| XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation | Mar 29, 2025 | 8kSynthetic Data Generation | —Unverified | 0 |
| ESSR: An 8K@30FPS Super-Resolution Accelerator With Edge Selective Network | Mar 26, 2025 | 8kSuper-Resolution | —Unverified | 0 |
| Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding | Mar 24, 2025 | 8kGPU | —Unverified | 0 |
| KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Mar 21, 2025 | 16k4k | CodeCode Available | 0 |
| Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack | Mar 18, 2025 | 8kBenchmarking | —Unverified | 0 |
| Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation | Feb 26, 2025 | 16k2k | —Unverified | 0 |
| ParallelComp: Parallel Long-Context Compressor for Length Extrapolation | Feb 20, 2025 | 4k8k | —Unverified | 0 |
| Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression | Feb 20, 2025 | 8k | —Unverified | 0 |
| CopySpec: Accelerating LLMs with Speculative Copy-and-Paste Without Compromising Quality | Feb 13, 2025 | 8kGPU | CodeCode Available | 0 |
| BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics | Jan 31, 2025 | 8kImage Generation | —Unverified | 0 |
| State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence | Jan 30, 2025 | 8kARC | —Unverified | 0 |
| Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration | Jan 27, 2025 | 4k8k | —Unverified | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Jan 9, 2025 | 2k8k | —Unverified | 0 |