| UIO-LLMs: Unbiased Incremental Optimization for Long-Context LLMs | Jun 26, 2024 | 4kDecoder | CodeCode Available | 0 |
| UHD-IQA Benchmark Database: Pushing the Boundaries of Blind Photo Quality Assessment | Jun 25, 2024 | 4kBlind Image Quality Assessment | CodeCode Available | 0 |
| LongIns: A Challenging Long-context Instruction-based Exam for LLMs | Jun 25, 2024 | 16k4k | —Unverified | 0 |
| ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance | Jun 24, 2024 | 4kDenoising | —Unverified | 0 |
| MedOdyssey: A Medical Domain Benchmark for Long Context Evaluation Up to 200K Tokens | Jun 21, 2024 | 4kFairness | CodeCode Available | 1 |
| LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs | Jun 21, 2024 | 4kChunking | —Unverified | 0 |
| GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models | Jun 20, 2024 | 16k4k | —Unverified | 0 |
| Ultra-High-Definition Image Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution | Jun 19, 2024 | 4kDeblurring | CodeCode Available | 1 |
| 4K4DGen: Panoramic 4D Generation at 4K Resolution | Jun 19, 2024 | 4k | —Unverified | 0 |
| Embedding machine-learnt sub-grid variability improves climate model biases | Jun 13, 2024 | 4k | —Unverified | 0 |
| An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding | Jun 11, 2024 | 4k | CodeCode Available | 1 |
| Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Jun 11, 2024 | 4kLanguage Modeling | CodeCode Available | 4 |
| LoCoCo: Dropping In Convolutions for Long Context Compression | Jun 8, 2024 | 4k | CodeCode Available | 1 |
| Measuring and Addressing Indexical Bias in Information Retrieval | Jun 6, 2024 | 4kFairness | CodeCode Available | 0 |
| Uni-ISP: Unifying the Learning of ISPs from Multiple Cameras | Jun 3, 2024 | 4k | —Unverified | 0 |
| Towards Ultra-High-Definition Image Deraining: A Benchmark and An Efficient Method | May 27, 2024 | 4kImage Reconstruction | CodeCode Available | 1 |
| 360Zhinao Technical Report | May 22, 2024 | 4k | CodeCode Available | 3 |
| CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations | May 16, 2024 | 4k | CodeCode Available | 0 |
| Challenges in Deploying Long-Context Transformers: A Theoretical Peak Performance Analysis | May 14, 2024 | 4kGPU | —Unverified | 0 |
| Multimodal Collaboration Networks for Geospatial Vehicle Detection in Dense, Occluded, and Large-Scale Events | May 14, 2024 | 4kobject-detection | CodeCode Available | 0 |
| PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated Images | Apr 29, 2024 | 4kImage Generation | CodeCode Available | 0 |
| Make Your LLM Fully Utilize the Context | Apr 25, 2024 | 4kInformation Retrieval | CodeCode Available | 5 |
| Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey | Apr 25, 2024 | 4kImage Super-Resolution | CodeCode Available | 3 |
| How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites | Apr 25, 2024 | 4kLanguage Modeling | CodeCode Available | 0 |
| UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement | Apr 22, 2024 | 4kImage Enhancement | CodeCode Available | 2 |
| NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results | Apr 22, 2024 | 4kImage Enhancement | CodeCode Available | 5 |
| EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation | Apr 19, 2024 | 3DGS4k | —Unverified | 0 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k8k | CodeCode Available | 2 |
| Real-World Efficient Blind Motion Deblurring via Blur Pixel Discretization | Apr 18, 2024 | 4kDeblurring | —Unverified | 0 |
| D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation | Apr 16, 2024 | 4k | —Unverified | 0 |
| LLoCO: Learning Long Contexts Offline | Apr 11, 2024 | 4kIn-Context Learning | CodeCode Available | 2 |
| InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD | Apr 9, 2024 | 4kLanguage Modeling | CodeCode Available | 0 |
| InternLM2 Technical Report | Mar 26, 2024 | 4kLong-Context Understanding | CodeCode Available | 9 |
| Counting-Stars: A Multi-evidence, Position-aware, and Scalable Benchmark for Evaluating Long-Context Large Language Models | Mar 18, 2024 | 4kPosition | CodeCode Available | 2 |
| m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Mar 17, 2024 | 4k | CodeCode Available | 1 |
| PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Mar 7, 2024 | 4kImage Captioning | CodeCode Available | 5 |
| Scalable Superconductor Neuron with Ternary Synaptic Connections for Ultra-Fast SNN Hardware | Feb 26, 2024 | 4kEfficient Neural Network | —Unverified | 0 |
| KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge | Feb 21, 2024 | 4kMultiple-choice | —Unverified | 0 |
| Data Engineering for Scaling Language Models to 128K Context | Feb 15, 2024 | 4kContinual Pretraining | CodeCode Available | 3 |
| World Model on Million-Length Video And Language With Blockwise RingAttention | Feb 13, 2024 | 4kVideo Understanding | CodeCode Available | 9 |
| Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search | Feb 12, 2024 | 4kConversational Search | CodeCode Available | 1 |
| LongFin: A Multimodal Document Understanding Model for Long Financial Domain Documents | Jan 26, 2024 | 4kDocument AI | —Unverified | 0 |
| E^2-LLM: Efficient and Extreme Length Extension of Large Language Models | Jan 13, 2024 | 4kGPU | —Unverified | 0 |
| Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting | Jan 12, 2024 | 4kGPU | —Unverified | 0 |
| Efficient Parallel Data Optimization for Homogeneous Diffusion Inpainting of 4K Images | Jan 12, 2024 | 4kGPU | —Unverified | 0 |
| Long Context Compression with Activation Beacon | Jan 7, 2024 | 4kdocument understanding | CodeCode Available | 0 |
| DarkShot: Lighting Dark Images with Low-Compute and High-Quality | Dec 28, 2023 | 4kImage Restoration | —Unverified | 0 |
| Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration | Dec 22, 2023 | 4kreinforcement-learning | —Unverified | 0 |
| Holoported Characters: Real-time Free-viewpoint Rendering of Humans from Sparse RGB Cameras | Dec 12, 2023 | 4k | —Unverified | 0 |
| SCCA: Shifted Cross Chunk Attention for long contextual semantic expansion | Dec 12, 2023 | 4k8k | —Unverified | 0 |