| ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities | Jul 19, 2024 | 4k8k | —Unverified | 0 |
| NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context? | Jul 16, 2024 | 4k8k | CodeCode Available | 9 |
| Learning to (Learn at Test Time): RNNs with Expressive Hidden States | Jul 5, 2024 | 16k8k | CodeCode Available | 5 |
| Let the Code LLM Edit Itself When You Edit the Code | Jul 3, 2024 | 8kCode Generation | —Unverified | 0 |
| Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Jun 28, 2024 | 8kAnomaly Detection | CodeCode Available | 2 |
| Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics | Jun 20, 2024 | 8kDescriptive | —Unverified | 0 |
| GenderAlign: An Alignment Dataset for Mitigating Gender Bias in Large Language Models | Jun 20, 2024 | 8k | CodeCode Available | 0 |
| Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Jun 19, 2024 | 8kHallucination | CodeCode Available | 2 |
| Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM | Jun 16, 2024 | 8kOpinion Summarization | —Unverified | 0 |
| Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture | Jun 1, 2024 | 8kFace Reconstruction | —Unverified | 0 |
| Cutting Through the Noise: Boosting LLM Performance on Math Word Problems | May 30, 2024 | 8kMath | CodeCode Available | 0 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | May 21, 2024 | 2k8k | CodeCode Available | 1 |
| DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents | Apr 30, 2024 | 8kDiversity | CodeCode Available | 0 |
| Extending Llama-3's Context Ten-Fold Overnight | Apr 30, 2024 | 8kGPU | CodeCode Available | 0 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k8k | CodeCode Available | 2 |
| Self-Supervised Visual Preference Alignment | Apr 16, 2024 | 8kMM-Vet | CodeCode Available | 2 |
| BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion | Apr 6, 2024 | 8kScene Generation | —Unverified | 0 |
| CodeShell Technical Report | Mar 23, 2024 | 8kHumanEval | —Unverified | 0 |
| Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs | Mar 13, 2024 | 8kAnswer Generation | —Unverified | 0 |
| Fast Kernel Scene Flow | Mar 9, 2024 | 8kAutonomous Driving | CodeCode Available | 1 |
| Machine Translation in the Covid domain: an English-Irish case study for LoResMT 2021 | Mar 2, 2024 | 8kDomain Adaptation | —Unverified | 0 |
| CAMixerSR: Only Details Need More "Attention" | Feb 29, 2024 | 2k8k | CodeCode Available | 3 |
| MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Feb 27, 2024 | 8kLanguage Modeling | CodeCode Available | 0 |
| Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles | Feb 27, 2024 | 8kArticles | —Unverified | 0 |
| Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays | Feb 25, 2024 | 16k8k | —Unverified | 0 |