| Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models | Aug 19, 2024 | 8k, Information Retrieval | Code Available | 0 |
| SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis | Aug 16, 2024 | 8k, Image Generation | Unverified | 0 |
| ProFuser: Progressive Fusion of Large Language Models | Aug 9, 2024 | 8k | Unverified | 0 |
| PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network | Aug 2, 2024 | 4k, 8k | Unverified | 0 |
| Evaluating Long Range Dependency Handling in Code Generation Models using Multi-Step Key Retrieval | Jul 23, 2024 | 8k, Code Completion | Unverified | 0 |
| ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities | Jul 19, 2024 | 4k, 8k | Unverified | 0 |
| NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context? | Jul 16, 2024 | 4k, 8k | Code Available | 9 |
| Learning to (Learn at Test Time): RNNs with Expressive Hidden States | Jul 5, 2024 | 16k, 8k | Code Available | 5 |
| Let the Code LLM Edit Itself When You Edit the Code | Jul 3, 2024 | 8k, Code Generation | Unverified | 0 |
| Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Jun 28, 2024 | 8k, Anomaly Detection | Code Available | 2 |