| MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffc-Aware Parallel Optimization | Nov 1, 2024 | 8kMixture-of-Experts | CodeCode Available | 0 |
| C^2: Scalable Auto-Feedback for LLM-based Chart Generation | Oct 24, 2024 | 8kDiversity | CodeCode Available | 1 |
| MPDS: A Movie Posters Dataset for Image Generation with Diffusion Model | Oct 22, 2024 | 4k8k | —Unverified | 0 |
| Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning | Oct 16, 2024 | 8k | CodeCode Available | 1 |
| High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion | Oct 15, 2024 | 8kVideo Frame Interpolation | —Unverified | 0 |
| KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8kGPU | CodeCode Available | 5 |
| AuthFace: Towards Authentic Blind Face Restoration with Face-oriented Generative Diffusion Prior | Oct 13, 2024 | 8kBlind Face Restoration | CodeCode Available | 1 |
| Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks | Oct 10, 2024 | 8kDiversity | —Unverified | 0 |
| L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? | Oct 3, 2024 | 8kDocument Summarization | CodeCode Available | 1 |
| An Exploration of Self-Supervised Mutual Information Alignment for Multi-Task Settings | Oct 2, 2024 | 8kMath | CodeCode Available | 0 |
| Extending Context Window of Large Language Models from a Distributional Perspective | Oct 2, 2024 | 16k8k | CodeCode Available | 0 |
| On The Adaptation of Unlimiformer for Decoder-Only Transformers | Oct 2, 2024 | 4k8k | —Unverified | 0 |
| PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization | Sep 25, 2024 | 8kDomain Adaptation | CodeCode Available | 1 |
| PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training | Sep 23, 2024 | 8kGPU | —Unverified | 0 |
| LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models | Aug 31, 2024 | 8kGPU | CodeCode Available | 2 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k4k | CodeCode Available | 1 |
| Evaluating Large Language Models on Spatial Tasks: A Multi-Task Benchmarking Study | Aug 26, 2024 | 8kBenchmarking | —Unverified | 0 |
| Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns | Aug 24, 2024 | 8kArticles | CodeCode Available | 0 |
| SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models | Aug 21, 2024 | 8kGSM8K | CodeCode Available | 1 |
| FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Aug 21, 2024 | 8kDecoder | CodeCode Available | 1 |
| Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models | Aug 19, 2024 | 8kInformation Retrieval | CodeCode Available | 0 |
| SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis | Aug 16, 2024 | 8kImage Generation | —Unverified | 0 |
| ProFuser: Progressive Fusion of Large Language Models | Aug 9, 2024 | 8k | —Unverified | 0 |
| PGNeXt: High-Resolution Salient Object Detection via Pyramid Grafting Network | Aug 2, 2024 | 4k8k | —Unverified | 0 |
| Evaluating Long Range Dependency Handling in Code Generation Models using Multi-Step Key Retrieval | Jul 23, 2024 | 8kCode Completion | —Unverified | 0 |