| FAN: Fourier Analysis Networks | Oct 3, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPU, Language Modeling | Code Available | 3 |
| Cascade Prompt Learning for Vision-Language Model Adaptation | Sep 26, 2024 | General Knowledge, image-classification | Code Available | 3 |
| Agent Workflow Memory | Sep 11, 2024 | AI Agent, Language Modeling | Code Available | 3 |
| ContextCite: Attributing Model Generation to Context | Sep 1, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model | Aug 30, 2024 | Audio Compression, Audio Generation | Code Available | 3 |
| The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Aug 27, 2024 | GPU, Language Modeling | Code Available | 3 |
| LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Aug 24, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model | Aug 20, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling | Aug 9, 2024 | GPU, Language Modeling | Code Available | 3 |
| 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Aug 7, 2024 | 16k, 2k | Code Available | 3 |
| OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Jul 29, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON | Jul 22, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Jul 22, 2024 | Anomaly Detection, Language Modeling | Code Available | 3 |
| SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models | Jul 22, 2024 | Language Modeling | Code Available | 3 |
| Compact Language Models via Pruning and Knowledge Distillation | Jul 19, 2024 | Knowledge Distillation, Language Modeling | Code Available | 3 |
| OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer | Jul 15, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases | Jul 15, 2024 | Attribute, counterfactual | Code Available | 3 |
| Scaling Retrieval-Based Language Models with a Trillion-Token Datastore | Jul 9, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents | Jul 1, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Tree Search for Language Model Agents | Jul 1, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model | Jun 28, 2024 | Interactive Segmentation, Language Modeling | Code Available | 3 |
| VisualRWKV: Exploring Recurrent Neural Networks for Visual Language Models | Jun 19, 2024 | GPU, Language Modeling | Code Available | 3 |
| APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model Prompts | Jun 19, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| CarLLaVA: Vision language models for camera-only closed-loop driving | Jun 14, 2024 | Autonomous Driving, Bench2Drive | Code Available | 3 |
| Language Model Council: Democratically Benchmarking Foundation Models on Highly Subjective Tasks | Jun 12, 2024 | Benchmarking, Chatbot | Code Available | 3 |
| Multimodal Table Understanding | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning | Jun 9, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning | May 30, 2024 | Graph Question Answering, Knowledge Graphs | Code Available | 3 |
| Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | May 27, 2024 | GPU, Language Modeling | Code Available | 3 |
| Improving Transformers with Dynamically Composable Multi-Head Attention | May 14, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing | Apr 30, 2024 | Computational Efficiency, Hallucination | Code Available | 3 |
| A Survey on the Memory Mechanism of Large Language Model based Agents | Apr 21, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Apr 16, 2024 | Feature Engineering, Language Modeling | Code Available | 3 |
| Rho-1: Not All Tokens Are What You Need | Apr 11, 2024 | All, Continual Pretraining | Code Available | 3 |
| Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python | Apr 9, 2024 | Decision Making, Language Modeling | Code Available | 3 |
| PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection | Apr 8, 2024 | Anomaly Detection, Language Modeling | Code Available | 3 |
| MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Apr 8, 2024 | Image Generation, Image-to-Image Translation | Code Available | 3 |
| Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Apr 1, 2024 | Language Model Evaluation, Language Modeling | Code Available | 3 |
| M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models | Mar 31, 2024 | Image-text Retrieval, Language Modeling | Code Available | 3 |
| TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios | Mar 28, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models | Mar 28, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference | Mar 21, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba | Mar 15, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| GiT: Towards Generalist Vision Transformer through Universal Language Interface | Mar 14, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression | Mar 12, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation | Mar 11, 2024 | Autonomous Driving, Language Modeling | Code Available | 3 |
| Embodied Understanding of Driving Scenarios | Mar 7, 2024 | Autonomous Driving, Language Modeling | Code Available | 3 |