| Teola: Towards End-to-End Optimization of LLM-based Applications | Jun 29, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer | Jun 24, 2024 | AI Agent, Large Language Model | Code Available | 2 |
| EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting | Jun 22, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| GenoTEX: An LLM Agent Benchmark for Automated Gene Expression Data Analysis | Jun 21, 2024 | AI Agent, AutoML | Code Available | 2 |
| MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression | Jun 21, 2024 | GPU, Language Modeling | Code Available | 2 |
| Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Jun 20, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning | Jun 20, 2024 | Autonomous Navigation, Heuristic Search | Code Available | 2 |
| AgentReview: Exploring Peer Review Dynamics with LLM Agents | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction | Jun 18, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling | Jun 18, 2024 | Arithmetic Reasoning, Language Modeling | Code Available | 2 |
| Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM | Jun 18, 2024 | Anomaly Detection, Anomaly Localization | Code Available | 2 |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Jun 17, 2024 | Hallucination, Language Modeling | Code Available | 2 |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Watch Every Step! LLM Agent Learning via Iterative Step-Level Process Refinement | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| StreamBench: Towards Benchmarking Continuous Improvement of Language Agents | Jun 13, 2024 | Benchmarking, Language Modeling | Code Available | 2 |
| Explore the Limits of Omni-modal Pretraining at Scale | Jun 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Discovering Preference Optimization Algorithms with and for Large Language Models | Jun 12, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Jun 11, 2024 | AI Agent, Descriptive | Code Available | 2 |
| LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model | Jun 7, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Tool-Planner: Task Planning with Clusters across Multiple Tools | Jun 6, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt | Jun 6, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM | Jun 5, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models | Jun 5, 2024 | Few-Shot Learning, Language Modeling | Code Available | 2 |
| From Redundancy to Relevance: Information Flow in LVLMs Across Reasoning Tasks | Jun 4, 2024 | Image Captioning, Language Modelling | Code Available | 2 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPU, Language Modeling | Code Available | 2 |