| Instruction-Driven Game Engine: A Poker Case Study | Oct 17, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Improving Multi-modal Large Language Model through Boosting Vision Capabilities | Oct 17, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Collaborative AI in Sentiment Analysis: System Architecture, Data Prediction and Deployment Strategies | Oct 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task Automation | Oct 17, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Trust but Verify: Programmatic VLM Evaluation in the Wild | Oct 17, 2024 | BenchmarkingLanguage Modelling | —Unverified | 0 |
| MedINST: Meta Dataset of Biomedical Instructions | Oct 17, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Advancing Large Language Model Attribution through Self-Improving | Oct 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation | Oct 17, 2024 | GSM8KLanguage Modeling | CodeCode Available | 0 |
| On the Role of Attention Heads in Large Language Model Safety | Oct 17, 2024 | AttributeLanguage Modeling | CodeCode Available | 2 |
| aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing | Oct 17, 2024 | AttributeCode Completion | CodeCode Available | 7 |
| Towards Hybrid Intelligence in Journalism: Findings and Lessons Learnt from a Collaborative Analysis of Greek Political Rhetoric by ChatGPT and Humans | Oct 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems | Oct 17, 2024 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | AllLanguage Modeling | CodeCode Available | 0 |
| REFINE on Scarce Data: Retrieval Enhancement through Fine-Tuning via Model Fusion of Embedding Models | Oct 16, 2024 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Large Language Model-driven Multi-Agent Simulation for News Diffusion Under Different Network Structures | Oct 16, 2024 | BlockingLanguage Modeling | —Unverified | 0 |
| EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference | Oct 16, 2024 | Computational EfficiencyLarge Language Model | —Unverified | 0 |
| Can We Reverse In-Context Knowledge Edits? | Oct 16, 2024 | knowledge editingLanguage Modelling | —Unverified | 0 |
| Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning | Oct 16, 2024 | Contrastive Learninggraph construction | —Unverified | 0 |
| LFOSum: Summarizing Long-form Opinions with Large Language Models | Oct 16, 2024 | FormLanguage Modelling | —Unverified | 0 |
| BenchmarkCards: Large Language Model and Risk Reporting | Oct 16, 2024 | FairnessLanguage Modeling | —Unverified | 0 |
| ShapefileGPT: A Multi-Agent Large Language Model Framework for Automated Shapefile Processing | Oct 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MedAide: Towards an Omni Medical Aide via Specialized LLM-based Multi-Agent Collaboration | Oct 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding the Role of LLMs in Multimodal Evaluation Benchmarks | Oct 16, 2024 | BenchmarkingLarge Language Model | CodeCode Available | 0 |
| Explainable Moral Values: a neuro-symbolic approach to value classification | Oct 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples | Oct 16, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |