| EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios | Dec 5, 2024 | Language Modeling | Unverified | 0 |
| SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction | Dec 5, 2024 | Articles, Dataset Generation | Code Available | 0 |
| Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning | Dec 5, 2024 | Large Language Model, Meta Reinforcement Learning | Code Available | 1 |
| LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents | Dec 5, 2024 | Image Super-Resolution, Large Language Model | Code Available | 0 |
| ALMA: Alignment with Minimal Annotation | Dec 5, 2024 | Few-Shot Learning, Language Modeling | Unverified | 0 |
| Liquid: Language Models are Scalable Multi-modal Generators | Dec 5, 2024 | Language Modeling | Code Available | 4 |
| A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios | Dec 5, 2024 | Language Modeling | Unverified | 0 |
| A Context-aware Framework for Translation-mediated Conversations | Dec 5, 2024 | Large Language Model, Translation | Unverified | 0 |
| PoTable: Towards Systematic Thinking via Stage-oriented Plan-then-Execute Reasoning on Tables | Dec 5, 2024 | Code Generation, Large Language Model | Unverified | 0 |
| EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM | Dec 5, 2024 | Image Manipulation, Language Modeling | Unverified | 0 |
| A large language model-type architecture for high-dimensional molecular potential energy surfaces | Dec 5, 2024 | Computational chemistry, Language Modeling | Unverified | 0 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media | Dec 4, 2024 | Language Modeling | Code Available | 0 |
| DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation | Dec 4, 2024 | Image Generation, Large Language Model | Unverified | 0 |
| Intent-driven In-context Learning for Few-shot Dialogue State Tracking | Dec 4, 2024 | Dialogue State Tracking, In-Context Learning | Unverified | 0 |
| ObjectFinder: An Open-Vocabulary Assistive System for Interactive Object Search by Blind People | Dec 4, 2024 | Large Language Model, Multimodal Large Language Model | Unverified | 0 |
| Advancing Conversational Psychotherapy: Integrating Privacy, Dual-Memory, and Domain Expertise with Large Language Models | Dec 4, 2024 | Chatbot, Large Language Model | Unverified | 0 |
| From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Dec 4, 2024 | Language Modeling | Code Available | 3 |
| Automatic detection of diseases in Spanish clinical notes combining medical language models and ontologies | Dec 4, 2024 | Language Modeling | Unverified | 0 |
| Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges | Dec 4, 2024 | Code Generation, Image Comprehension | Unverified | 0 |
| Video LLMs for Temporal Reasoning in Long Videos | Dec 4, 2024 | Action Segmentation, Dense Video Captioning | Unverified | 0 |
| Controlling the Mutation in Large Language Models for the Efficient Evolution of Algorithms | Dec 4, 2024 | Language Modeling | Unverified | 0 |
| Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning | Dec 4, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| MRP-LLM: Multitask Reflective Large Language Models for Privacy-Preserving Next POI Recommendation | Dec 3, 2024 | Language Modeling | Unverified | 0 |
| Hybrid-SQuAD: Hybrid Scholarly Question Answering Dataset | Dec 3, 2024 | Knowledge Graphs, Language Modeling | Unverified | 0 |