| MoSH: Modeling Multi-Objective Tradeoffs with Soft and Hard Bounds | Dec 9, 2024 | Bayesian OptimizationLarge Language Model | —Unverified | 0 |
| MAVias: Mitigate any Visual Bias | Dec 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance | Dec 9, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Unseen Attack Detection in Software-Defined Networking Using a BERT-Based Large Language Model | Dec 9, 2024 | feature selectionLanguage Modeling | —Unverified | 0 |
| XKV: Personalized KV Cache Memory Reduction for Long-Context LLM Inference | Dec 8, 2024 | Combinatorial OptimizationComputational Efficiency | —Unverified | 0 |
| Cooperative SQL Generation for Segmented Databases By Using Multi-functional LLM Agents | Dec 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GL-Fusion: Rethinking the Combination of Graph Neural Network and Large Language model | Dec 8, 2024 | Graph Neural NetworkLanguage Modeling | —Unverified | 0 |
| Trust No AI: Prompt Injection Along The CIA Security Triad | Dec 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Generative AI to Enhance Automated Vulnerability Scoring | Dec 7, 2024 | Large Language ModelVulnerability Detection | CodeCode Available | 0 |
| ULMRec: User-centric Large Language Model for Sequential Recommendation | Dec 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Confidence Diagram of Nonparametric Ranking for Uncertainty Assessment in Large Language Models Evaluation | Dec 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Video2Reward: Generating Reward Function from Videos for Legged Robot Behavior Learning | Dec 7, 2024 | Large Language Model | CodeCode Available | 0 |
| Text-to-3D Gaussian Splatting with Physics-Grounded Motion Generation | Dec 7, 2024 | 3D GenerationLanguage Modeling | —Unverified | 0 |
| Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System | Dec 6, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| From Voice to Value: Leveraging AI to Enhance Spoken Online Reviews on the Go | Dec 6, 2024 | AI AgentLanguage Modeling | —Unverified | 0 |
| Multi-Armed Bandit Approach for Optimizing Training on Synthetic Data | Dec 6, 2024 | AttributeLarge Language Model | CodeCode Available | 0 |
| Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling | Dec 6, 2024 | document understandingHallucination | —Unverified | 0 |
| QueEn: A Large Language Model for Quechua-English Translation | Dec 6, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| LinVT: Empower Your Image-level Large Language Model to Understand Videos | Dec 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference | Dec 6, 2024 | GPULanguage Modeling | —Unverified | 0 |
| A Survey of Large Language Model-Based Generative AI for Text-to-SQL: Benchmarks, Applications, Use Cases, and Challenges | Dec 6, 2024 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs | Dec 6, 2024 | Entity AlignmentEntity Embeddings | —Unverified | 0 |
| SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout | Dec 5, 2024 | DenoisingLarge Language Model | —Unverified | 0 |
| MISR: Measuring Instrumental Self-Reasoning in Frontier Models | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios | Dec 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction | Dec 5, 2024 | ArticlesDataset Generation | CodeCode Available | 0 |
| Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning | Dec 5, 2024 | Large Language ModelMeta Reinforcement Learning | CodeCode Available | 1 |
| LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents | Dec 5, 2024 | Image Super-ResolutionLarge Language Model | CodeCode Available | 0 |
| ALMA: Alignment with Minimal Annotation | Dec 5, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 |
| Liquid: Language Models are Scalable Multi-modal Generators | Dec 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios | Dec 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Context-aware Framework for Translation-mediated Conversations | Dec 5, 2024 | Large Language ModelTranslation | —Unverified | 0 |
| PoTable: Towards Systematic Thinking via Stage-oriented Plan-then-Execute Reasoning on Tables | Dec 5, 2024 | Code GenerationLarge Language Model | —Unverified | 0 |
| EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM | Dec 5, 2024 | Image ManipulationLanguage Modeling | —Unverified | 0 |
| A large language model-type architecture for high-dimensional molecular potential energy surfaces | Dec 5, 2024 | Computational chemistryLanguage Modeling | —Unverified | 0 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media | Dec 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation | Dec 4, 2024 | Image GenerationLarge Language Model | —Unverified | 0 |
| Intent-driven In-context Learning for Few-shot Dialogue State Tracking | Dec 4, 2024 | Dialogue State TrackingIn-Context Learning | —Unverified | 0 |
| ObjectFinder: An Open-Vocabulary Assistive System for Interactive Object Search by Blind People | Dec 4, 2024 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| Advancing Conversational Psychotherapy: Integrating Privacy, Dual-Memory, and Domain Expertise with Large Language Models | Dec 4, 2024 | ChatbotLarge Language Model | —Unverified | 0 |
| From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Dec 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Automatic detection of diseases in Spanish clinical notes combining medical language models and ontologies | Dec 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges | Dec 4, 2024 | Code GenerationImage Comprehension | —Unverified | 0 |
| Video LLMs for Temporal Reasoning in Long Videos | Dec 4, 2024 | Action SegmentationDense Video Captioning | —Unverified | 0 |
| Controlling the Mutation in Large Language Models for the Efficient Evolution of Algorithms | Dec 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning | Dec 4, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| MRP-LLM: Multitask Reflective Large Language Models for Privacy-Preserving Next POI Recommendation | Dec 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hybrid-SQuAD: Hybrid Scholarly Question Answering Dataset | Dec 3, 2024 | Knowledge GraphsLanguage Modeling | —Unverified | 0 |