| Revisited Large Language Model for Time Series Analysis through Modality Alignment | Oct 16, 2024 | Anomaly Detection, Imputation | Unverified | 0 |
| Retrieval-Reasoning Large Language Model-based Synthetic Clinical Trial Generation | Oct 16, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited Responses | Oct 15, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| Towards More Effective Table-to-Text Generation: Assessing In-Context Learning and Self-Evaluation with Open-Source Models | Oct 15, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| De-jargonizing Science for Journalists with GPT-4: A Pilot Study | Oct 15, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| LargePiG: Your Large Language Model is Secretly a Pointer Generator | Oct 15, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic Segmentation | Oct 15, 2024 | Autonomous Driving, Language Modeling | Code Available | 2 |
| MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router | Oct 15, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| A Framework for Adapting Human-Robot Interaction to Diverse User Groups | Oct 15, 2024 | Action Detection, Activity Detection | Code Available | 0 |
| MoChat: Joints-Grouped Spatio-Temporal Grounding LLM for Multi-Turn Motion Comprehension and Description | Oct 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ATTNChecker: Highly-Optimized Fault Tolerant Attention for Large Language Model Training | Oct 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Y-Mol: A Multiscale Biomedical Knowledge-Guided Large Language Model for Drug Development | Oct 15, 2024 | Drug Design, Knowledge Graphs | Unverified | 0 |
| Retrieval Augmented Spelling Correction for E-Commerce Applications | Oct 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Synthetic Interlocutors. Experiments with Generative AI to Prolong Ethnographic Encounters | Oct 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction | Oct 15, 2024 | Emotion Recognition, Language Modeling | Code Available | 0 |
| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | Hallucination, Large Language Model | Code Available | 0 |
| Preserve or Modify? Context-Aware Evaluation for Balancing Preservation and Modification in Text-Guided Image Editing | Oct 15, 2024 | Attribute, Large Language Model | Unverified | 0 |
| SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing | Oct 15, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks | Oct 15, 2024 | HumanEval, Language Modelling | Unverified | 0 |
| GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation | Oct 15, 2024 | Explainable Recommendation, Language Modelling | Code Available | 1 |
| Sequential LLM Framework for Fashion Recommendation | Oct 15, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Skill Learning Using Process Mining for Large Language Model Plan Generation | Oct 14, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Not All Options Are Created Equal: Textual Option Weighting for Token-Efficient LLM-Based Knowledge Tracing | Oct 14, 2024 | All, Binary Classification | Unverified | 0 |
| PRACTIQ: A Practical Conversational Text-to-SQL dataset with Ambiguous and Unanswerable Queries | Oct 14, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Oct 14, 2024 | Benchmarking, Large Language Model | Code Available | 3 |
| Character-aware audio-visual subtitling in context | Oct 14, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Oct 14, 2024 | Density Ratio Estimation, GSM8K | Code Available | 0 |
| Diagnosing Hate Speech Classification: Where Do Humans and Machines Disagree, and Why? | Oct 14, 2024 | Diagnostic, Large Language Model | Unverified | 0 |
| Recipe for Zero-shot POS Tagging: Is It Useful in Realistic Scenarios? | Oct 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model Evaluation via Matrix Nuclear-Norm | Oct 14, 2024 | Computational Efficiency, Data Compression | Code Available | 0 |
| A Multi-Task Text Classification Pipeline with Natural Language Explanations: A User-Centric Evaluation in Sentiment Analysis and Offensive Language Identification in Greek Tweets | Oct 14, 2024 | Feature Importance, Language Identification | Unverified | 0 |
| Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies | Oct 14, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| Model-based Large Language Model Customization as Service | Oct 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization | Oct 14, 2024 | Explanation Generation, Image Forgery Detection | Unverified | 0 |
| MisinfoEval: Generative AI in the Era of "Alternative Facts" | Oct 13, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles | Oct 13, 2024 | Autonomous Vehicles, Code Generation | Unverified | 0 |
| Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization | Oct 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | Oct 13, 2024 | GPU, Language Modeling | Unverified | 0 |
| Adaptive Reasoning and Acting in Medical Language Agents | Oct 13, 2024 | Decision Making, Diagnostic | Unverified | 0 |
| ALLoRA: Adaptive Learning Rate Mitigates LoRA Fatal Flaws | Oct 13, 2024 | Large Language Model | Unverified | 0 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Oct 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| LoRE: Logit-Ranked Retriever Ensemble for Enhancing Open-Domain Question Answering | Oct 13, 2024 | Answer Generation, Language Modeling | Unverified | 0 |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | Oct 12, 2024 | Code Generation, Language Modeling | Unverified | 0 |
| DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning | Oct 12, 2024 | Audio captioning, Large Language Model | Unverified | 0 |
| Debiasing Vison-Language Models with Text-Only Training | Oct 12, 2024 | Large Language Model | Unverified | 0 |
| LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning | Oct 12, 2024 | Knowledge Graphs, Language Modeling | Code Available | 0 |
| Extended Japanese Commonsense Morality Dataset with Masked Token and Label Enhancement | Oct 12, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Enterprise Benchmarks for Large Language Model Evaluation | Oct 11, 2024 | Benchmarking, Language Model Evaluation | Code Available | 0 |
| LLMD: A Large Language Model for Interpreting Longitudinal Medical Records | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | Oct 11, 2024 | Large Language Model, Logical Reasoning | Unverified | 0 |