| Superhuman performance of a large language model on the reasoning tasks of a physician | Dec 14, 2024 | Diagnostic, Language Modeling | Unverified | 0 |
| Inference Scaling for Bridging Retrieval and Augmented Generation | Dec 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Learning to Verify Summary Facts with Fine-Grained LLM Feedback | Dec 14, 2024 | Fact Verification, Language Modeling | Code Available | 0 |
| Script-Based Dialog Policy Planning for LLM-Powered Conversational Agents: A Basic Architecture for an "AI Therapist" | Dec 13, 2024 | Large Language Model | Unverified | 0 |
| A Generative AI-driven Metadata Modelling Approach | Dec 13, 2024 | Large Language Model | Unverified | 0 |
| WHAT-IF: Exploring Branching Narratives by Meta-Prompting Large Language Models | Dec 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Evidence Contextualization and Counterfactual Attribution for Conversational QA over Heterogeneous Data with RAG Systems | Dec 13, 2024 | Answer Generation, Conversational Question Answering | Unverified | 0 |
| You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects | Dec 13, 2024 | Large Language Model | Code Available | 2 |
| B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens | Dec 13, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Small Language Model as Data Prospector for Large Language Model | Dec 13, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| PickLLM: Context-Aware RL-Assisted Large Language Model Routing | Dec 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | Dec 12, 2024 | Diversity, GPU | Unverified | 0 |
| Regulation of Language Models With Interpretability Will Likely Result In A Performance Trade-Off | Dec 12, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Vision-Language Models Represent Darker-Skinned Black Individuals as More Homogeneous than Lighter-Skinned Black Individuals | Dec 12, 2024 | Image Captioning, Image Generation | Unverified | 0 |
| MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced Interpretation | Dec 12, 2024 | Descriptive, Food recommendation | Code Available | 0 |
| Towards Wireless Native Big AI Model: The Mission and Approach Differ From Large Language Model | Dec 12, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations | Dec 12, 2024 | Fairness, Language Modeling | Code Available | 1 |
| EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | Dec 12, 2024 | Image Comprehension, Image Generation | Unverified | 0 |
| When Text Embedding Meets Large Language Model: A Comprehensive Survey | Dec 12, 2024 | Information Retrieval, Language Modeling | Unverified | 0 |
| ATPrompt: Textual Prompt Learning with Embedded Attributes | Dec 12, 2024 | Attribute, Large Language Model | Code Available | 3 |
| Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine | Dec 12, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| PyOD 2: A Python Library for Outlier Detection with LLM-powered Model Selection | Dec 11, 2024 | Anomaly Detection, Fraud Detection | Unverified | 0 |
| COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework | Dec 11, 2024 | GPU, Language Modeling | Unverified | 0 |
| CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs | Dec 11, 2024 | Large Language Model, Object | Unverified | 0 |