| UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation | Oct 3, 2024 | Chunking, Language Modeling | Unverified | 0 |
| CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning | Oct 3, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| LLMCO2: Advancing Accurate Carbon Footprint Prediction for LLM Inferences | Oct 3, 2024 | GPU, Graph Neural Network | Unverified | 0 |
| ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration | Oct 3, 2024 | Decision Making, Language Modeling | Code Available | 1 |
| Neutral residues: revisiting adapters for model extension | Oct 3, 2024 | Domain Adaptation, Language Modelling | Unverified | 0 |
| Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Oct 3, 2024 | Diversity, Language Modeling | Code Available | 4 |
| Leveraging Large Language Models to Enhance Personalized Recommendations in E-commerce | Oct 2, 2024 | Diversity, Language Modeling | Unverified | 0 |
| Long-range gene expression prediction with token alignment of large language model | Oct 2, 2024 | In-Context Learning, Language Modeling | Unverified | 0 |
| Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| A Two-Stage Proactive Dialogue Generator for Efficient Clinical Information Collection Using Large Language Model | Oct 2, 2024 | Diagnostic, Dialogue Generation | Unverified | 0 |
| CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL | Oct 2, 2024 | Large Language Model, Text to SQL | Unverified | 0 |
| Racing Thoughts: Explaining Contextualization Errors in Large Language Models | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LLM-Augmented Symbolic Reinforcement Learning with Landmark-Based Task Decomposition | Oct 2, 2024 | Common Sense Reasoning, Inductive logic programming | Unverified | 0 |
| Generate then Refine: Data Augmentation for Zero-shot Intent Detection | Oct 2, 2024 | Data Augmentation, Diversity | Code Available | 0 |
| OCC-MLLM-Alpha: Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning | Oct 2, 2024 | 3D Generation, Language Modeling | Unverified | 0 |
| TypedThinker: Typed Thinking Improves Large Language Model Reasoning | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Automatic deductive coding in discourse analysis: an application of large language models in learning analytics | Oct 2, 2024 | Feature Engineering, Language Modeling | Code Available | 0 |
| Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices | Oct 2, 2024 | GPU, Language Modeling | Code Available | 1 |
| Investigating on RLHF methodology | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Spoken Grammar Assessment Using LLM | Oct 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Elaborative Subtopic Query Reformulation for Broad and Indirect Queries in Travel Destination Recommendation | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Efficient 1-bit tensor approximations | Oct 2, 2024 | Large Language Model | Unverified | 0 |
| Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Oct 2, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension | Oct 2, 2024 | Image Segmentation, Large Language Model | Unverified | 0 |
| OCC-MLLM: Empowering Multimodal Large Language Model For the Understanding of Occluded Objects | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data | Oct 2, 2024 | Arithmetic Reasoning, Large Language Model | Code Available | 4 |
| ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving | Oct 2, 2024 | Benchmarking, Document Summarization | Unverified | 0 |
| From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge | Oct 2, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model | Oct 1, 2024 | All, Language Modeling | Unverified | 0 |
| Khattat: Enhancing Readability and Concept Representation of Semantic Typography | Oct 1, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards Democratization of Subspeciality Medical Expertise | Oct 1, 2024 | Diagnostic, Large Language Model | Unverified | 0 |
| Detección Automática de Patologías en Notas Clínicas en Español Combinando Modelos de Lenguaje y Ontologías Médicos | Oct 1, 2024 | Large Language Model | Unverified | 0 |
| PclGPT: A Large Language Model for Patronizing and Condescending Language Detection | Oct 1, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Optimizing Token Usage on Large Language Model Conversations Using the Design Structure Matrix | Oct 1, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Exploring Empty Spaces: Human-in-the-Loop Data Augmentation | Oct 1, 2024 | Data Augmentation, Diversity | Code Available | 1 |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Oct 1, 2024 | Continual Learning, Language Modeling | Code Available | 1 |
| Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation | Oct 1, 2024 | Descriptive, Inductive Bias | Unverified | 0 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPU, Language Modeling | Code Available | 3 |
| Don't Stop Me Now: Embedding Based Scheduling for LLMs | Oct 1, 2024 | Blocking, Large Language Model | Unverified | 0 |
| ViDAS: Vision-based Danger Assessment and Scoring | Oct 1, 2024 | Fixed Few Shot Prompting, Fixed Few Shot Prompting Danger Assessment | Unverified | 0 |
| ReXplain: Translating Radiology into Patient-Friendly Video Reports | Oct 1, 2024 | Anatomy, Image Segmentation | Unverified | 0 |
| Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces | Oct 1, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| "Show Me What's Wrong!": Combining Charts and Text to Guide Data Analysis | Oct 1, 2024 | Fraud Detection, Language Modeling | Unverified | 0 |
| A Hierarchical conv-LSTM and LLM Integrated Model for Holistic Stock Forecasting | Sep 30, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| A Methodology for Explainable Large Language Models with Integrated Gradients and Linguistic Analysis in Text Classification | Sep 30, 2024 | Large Language Model, text-classification | Unverified | 0 |
| ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer | Sep 30, 2024 | All, Large Language Model | Unverified | 0 |
| EEG Emotion Copilot: Optimizing Lightweight LLMs for Emotional EEG Interpretation with Assisted Medical Record Generation | Sep 30, 2024 | Computational Efficiency, Diagnostic | Code Available | 0 |
| DoPAMine: Domain-specific Pre-training Adaptation from seed-guided data Mining | Sep 30, 2024 | Continual Pretraining, Domain Adaptation | Unverified | 0 |
| Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning | Sep 30, 2024 | Instruction Following, Language Modeling | Code Available | 2 |