| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | Dec 12, 2024 | DiversityGPU | —Unverified | 0 |
| CareBot: A Pioneering Full-Process Open-Source Medical Language Model | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Regulation of Language Models With Interpretability Will Likely Result In A Performance Trade-Off | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Congruence-based Learning of Probabilistic Deterministic Finite Automata | Dec 12, 2024 | Active LearningLanguage Modeling | —Unverified | 0 |
| Towards Wireless Native Big AI Model: The Mission and Approach Differ From Large Language Model | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dialogue Language Model with Large-Scale Persona Data Engineering | Dec 12, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Language model driven: a PROTAC generation pipeline with dual constraints of structure and property | Dec 12, 2024 | Drug DiscoveryLanguage Modeling | —Unverified | 0 |
| SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations | Dec 12, 2024 | FairnessLanguage Modeling | CodeCode Available | 1 |
| AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering | Dec 12, 2024 | feature selectionLanguage Modeling | —Unverified | 0 |
| Learning Novel Skills from Language-Generated Demonstrations | Dec 12, 2024 | Imitation LearningLanguage Modeling | —Unverified | 0 |
| When Text Embedding Meets Large Language Model: A Comprehensive Survey | Dec 12, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Efficient and Comprehensive Feature Extraction in Large Vision-Language Model for Pathology Analysis | Dec 12, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Phi-4 Technical Report | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework | Dec 11, 2024 | GPULanguage Modeling | —Unverified | 0 |
| LatentQA: Teaching LLMs to Decode Activations Into Natural Language | Dec 11, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Dec 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis | Dec 11, 2024 | Continual PretrainingLanguage Modeling | CodeCode Available | 1 |
| Multimodal Latent Language Modeling with Next-Token Diffusion | Dec 11, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 0 |
| Advancing Single and Multi-task Text Classification through Large Language Model Fine-tuning | Dec 11, 2024 | ClassificationDecoder | —Unverified | 0 |
| POINTS1.5: Building a Vision-Language Model towards Real World Applications | Dec 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Underestimated Privacy Risks for Minority Populations in Large Language Model Unlearning | Dec 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TurboAttention: Efficient Attention Approximation For High Throughputs LLMs | Dec 11, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs | Dec 11, 2024 | ARCGSM8K | —Unverified | 0 |
| Concept Bottleneck Large Language Models | Dec 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Position-aware Guided Point Cloud Completion with CLIP Model | Dec 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training | Dec 11, 2024 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 |
| Automatic Item Generation for Personality Situational Judgment Tests with Large Language Models | Dec 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Research on the Application of Spark Streaming Real-Time Data Analysis System and large language model Intelligent Agents | Dec 10, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Active Inference for Self-Organizing Multi-LLM Systems: A Bayesian Thermodynamic Approach to Adaptation | Dec 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Preference Adaptive and Sequential Text-to-Image Generation | Dec 10, 2024 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Neural Scaling Laws Rooted in the Data Distribution | Dec 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Agents for self-driving laboratories applied to quantum computing | Dec 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models | Dec 10, 2024 | Continual PretrainingLanguage Modeling | —Unverified | 0 |
| CoPrUS: Consistency Preserving Utterance Synthesis towards more realistic benchmark dialogues | Dec 10, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Bayesian Optimization of Antibodies Informed by a Generative Model of Evolving Sequences | Dec 10, 2024 | Bayesian OptimizationLanguage Modeling | CodeCode Available | 1 |
| MAPLE: A Framework for Active Preference Learning Guided by Large Language Models | Dec 10, 2024 | Active LearningLanguage Modeling | —Unverified | 0 |
| Granite Guardian | Dec 10, 2024 | HallucinationLanguage Modeling | CodeCode Available | 2 |
| KULTURE Bench: A Benchmark for Assessing Language Model in Korean Cultural Context | Dec 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RAG-based Question Answering over Heterogeneous Data and Text | Dec 10, 2024 | Answer GenerationKnowledge Graphs | —Unverified | 0 |
| The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model | Dec 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Filling Memory Gaps: Enhancing Continual Semantic Parsing via SQL Syntax Variance-Guided LLMs without Real Data Replay | Dec 10, 2024 | Continual LearningLanguage Modeling | —Unverified | 0 |
| IntellectSeeker: A Personalized Literature Management System with the Probabilistic Model and Large Language Model | Dec 10, 2024 | ArticlesFew-Shot Learning | CodeCode Available | 0 |
| LINKs: Large Language Model Integrated Management for 6G Empowered Digital Twin NetworKs | Dec 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection | Dec 9, 2024 | Alzheimer's Disease DetectionAutomatic Speech Recognition | —Unverified | 0 |
| Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning | Dec 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Small Languages, Big Models: A Study of Continual Training on Languages of Norway | Dec 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions | Dec 9, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Gated Delta Networks: Improving Mamba2 with Delta Rule | Dec 9, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 4 |