| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | Dec 12, 2024 | DiversityGPU | —Unverified | 0 |
| CareBot: A Pioneering Full-Process Open-Source Medical Language Model | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Regulation of Language Models With Interpretability Will Likely Result In A Performance Trade-Off | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Dialogue Language Model with Large-Scale Persona Data Engineering | Dec 12, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations | Dec 12, 2024 | FairnessLanguage Modeling | CodeCode Available | 1 |
| Towards Wireless Native Big AI Model: The Mission and Approach Differ From Large Language Model | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language model driven: a PROTAC generation pipeline with dual constraints of structure and property | Dec 12, 2024 | Drug DiscoveryLanguage Modeling | —Unverified | 0 |
| Congruence-based Learning of Probabilistic Deterministic Finite Automata | Dec 12, 2024 | Active LearningLanguage Modeling | —Unverified | 0 |
| AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| When Text Embedding Meets Large Language Model: A Comprehensive Survey | Dec 12, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Efficient and Comprehensive Feature Extraction in Large Vision-Language Model for Pathology Analysis | Dec 12, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| Learning Novel Skills from Language-Generated Demonstrations | Dec 12, 2024 | Imitation LearningLanguage Modeling | —Unverified | 0 |
| Phi-4 Technical Report | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering | Dec 12, 2024 | feature selectionLanguage Modeling | —Unverified | 0 |
| COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework | Dec 11, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Large Concept Models: Language Modeling in a Sentence Representation Space | Dec 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| LatentQA: Teaching LLMs to Decode Activations Into Natural Language | Dec 11, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis | Dec 11, 2024 | Continual PretrainingLanguage Modeling | CodeCode Available | 1 |
| Advancing Single and Multi-task Text Classification through Large Language Model Fine-tuning | Dec 11, 2024 | ClassificationDecoder | —Unverified | 0 |
| SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs | Dec 11, 2024 | ARCGSM8K | —Unverified | 0 |
| POINTS1.5: Building a Vision-Language Model towards Real World Applications | Dec 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Position-aware Guided Point Cloud Completion with CLIP Model | Dec 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Concept Bottleneck Large Language Models | Dec 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Multimodal Latent Language Modeling with Next-Token Diffusion | Dec 11, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 0 |