| Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers | Nov 1, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| RadFlag: A Black-Box Hallucination Detection Method for Medical Vision Language Models | Nov 1, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents | Nov 1, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement | Nov 1, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Enhancing the Traditional Chinese Medicine Capabilities of Large Language Model through Reinforcement Learning from AI Feedback | Nov 1, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| LLM-KT: A Versatile Framework for Knowledge Transfer from Large Language Models to Collaborative Filtering | Nov 1, 2024 | Collaborative Filtering, Language Modeling | Unverified | 0 |
| Unified Generative and Discriminative Training for Multi-modal Large Language Models | Nov 1, 2024 | Dynamic Time Warping, Image-text Classification | Unverified | 0 |
| Randomized Autoregressive Visual Generation | Nov 1, 2024 | Image Generation, Language Modeling | Code Available | 5 |
| LLaMo: Large Language Model-based Molecular Graph Assistant | Oct 31, 2024 | Instruction Following, IUPAC Name Prediction | Code Available | 1 |
| DEREC-SIMPRO: unlock Language Model benefits to advance Synthesis in Data Clean Room | Oct 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| MESS+: Energy-Optimal Inferencing in Language Model Zoos with Service Level Guarantees | Oct 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning | Oct 31, 2024 | Dictionary Learning, Language Modeling | Unverified | 0 |
| Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking | Oct 31, 2024 | Data Augmentation, Dialogue State Tracking | Unverified | 0 |
| GPT or BERT: why not both? | Oct 31, 2024 | Causal Language Modeling, Language Modeling | Code Available | 2 |
| EchoNarrator: Generating natural text explanations for ejection fraction predictions | Oct 31, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Morphological Typology in BPE Subword Productivity and Language Modeling | Oct 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach | Oct 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction | Oct 31, 2024 | Disaster Response, Language Modeling | Code Available | 1 |
| Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning | Oct 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Interpretable Language Modeling via Induction-head Ngram Models | Oct 31, 2024 | Causal Language Modeling, Human fMRI response prediction | Code Available | 1 |
| Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts | Oct 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| π_0: A Vision-Language-Action Flow Model for General Robot Control | Oct 31, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| What is Wrong with Perplexity for Long-context Language Modeling? | Oct 31, 2024 | Document Summarization, In-Context Learning | Code Available | 2 |
| Matchmaker: Self-Improving Large Language Model Programs for Schema Matching | Oct 31, 2024 | Data Integration, Language Modeling | Unverified | 0 |
| The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge | Oct 31, 2024 | Audio Generation, Language Modeling | Unverified | 0 |