| Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters | May 28, 2024 | GPULanguage Modeling | CodeCode Available | 0 |
| MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior | Sep 16, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 0 |
| Ternary Singular Value Decomposition as a Better Parameterized Form in Linear Mapping | Aug 15, 2023 | FormLanguage Modeling | CodeCode Available | 0 |
| Morphology Matters: A Multilingual Language Modeling Analysis | Dec 11, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Replacing Language Model for Style Transfer | Nov 14, 2022 | DisentanglementLanguage Modeling | CodeCode Available | 0 |
| Test Case-Informed Knowledge Tracing for Open-ended Coding Tasks | Sep 28, 2024 | Knowledge TracingLanguage Modeling | CodeCode Available | 0 |
| PIORS: Personalized Intelligent Outpatient Reception based on Large Language Model with Multi-Agents Medical Scenario Simulation | Nov 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Physics Event Classification Using Large Language Models | Apr 5, 2024 | ChatbotClassification | CodeCode Available | 0 |
| Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions | Mar 5, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Watermark under Fire: A Robustness Evaluation of LLM Watermarking | Nov 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Zero-Shot Prompting for Implicit Intent Prediction and Recommendation with Commonsense Reasoning | Oct 12, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis | Apr 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Phonotactic Complexity across Dialects | Feb 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency | Feb 26, 2025 | intent-classificationIntent Classification | CodeCode Available | 0 |
| Phonemic Transcription of Low-Resource Tonal Languages | Dec 1, 2017 | Acoustic ModellingLanguage Modeling | CodeCode Available | 0 |
| MAMA: Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Meta-Learning the Difference: Preparing Large Language Models for Efficient Adaptation | Jul 7, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| Test-time Augmentation for Factual Probing | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Phone-ing it in: Towards Flexible Multi-Modal Language Model Training by Phonetic Representations of Data | May 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Repairing Language Model Pipelines by Meta Self-Refining Competing Constraints at Runtime | Jul 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration | Oct 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER | Aug 29, 2019 | Language ModelingNamed Entity Recognition (NER) | CodeCode Available | 0 |
| More Room for Language: Investigating the Effect of Retrieval on Language Models | Apr 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TexShape: Information Theoretic Sentence Embedding for Language Models | Feb 5, 2024 | Data CompressionFairness | CodeCode Available | 0 |
| Reliable Academic Conference Question Answering: A Study Based on Large Language Model | Oct 19, 2023 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Relevance in Dialogue: Is Less More? An Empirical Comparison of Existing Metrics, and a Novel Simple Metric | Jun 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly | Oct 16, 2022 | Cultural Vocal Bursts Intensity PredictionHallucination | CodeCode Available | 0 |
| Relational recurrent neural networks | Jun 5, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Meta Fine-Tuning Neural Language Models for Multi-Domain Text Mining | Mar 29, 2020 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 |
| MoRE-LLM: Mixture of Rule Experts Guided by a Large Language Model | Mar 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Text-based classification of interviews for mental health -- juxtaposing the state of the art | Jul 29, 2020 | Audio ClassificationClassification | CodeCode Available | 0 |
| Reinforced Large Language Model is a formal theorem prover | Feb 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PHD: Pixel-Based Language Modeling of Historical Documents | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Transition-Based Generation from Abstract Meaning Representations | Jul 24, 2017 | Abstract Meaning RepresentationLanguage Modeling | CodeCode Available | 0 |
| KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning | Nov 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Regulation of Language Models With Interpretability Will Likely Result In A Performance Trade-Off | Dec 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Transition-Based Syntactic Linearization with Lookahead Features | Jun 1, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model | May 24, 2023 | AllLanguage Modeling | CodeCode Available | 0 |
| Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Regularizing RNNs by Stabilizing Activations | Nov 26, 2015 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MAPLE: Mobile App Prediction Leveraging Large Language Model Embeddings | Sep 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Manifold-Preserving Transformers are Effective for Short-Long Range Encoding | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| More Expressive Attention with Negative Weights | Nov 11, 2024 | DecoderImage Generation | CodeCode Available | 0 |
| Scaling Capability in Token Space: An Analysis of Large Vision Language Model | Dec 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Text Counterfactuals via Latent Optimization and Shapley-Guided Search | Oct 22, 2021 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers | Nov 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing | Oct 10, 2024 | image-classificationImage Classification | CodeCode Available | 0 |
| Meta-Context Transformers for Domain-Specific Response Generation | Oct 12, 2020 | Dialogue GenerationLanguage Modeling | CodeCode Available | 0 |
| PhayaThaiBERT: Enhancing a Pretrained Thai Language Model with Unassimilated Loanwords | Nov 21, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Text-Driven Neural Collaborative Filtering Model for Paper Source Tracing | Jul 25, 2024 | ArticlesCollaborative Filtering | CodeCode Available | 0 |