| Juman++: A Morphological Analysis Toolkit for Scriptio Continua | Nov 1, 2018 | Art Analysis, Language Modeling | Code Available | 0 |
| Development and Validation of a Dynamic-Template-Constrained Large Language Model for Generating Fully-Structured Radiology Reports | Sep 26, 2024 | Descriptive, Language Modeling | Code Available | 0 |
| Building a Swedish Open-Domain Conversational Language Model | Apr 12, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling | Oct 25, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| Living Machines: A study of atypical animacy | May 22, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA | Jun 11, 2025 | Code Generation, Language Modeling | Code Available | 0 |
| Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model | Jun 6, 2024 | Chinese Word Segmentation, Language Modeling | Code Available | 0 |
| How much complexity does an RNN architecture need to learn syntax-sensitive dependencies? | May 17, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law | Feb 1, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Cross-Domain NER using Cross-Domain Language Modeling | Jul 1, 2019 | Cross-Domain Named Entity Recognition, Domain Adaptation | Code Available | 0 |
| CroissantLLM: A Truly Bilingual French-English Language Model | Feb 1, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language Models | Jul 16, 2024 | Decision Making, Language Modeling | Code Available | 0 |
| How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench | May 24, 2023 | Diversity, Language Modeling | Code Available | 0 |
| Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping | Oct 26, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities | Mar 20, 2025 | General Knowledge, Language Modeling | Code Available | 0 |
| Pre-training of Graph Augmented Transformers for Medication Recommendation | Jun 2, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language Models | Nov 9, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Bridging Information-Theoretic and Geometric Compression in Language Models | Oct 20, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance | Dec 23, 2024 | Articles, Language Modeling | Code Available | 0 |
| ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Low-Perplexity Toxic Prompts | Jul 12, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| K-12BERT: BERT for K-12 education | May 24, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | Hallucination, Knowledge Probing | Code Available | 0 |
| How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical Survey | Dec 8, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| How to Determine the Preferred Image Distribution of a Black-Box Vision-Language Model? | Sep 3, 2024 | In-Context Learning, Language Modeling | Code Available | 0 |
| How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics | Aug 24, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation | Oct 25, 2023 | Data-to-Text Generation, Hallucination | Code Available | 0 |
| CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics | Jun 16, 2024 | Classification, Informativeness | Code Available | 0 |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Oct 14, 2024 | Density Ratio Estimation, GSM8K | Code Available | 0 |
| How to Leverage Personal Textual Knowledge for Personalized Conversational Information Retrieval | Jul 23, 2024 | Information Retrieval, Language Modeling | Code Available | 0 |
| Non-Determinism of "Deterministic" LLM Settings | Aug 6, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| How to Protect Copyright Data in Optimization of Large Language Models? | Aug 23, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Logical Implications for Visual Question Answering Consistency | Mar 16, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Creative GANs for generating poems, lyrics, and metaphors | Sep 20, 2019 | Generative Adversarial Network, Language Modeling | Code Available | 0 |
| How to Unleash the Power of Large Language Models for Few-shot Relation Extraction? | May 2, 2023 | In-Context Learning, Language Modeling | Code Available | 0 |
| How transformers learn structured data: insights from hierarchical filtering | Aug 27, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-training | May 18, 2025 | Contrastive Learning, In-Context Learning | Code Available | 0 |
| CRCL at SemEval-2024 Task 2: Simple prompt optimizations | May 3, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph | Sep 17, 2024 | cross-modal alignment, Image Captioning | Code Available | 0 |
| Agglomerative Attention | Jul 15, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| BRENT: Bidirectional Retrieval Enhanced Norwegian Transformer | Apr 19, 2023 | Dependency Parsing, Extractive Question-Answering | Code Available | 0 |
| Assessing the Promise and Pitfalls of ChatGPT for Automated Code Generation | Nov 5, 2023 | Code Generation, Language Modeling | Code Available | 0 |
| Anatomy of Neural Language Models | Jan 8, 2024 | Anatomy, Language Modeling | Code Available | 0 |
| LegiLM: A Fine-Tuned Legal Language Model for Data Compliance | Sep 9, 2024 | Information Retrieval, Language Modeling | Code Available | 0 |
| CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model | Oct 24, 2023 | Clustering, Language Modeling | Code Available | 0 |
| HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression | Oct 16, 2021 | Few-Shot Learning, Knowledge Distillation | Code Available | 0 |
| Breaking Time Invariance: Assorted-Time Normalization for RNNs | Sep 28, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| LEGOBench: Scientific Leaderboard Generation Benchmark | Jan 11, 2024 | Decoder, Language Modeling | Code Available | 0 |
| A Geometric Notion of Causal Probing | Jul 27, 2023 | counterfactual, Language Modeling | Code Available | 0 |
| KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Oct 21, 2024 | Chatbot, Language Modeling | Code Available | 0 |
| Crafting In-context Examples according to LMs' Parametric Knowledge | Nov 16, 2023 | Hallucination, In-Context Learning | Code Available | 0 |