| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language Modeling | Code Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze Test, Language Modeling | Code Available | 1 | 5 |
| DELIFT: Data Efficient Language model Instruction Fine Tuning | Nov 7, 2024 | Language Modeling | Code Available | 1 | 5 |
| Latxa: An Open Language Model and Evaluation Suite for Basque | Mar 29, 2024 | Language Modeling | Code Available | 1 | 5 |
| LaMP: When Large Language Models Meet Personalization | Apr 22, 2023 | Language Modeling | Code Available | 1 | 5 |
| DEMix Layers: Disentangling Domains for Modular Language Modeling | Aug 11, 2021 | Language Modeling | Code Available | 1 | 5 |
| Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language | Aug 17, 2023 | Language Modeling | Code Available | 1 | 5 |
| Democratizing Reasoning Ability: Tailored Learning from Large Language Model | Oct 20, 2023 | Instruction Following, Language Modeling | Code Available | 1 | 5 |
| Language Modeling with Gated Convolutional Networks | Dec 23, 2016 | Language Modeling | Code Available | 1 | 5 |
| L^2M: Mutual Information Scaling Law for Long-Context Language Modeling | Mar 6, 2025 | Language Modeling | Code Available | 1 | 5 |
| Dependency-based Mixture Language Models | Mar 19, 2022 | Language Modeling | Code Available | 1 | 5 |
| L2MAC: Large Language Model Automatic Computer for Extensive Code Generation | Oct 2, 2023 | Code Generation, Language Modeling | Code Available | 1 | 5 |
| Label2Label: A Language Modeling Framework for Multi-Attribute Learning | Jul 18, 2022 | Attribute, Clothing Attribute Recognition | Code Available | 1 | 5 |
| Chinese Spelling Correction as Rephrasing Language Model | Aug 17, 2023 | Language Modeling | Code Available | 1 | 5 |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language Modeling | Code Available | 1 | 5 |
| MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation | Dec 28, 2023 | GSM8K, Language Model Evaluation | Code Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RL, Language Modeling | Code Available | 1 | 5 |
| Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection | Apr 28, 2020 | Abuse Detection, Language Modeling | Code Available | 1 | 5 |
| ASR2K: Speech Recognition for Around 2000 Languages without Audio | Sep 6, 2022 | Language Modeling | Code Available | 1 | 5 |
| Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | May 30, 2024 | Language Modeling | Code Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text Summarization, Language Modeling | Code Available | 1 | 5 |
| LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions | May 18, 2024 | Language Modeling | Code Available | 1 | 5 |
| Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection | Dec 1, 2020 | Language Modeling | Code Available | 1 | 5 |
| Detection-Correction Structure via General Language Model for Grammatical Error Correction | May 28, 2024 | Grammatical Error Correction, Language Modeling | Code Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | Blocking, Language Modeling | Code Available | 1 | 5 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU, Language Modeling | Code Available | 1 | 5 |
| CMD: a framework for Context-aware Model self-Detoxification | Aug 16, 2023 | Language Modeling | Code Available | 1 | 5 |
| KR-BERT: A Small-Scale Korean-Specific Language Model | Aug 10, 2020 | Language Modeling | Code Available | 1 | 5 |
| Labrador: Exploring the Limits of Masked Language Modeling for Laboratory Data | Dec 9, 2023 | Language Modeling | Code Available | 1 | 5 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | Diversity, Image Captioning | Code Available | 1 | 5 |
| DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control | Feb 9, 2025 | Language Modeling | Code Available | 1 | 5 |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Aug 24, 2022 | Language Modeling | Code Available | 1 | 5 |
| Kosmos-2: Grounding Multimodal Large Language Models to the World | Jun 26, 2023 | Image Captioning, In-Context Learning | Code Available | 1 | 5 |
| Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner | Jun 17, 2024 | Language Modeling | Code Available | 1 | 5 |
| Dialogue State Tracking with a Language Model using Schema-Driven Prompting | Sep 15, 2021 | Dialogue State Tracking, Language Modeling | Code Available | 1 | 5 |
| Dialogue-oriented Pre-training | Jun 1, 2021 | Language Modeling | Code Available | 1 | 5 |
| DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation | Apr 27, 2022 | Decoder, Diversity | Code Available | 1 | 5 |
| Learning to Attribute with Attention | Apr 18, 2025 | Attribute, Language Modeling | Code Available | 1 | 5 |
| DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation | Apr 19, 2022 | Dialogue Generation, Knowledge Distillation | Code Available | 1 | 5 |
| Learning to engineer protein flexibility | Dec 24, 2024 | Language Modeling | Code Available | 1 | 5 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language Modeling | Code Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language Modeling | Code Available | 1 | 5 |
| DiffEditor: Enhancing Speech Editing with Semantic Enrichment and Acoustic Consistency | Sep 19, 2024 | Language Modeling | Code Available | 1 | 5 |
| K-PLUG: Knowledge-Injected Pre-Trained Language Model for Natural Language Understanding and Generation | Jan 1, 2021 | Chatbot, Decoder | Code Available | 1 | 5 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| Differentiable Language Model Adversarial Attacks on Categorical Sequence Classifiers | Jun 19, 2020 | Adversarial Attack, Language Modeling | Code Available | 1 | 5 |
| KnowMAN: Weakly Supervised Multinomial Adversarial Networks | Sep 16, 2021 | Language Modeling | Code Available | 1 | 5 |
| Citekit: A Modular Toolkit for Large Language Model Citation Generation | Aug 6, 2024 | Language Modeling | Code Available | 1 | 5 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactual, Data Augmentation | Code Available | 1 | 5 |