| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPU, GPU | Code Available | 1 | 5 |
| GENIUS: Sketch-based Language Model Pre-training via Extreme and Selective Masking for Text Generation and Augmentation | Nov 18, 2022 | Conditional Text Generation, Data Augmentation | Code Available | 1 | 5 |
| LMBot: Distilling Graph Knowledge into Language Model for Graph-less Deployment in Twitter Bot Detection | Jun 30, 2023 | Domain Adaptation, Language Modeling | Code Available | 1 | 5 |
| LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation | Mar 25, 2025 | Code Completion, Language Modeling | Code Available | 1 | 5 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Dec 31, 2020 | Articles, Language Modeling | Code Available | 1 | 5 |
| LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models | Nov 11, 2024 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| GePpeTto Carves Italian into a Language Model | Apr 29, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Data Augmentation using Pre-trained Transformer Models | Mar 4, 2020 | Data Augmentation, Diversity | Code Available | 1 | 5 |
| RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs | Oct 25, 2023 | GPU, Language Modeling | Code Available | 1 | 5 |
| AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Dec 31, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language Modeling | Oct 16, 2023 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital Twins | May 28, 2024 | Decision Making, Language Modeling | Code Available | 1 | 5 |
| LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis | Oct 23, 2023 | In-Context Learning, Language Modeling | Code Available | 1 | 5 |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Dec 31, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text | Aug 14, 2023 | Drug Discovery, Image Captioning | Code Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Dec 6, 2024 | counterfactual, Language Model Evaluation | Code Available | 1 | 5 |
| GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model | Jun 11, 2023 | General Knowledge, Knowledge Distillation | Code Available | 1 | 5 |
| GLADIS: A General and Large Acronym Disambiguation Benchmark | Feb 3, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LLMBind: A Unified Modality-Task Integration Framework | Feb 22, 2024 | AI Agent, Audio Generation | Code Available | 1 | 5 |
| DARTS: Differentiable Architecture Search | Jun 24, 2018 | General Classification, image-classification | Code Available | 1 | 5 |
| LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Oct 28, 2024 | Benchmarking, Language Modeling | Code Available | 1 | 5 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational Efficiency, Language Modeling | Code Available | 1 | 5 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysis, document understanding | Code Available | 1 | 5 |
| 3D Visual Illusion Depth Estimation | May 19, 2025 | Common Sense Reasoning, Depth Estimation | Code Available | 1 | 5 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR), Language Modeling | Code Available | 1 | 5 |
| Contrastive Chain-of-Thought Prompting | Nov 15, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LLMs Can Simulate Standardized Patients via Agent Coevolution | Dec 16, 2024 | Diagnostic, Language Modeling | Code Available | 1 | 5 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval | Jul 31, 2022 | Knowledge Distillation, Language Modeling | Code Available | 1 | 5 |
| Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Jan 6, 2025 | Language Model Evaluation, Language Modeling | Code Available | 1 | 5 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Sep 25, 2024 | Data Augmentation, Diversity | Code Available | 1 | 5 |
| Residual Energy-Based Models for Text Generation | Apr 22, 2020 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| GNN-LM: Language Modeling based on Global Contexts via GNN | Oct 17, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Contrastive Learning for Prompt-Based Few-Shot Language Learners | May 3, 2022 | Contrastive Learning, In-Context Learning | Code Available | 1 | 5 |
| ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic | Feb 20, 2024 | ArabicMMLU, Language Model Evaluation | Code Available | 1 | 5 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Oct 22, 2023 | Diagnostic, Language Modeling | Code Available | 1 | 5 |
| CycleFormer : TSP Solver Based on Language Modeling | May 30, 2024 | Decoder, Language Modeling | Code Available | 1 | 5 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data Augmentation, Decoder | Code Available | 1 | 5 |
| Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | May 30, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Golos: Russian Dataset for Speech Research | Jun 18, 2021 | Automatic Speech Recognition (ASR), Language Modeling | Code Available | 1 | 5 |
| Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model | Dec 18, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LLaRA: Large Language-Recommendation Assistant | Dec 5, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Apr 23, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Jul 22, 2024 | Data Augmentation, Language Modeling | Code Available | 1 | 5 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Mar 19, 2023 | Decoder, Intent Detection | Code Available | 1 | 5 |
| CTRL: A Conditional Transformer Language Model for Controllable Generation | Sep 11, 2019 | Language Modeling, Language Modelling | Code Available | 1 | 5 |