| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysisdocument understanding | CodeCode Available | 1 | 5 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Context-aware Stand-alone Neural Spelling Correction | Nov 12, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPUGPU | CodeCode Available | 1 | 5 |
| Fusing Context Into Knowledge Graph for Commonsense Question Answering | Dec 9, 2020 | Common Sense ReasoningKnowledge Graphs | CodeCode Available | 1 | 5 |
| AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Dec 31, 2020 | ArticlesLanguage Modeling | CodeCode Available | 1 | 5 |
| LLaRA: Large Language-Recommendation Assistant | Dec 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CultureBank: An Online Community-Driven Knowledge Base Towards Culturally Aware Language Technologies | Apr 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Context Compression for Auto-regressive Transformers with Sentinel Tokens | Oct 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Q-learning with Language Model for Edit-based Unsupervised Summarization | Oct 9, 2020 | Abstractive Text SummarizationDecoder | CodeCode Available | 1 | 5 |
| CTRAN: CNN-Transformer-based Network for Natural Language Understanding | Mar 19, 2023 | DecoderIntent Detection | CodeCode Available | 1 | 5 |
| FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data | Aug 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| FVEval: Understanding Language Model Capabilities in Formal Verification of Digital Hardware | Oct 15, 2024 | Code GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context | May 7, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding | Dec 31, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CTRL: A Conditional Transformer Language Model for Controllable Generation | Sep 11, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation | Apr 2, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLaMo: Large Language Model-based Molecular Graph Assistant | Oct 31, 2024 | Instruction FollowingIUPAC Name Prediction | CodeCode Available | 1 | 5 |
| CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model | Apr 9, 2023 | Cross-Part Crowd CountingCrowd Counting | CodeCode Available | 1 | 5 |
| Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Jun 1, 2022 | Contrastive LearningCross-Lingual Transfer | CodeCode Available | 1 | 5 |
| LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation | Sep 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy Reward | Mar 31, 2025 | Crowd CountingLanguage Modeling | CodeCode Available | 1 | 5 |
| Cross-Thought for Sentence Encoder Pre-training | Oct 7, 2020 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| GenAug: Data Augmentation for Finetuning Text Generators | Oct 5, 2020 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizing | Oct 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LiveMind: Low-latency Large Language Models with Simultaneous Inference | Jun 20, 2024 | Collaborative InferenceLanguage Modeling | CodeCode Available | 1 | 5 |
| Cross-model Control: Improving Multiple Large Language Models in One-time Training | Oct 23, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance | Nov 30, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 | 5 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | ClassificationGPU | CodeCode Available | 1 | 5 |
| Contextual information integration for stance detection via cross-attention | Nov 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Aug 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| 3D Visual Illusion Depth Estimation | May 19, 2025 | Common Sense ReasoningDepth Estimation | CodeCode Available | 1 | 5 |
| Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization | Jan 22, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Generate to Understand for Representation | Jun 14, 2023 | Contrastive LearningGPU | CodeCode Available | 1 | 5 |
| CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations | Sep 1, 2021 | Emotion ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning | Feb 10, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task | Jul 16, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval | Jul 31, 2022 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 | 5 |
| LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models | Apr 1, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| Listen, Attend and Spell | Aug 5, 2015 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic | Feb 20, 2024 | ArabicMMLULanguage Model Evaluation | CodeCode Available | 1 | 5 |
| LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses | Oct 30, 2023 | FormLanguage Modeling | CodeCode Available | 1 | 5 |
| Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| RDF2Vec: RDF Graph Embeddings and Their Applications | Nov 10, 2017 | Entity EmbeddingsGeneral Knowledge | CodeCode Available | 1 | 5 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| R-Drop: Regularized Dropout for Neural Networks | Jun 28, 2021 | Abstractive Text Summarizationimage-classification | CodeCode Available | 1 | 5 |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Nov 23, 2021 | Image CaptioningImage Description | CodeCode Available | 1 | 5 |
| Lite Transformer with Long-Short Range Attention | Apr 24, 2020 | Abstractive Text SummarizationAutoML | CodeCode Available | 1 | 5 |
| Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model | Dec 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment | Oct 9, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |