| VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups | Jun 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring | May 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Incorporating External POS Tagger for Punctuation Restoration | Jun 12, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Sep 25, 2024 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 | 5 |
| Stabilizing Transformers for Reinforcement Learning | Oct 13, 2019 | General Reinforcement LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 | 5 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 | 5 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Jun 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Sep 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images | Oct 22, 2023 | DiagnosticLanguage Modeling | CodeCode Available | 1 | 5 |
| Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| -former: Infinite Memory Transformer | May 1, 2022 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 | 5 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Jul 12, 2024 | Document Layout Analysisdocument understanding | CodeCode Available | 1 | 5 |
| CycleFormer : TSP Solver Based on Language Modeling | May 30, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| AutoScrum: Automating Project Planning Using Large Language Models | Jun 5, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model | Mar 4, 2025 | es-enLanguage Modeling | CodeCode Available | 1 | 5 |
| InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation | Dec 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models | Oct 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought | Feb 25, 2025 | Emotion RecognitionLanguage Modeling | CodeCode Available | 1 | 5 |
| InforMask: Unsupervised Informative Masking for Language Model Pretraining | Oct 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| RealFormer: Transformer Likes Residual Attention | Dec 21, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Feb 17, 2025 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 | 5 |