CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data Apr 28, 2023 document understanding Language Modeling
Code Code Available 1CC-Riddle: A Question Answering Dataset of Chinese Character Riddles Jun 28, 2022 General Knowledge Language Modelling
Code Code Available 1ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation Dec 23, 2021 Language Modeling Language Modelling
Code Code Available 1EscapeBench: Pushing Language Models to Think Outside the Box Dec 18, 2024 Language Modeling Language Modelling
Code Code Available 1A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Entity-aware Transformers for Entity Search May 2, 2022 Entity Embeddings Entity Retrieval
Code Code Available 1Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval Oct 4, 2024 Descriptive Language Modeling
Code Code Available 1Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost Mar 15, 2022 Contrastive Learning Language Modeling
Code Code Available 1Entity Tracking in Language Models May 3, 2023 Language Modeling Language Modelling
Code Code Available 1In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning Aug 8, 2023 In-Context Learning Language Modeling
Code Code Available 1Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method Jun 11, 2023 Knowledge Distillation Language Modeling
Code Code Available 1In-Context Learning with Many Demonstration Examples Feb 9, 2023 16k 8k
Code Code Available 1ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences Nov 10, 2023 Dialogue Generation Language Modeling
Code Code Available 1Enhancing Vision-Language Model with Unmasked Token Alignment May 29, 2024 Language Modeling Language Modelling
Code Code Available 1Incorporating Large Language Models into Production Systems for Enhanced Task Automation and Flexibility Jul 11, 2024 Language Modeling Language Modelling
Code Code Available 1VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups Jun 1, 2021 Language Modeling Language Modelling
Code Code Available 1Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement Feb 9, 2024 Code Generation Decision Making
Code Code Available 1Cerbero-7B: A Leap Forward in Language-Specific LLMs Through Enhanced Chat Corpus Generation and Evaluation Nov 27, 2023 Diversity Language Modelling
Code Code Available 1ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain May 20, 2023 De-identification Language Modeling
Code Code Available 1Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction Jul 4, 2024 Language Modeling Language Modelling
Code Code Available 1Does It Make Sense? And Why? A Pilot Study for Sense Making and Explanation Jun 2, 2019 Common Sense Reasoning Language Modeling
Code Code Available 1Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning May 24, 2023 Language Modeling Language Modelling
Code Code Available 1InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model Mar 4, 2025 es-en Language Modeling
Code Code Available 1CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model Nov 10, 2023 Language Modeling Language Modelling
Code Code Available 1InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings Oct 8, 2022 Contrastive Learning Language Modeling
Code Code Available 1CFGPT: Chinese Financial Assistant with Large Language Model Sep 19, 2023 Decision Making Financial Analysis
Code Code Available 1Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities Nov 30, 2023 Audio Classification Few-Shot Audio Classification
Code Code Available 1RealFormer: Transformer Likes Residual Attention Dec 21, 2020 Language Modeling Language Modelling
Code Code Available 1Large Language Models are Learnable Planners for Long-Term Recommendation Feb 29, 2024 Decision Making Language Modelling
Code Code Available 1-former: Infinite Memory Transformer May 1, 2022 Dialogue Generation Language Modeling
Code Code Available 1Enhancing Indic Handwritten Text Recognition Using Global Semantic Information Dec 15, 2022 Decoder Handwritten Text Recognition
Code Code Available 1Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation Dec 16, 2022 Answer Generation Decoder
Code Code Available 1Enhancing Domain Adaptation through Prompt Gradient Alignment Jun 13, 2024 Domain Adaptation Language Modeling
Code Code Available 1Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation Jun 28, 2023 Chatbot Dialogue Generation
Code Code Available 1CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning Jul 30, 2024 Contrastive Learning Diagnostic
Code Code Available 1Chain of Images for Intuitively Reasoning Nov 9, 2023 Common Sense Reasoning Language Modelling
Code Code Available 1Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources May 22, 2023 Hallucination Language Modelling
Code Code Available 1InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER Mar 8, 2022 Entity Typing Few-Shot Learning
Code Code Available 1XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection Feb 27, 2024 Language Modeling Language Modelling
Code Code Available 1Enhancing Clinical BERT Embedding using a Biomedical Knowledge Base Dec 1, 2020 Language Modeling Language Modelling
Code Code Available 1An Analysis and Mitigation of the Reversal Curse Nov 13, 2023 Denoising Language Modelling
Code Code Available 1Enhancing Chinese Pre-trained Language Model via Heterogeneous Linguistics Graph May 1, 2022 Language Modeling Language Modelling
Code Code Available 1Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection Apr 10, 2023 Action Detection Language Modeling
Code Code Available 1Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting Oct 15, 2023 Conversational Search Language Modeling
Code Code Available 1Intermediate Training of BERT for Product Matching Aug 31, 2020 Entity Resolution Language Modeling
Code Code Available 1A Comparison of Methods for OOV-word Recognition on a New Public Dataset Jul 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions Feb 10, 2021 Denoising Image Segmentation
Code Code Available 1Enhancing Biomedical Relation Extraction with Directionality Jan 23, 2025 Benchmarking Document-level Relation Extraction
Code Code Available 1Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning Sep 19, 2024 Change Detection Decoder
Code Code Available 1End-to-end lyrics Recognition with Voice to Singing Style Transfer Feb 17, 2021 Data Augmentation Language Modeling
Code Code Available 1