Efficient Content-Based Sparse Attention with Routing Transformers | Mar 12, 2020 | Image Generation, Language Modeling | Code Available | 1
ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language Modeling, Language Modelling | Code Available | 1
An Efficient Self-Supervised Cross-View Training For Sentence Embedding | Nov 6, 2023 | Contrastive Learning, Language Modeling | Code Available | 1
What do you learn from context? Probing for sentence structure in contextualized word representations | May 15, 2019 | Language Modelling, Sentence | Code Available | 1
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling | Dec 18, 2024 | Language Modeling, Language Modelling | Code Available | 1
End-to-End Beam Retrieval for Multi-Hop Question Answering | Aug 17, 2023 | Language Modelling, Large Language Model | Code Available | 1
An Efficient Multilingual Language Model Compression through Vocabulary Trimming | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 1
EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction Following, Language Modeling | Code Available | 1
DziriBERT: a Pre-trained Language Model for the Algerian Dialect | Sep 25, 2021 | Language Modeling, Language Modelling | Code Available | 1
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs | Oct 13, 2024 | Language Modeling, Language Modelling | Code Available | 1
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Oct 3, 2023 | Arithmetic Reasoning, Code Generation | Code Available | 1
Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing | Jul 26, 2024 | Attribute, Language Modelling | Code Available | 1
Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs | May 1, 2022 | Constituency Grammar Induction, Language Modeling | Code Available | 1
When LLMs Play the Telephone Game: Cultural Attractors as Conceptual Tools to Evaluate LLMs in Multi-turn Settings | Jul 5, 2024 | Language Modelling | Code Available | 1
Dynamic Contextualized Word Embeddings | Oct 23, 2020 | Language Modeling, Language Modelling | Code Available | 1
WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words | Dec 5, 2023 | Language Modeling, Language Modelling | Code Available | 1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models | Sep 23, 2023 | Code Completion, Hallucination | Code Available | 1
Why do language models perform worse for morphologically complex languages? | Nov 21, 2024 | Language Modeling, Language Modelling | Code Available | 1
Dynamic Grained Encoder for Vision Transformers | Jan 10, 2023 | image-classification, Image Classification | Code Available | 1
Dealing with Typos for BERT-based Passage Retrieval and Ranking | Aug 27, 2021 | Information Retrieval, Language Modeling | Code Available | 1
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Jan 1, 2021 | Document Classification, Language Modeling | Code Available | 1
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training | Jan 30, 2025 | Language Modeling, Language Modelling | Code Available | 1
DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities | Feb 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Decoding-Time Language Model Alignment with Multiple Objectives | Jun 27, 2024 | Language Modeling, Language Modelling | Code Available | 1
WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Sep 12, 2024 | Decision Making, Language Modeling | Code Available | 1
DILBERT: Customized Pre-Training for Domain Adaptation with Category Shift, with an Application to Aspect Extraction | Nov 1, 2021 | Aspect Extraction, Domain Adaptation | Code Available | 1
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation | Dec 17, 2024 | Contrastive Learning, Image Segmentation | Code Available | 1
With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition | Nov 1, 2021 | Action Recognition, Language Modeling | Code Available | 1
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla | May 23, 2022 | Conditional Text Generation, Dialogue Generation | Code Available | 1
Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language Modeling, Language Modelling | Code Available | 1
DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines | Nov 17, 2023 | Language Modelling, Large Language Model | Code Available | 1
ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning | Feb 8, 2022 | Benchmarking, Language Modelling | Code Available | 1
Writer-Aware CNN for Parsimonious HMM-Based Offline Handwritten Chinese Text Recognition | Dec 24, 2018 | Handwritten Chinese Text Recognition, Language Modeling | Code Available | 1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | Clustering, Hallucination | Code Available | 1
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering | May 19, 2023 | Language Modeling, Language Modelling | Code Available | 1
Xiwu: A Basis Flexible and Learnable LLM for High Energy Physics | Apr 8, 2024 | Code Generation, Language Modelling | Code Available | 1
Deciphering antibody affinity maturation with language models and weakly supervised learning | Dec 14, 2021 | Language Modeling, Language Modelling | Code Available | 1
XLM-E: Cross-lingual Language Model Pre-training via ELECTRA | Jun 30, 2021 | Language Modeling, Language Modelling | Code Available | 1
Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations | Jul 10, 2023 | Language Modeling, Language Modelling | Code Available | 1
Generator-Retriever-Generator Approach for Open-Domain Question Answering | Jul 21, 2023 | Language Modeling, Language Modelling | Code Available | 1
Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender Systems | Dec 18, 2023 | Knowledge Graphs, Language Modeling | Code Available | 1
On the Learnability of Watermarks for Language Models | Dec 7, 2023 | Decoder, Language Modeling | Code Available | 1
Bayesian Reward Models for LLM Alignment | Feb 20, 2024 | Language Modeling, Language Modelling | Unverified | 0
Advancements in Reordering Models for Statistical Machine Translation | Aug 1, 2013 | Language Modelling, Machine Translation | Unverified | 0
An Effective Data Creation Pipeline to Generate High-quality Financial Instruction Data for Large Language Model | Jul 31, 2023 | Language Modeling, Language Modelling | Unverified | 0
On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating | May 16, 2025 | Language Modeling, Language Modelling | Unverified | 0
Dual-State Capsule Networks for Text Classification | Sep 10, 2021 | Classification, Language Modeling | Unverified | 0
Bayesian Neural Networks with Variance Propagation for Uncertainty Evaluation | Jan 1, 2021 | Bayesian Inference, Computational Efficiency | Unverified | 0
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features | Jun 1, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Advanced Natural-based interaction for the ITAlian language: LLaMAntino-3-ANITA | May 11, 2024 | Computational Efficiency, Language Modelling | Unverified | 0