(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts May 20, 2024 Machine Translation Translation
Code Code Available 9Seamless: Multilingual Expressive and Streaming Speech Translation Dec 8, 2023 automatic-speech-translation Machine Translation
Code Code Available 6StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning Jun 5, 2024 Automatic Speech Recognition (ASR) de-en
Code Code Available 5ChatGPT MT: Competitive for High- (but not Low-) Resource Languages Sep 14, 2023 Machine Translation
Code Code Available 5How to Design Translation Prompts for ChatGPT: An Empirical Study Apr 5, 2023 Machine Translation Natural Language Understanding
Code Code Available 5M-Prometheus: A Suite of Open Multilingual LLM Judges Apr 7, 2025 Machine Translation Model Selection
Code Code Available 5Knowledge Fusion of Large Language Models Jan 19, 2024 Code Generation Common Sense Reasoning
Code Code Available 4Efficient Post-training Quantization with FP8 Formats Sep 26, 2023 image-classification Image Classification
Code Code Available 4Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Jan 16, 2024 Decoder Machine Translation
Code Code Available 4DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Dec 23, 2024 Machine Translation Math
Code Code Available 3ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement Learning May 19, 2025 Machine Translation reinforcement-learning
Code Code Available 3Ludwig: a type-based declarative deep learning toolbox Sep 17, 2019 Decoder Deep Learning
Code Code Available 38-bit Optimizers via Block-wise Quantization Oct 6, 2021 Language Modeling Language Modelling
Code Code Available 3Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation May 30, 2023 Machine Translation Segmentation
Code Code Available 3Accelerating Transformer Inference for Translation via Parallel Decoding May 17, 2023 Machine Translation Translation
Code Code Available 3Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset May 17, 2024 16k Benchmarking
Code Code Available 3TextBox 2.0: A Text Generation Library with Pre-trained Language Models Dec 26, 2022 Abstractive Text Summarization Data-to-Text Generation
Code Code Available 3Towards Fully Automated Manga Translation Dec 28, 2020 Machine Translation Translation
Code Code Available 3Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All
Code Code Available 3Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates Sep 27, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3Finetuned Language Models Are Zero-Shot Learners Sep 3, 2021 ARC Common Sense Reasoning
Code Code Available 3Bird-Eye Transformers for Text Generation Models Oct 8, 2022 Attribute Inductive Bias
Code Code Available 3MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation Mar 26, 2024 Cross-Lingual Transfer Language Modelling
Code Code Available 2Mega: Moving Average Equipped Gated Attention Sep 21, 2022 Image Classification Inductive Bias
Code Code Available 2MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning Apr 14, 2025 Machine Translation Reinforcement Learning (RL)
Code Code Available 2LightSeq2: Accelerated Training for Transformer-based Models on GPUs Oct 12, 2021 Decoder GPU
Code Code Available 2Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges Apr 24, 2024 Drug Design Inductive Bias
Code Code Available 2MASS: Masked Sequence to Sequence Pre-training for Language Generation May 7, 2019 Conversational Response Generation Decoder
Code Code Available 2LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Oct 7, 2023 Audio captioning Automatic Speech Recognition
Code Code Available 2Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings Oct 23, 2022 Cross-Lingual NER Cross-Lingual Transfer
Code Code Available 2LightSeq: A High Performance Inference Library for Transformers Oct 23, 2020 GPU Machine Translation
Code Code Available 2Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine Jan 20, 2023 Machine Translation Sentence
Code Code Available 2A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Sep 20, 2023 Language Modelling Machine Translation
Code Code Available 2Inseq: An Interpretability Toolkit for Sequence Generation Models Feb 27, 2023 Decoder Feature Importance
Code Code Available 2Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level Jun 22, 2024 Machine Translation Translation
Code Code Available 2MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation Apr 4, 2025 Machine Translation Translation
Code Code Available 2GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism Nov 16, 2018 Fine-Grained Image Classification image-classification
Code Code Available 2HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation May 19, 2023 Hallucination Machine Translation
Code Code Available 2Exploring Human-Like Translation Strategy with Large Language Models May 6, 2023 Hallucination Machine Translation
Code Code Available 2Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems Mar 18, 2024 Machine Translation Translation
Code Code Available 2Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Oct 23, 2019 Answer Generation Common Sense Reasoning
Code Code Available 2Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms Jun 5, 2024 Low-Rank Matrix Completion Machine Translation
Code Code Available 2Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate May 30, 2023 Arithmetic Reasoning Machine Translation
Code Code Available 2TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement Feb 26, 2024 Machine Translation Translation
Code Code Available 2GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators Feb 10, 2024 Machine Translation Speech-to-Speech Translation
Code Code Available 2AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model Aug 2, 2022 Causal Language Modeling Common Sense Reasoning
Code Code Available 2Cross-lingual and Multilingual CLIP Jun 1, 2022 Contrastive Learning Image-text Retrieval
Code Code Available 2IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages May 25, 2023 All Machine Translation
Code Code Available 2DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory Oct 10, 2024 Document Translation Machine Translation
Code Code Available 2CATT: Character-based Arabic Tashkeel Transformer Jul 3, 2024 Arabic Text Diacritization Decoder
Code Code Available 2