(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts May 20, 2024 Machine Translation Translation
Code Code Available 9Seamless: Multilingual Expressive and Streaming Speech Translation Dec 8, 2023 automatic-speech-translation Machine Translation
Code Code Available 6M-Prometheus: A Suite of Open Multilingual LLM Judges Apr 7, 2025 Machine Translation Model Selection
Code Code Available 5StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning Jun 5, 2024 Automatic Speech Recognition (ASR) de-en
Code Code Available 5ChatGPT MT: Competitive for High- (but not Low-) Resource Languages Sep 14, 2023 Machine Translation
Code Code Available 5How to Design Translation Prompts for ChatGPT: An Empirical Study Apr 5, 2023 Machine Translation Natural Language Understanding
Code Code Available 5Knowledge Fusion of Large Language Models Jan 19, 2024 Code Generation Common Sense Reasoning
Code Code Available 4Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation Jan 16, 2024 Decoder Machine Translation
Code Code Available 4Efficient Post-training Quantization with FP8 Formats Sep 26, 2023 image-classification Image Classification
Code Code Available 4ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement Learning May 19, 2025 Machine Translation reinforcement-learning
Code Code Available 3DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Dec 23, 2024 Machine Translation Math
Code Code Available 3Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset May 17, 2024 16k Benchmarking
Code Code Available 3Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation May 30, 2023 Machine Translation Segmentation
Code Code Available 3Accelerating Transformer Inference for Translation via Parallel Decoding May 17, 2023 Machine Translation Translation
Code Code Available 3TextBox 2.0: A Text Generation Library with Pre-trained Language Models Dec 26, 2022 Abstractive Text Summarization Data-to-Text Generation
Code Code Available 3Bird-Eye Transformers for Text Generation Models Oct 8, 2022 Attribute Inductive Bias
Code Code Available 38-bit Optimizers via Block-wise Quantization Oct 6, 2021 Language Modeling Language Modelling
Code Code Available 3Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates Sep 27, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3Finetuned Language Models Are Zero-Shot Learners Sep 3, 2021 ARC Common Sense Reasoning
Code Code Available 3Towards Fully Automated Manga Translation Dec 28, 2020 Machine Translation Translation
Code Code Available 3Ludwig: a type-based declarative deep learning toolbox Sep 17, 2019 Decoder Deep Learning
Code Code Available 3Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All
Code Code Available 3MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning Apr 14, 2025 Machine Translation Reinforcement Learning (RL)
Code Code Available 2MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation Apr 4, 2025 Machine Translation Translation
Code Code Available 2WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects Feb 18, 2025 Machine Translation
Code Code Available 2DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory Oct 10, 2024 Document Translation Machine Translation
Code Code Available 2Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Aug 8, 2024 Language Modeling Language Modelling
Code Code Available 2CATT: Character-based Arabic Tashkeel Transformer Jul 3, 2024 Arabic Text Diacritization Decoder
Code Code Available 2Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level Jun 22, 2024 Machine Translation Translation
Code Code Available 2Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms Jun 5, 2024 Low-Rank Matrix Completion Machine Translation
Code Code Available 2TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation May 28, 2024 Machine Translation speech-recognition
Code Code Available 2IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Apr 25, 2024 Cross-Lingual Question Answering Diversity
Code Code Available 2Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges Apr 24, 2024 Drug Design Inductive Bias
Code Code Available 2MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation Mar 26, 2024 Cross-Lingual Transfer Language Modelling
Code Code Available 2Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems Mar 18, 2024 Machine Translation Translation
Code Code Available 2TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement Feb 26, 2024 Machine Translation Translation
Code Code Available 2GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators Feb 10, 2024 Machine Translation Speech-to-Speech Translation
Code Code Available 2LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Oct 7, 2023 Audio captioning Automatic Speech Recognition
Code Code Available 2Quantifying the Plausibility of Context Reliance in Neural Machine Translation Oct 2, 2023 Machine Translation Translation
Code Code Available 2A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models Sep 20, 2023 Language Modelling Machine Translation
Code Code Available 2OWL: A Large Language Model for IT Operations Sep 17, 2023 Language Modeling Language Modelling
Code Code Available 2SONAR: Sentence-Level Multimodal and Language-Agnostic Representations Aug 22, 2023 Decoder Machine Translation
Code Code Available 2SeamlessM4T: Massively Multilingual & Multimodal Machine Translation Aug 22, 2023 Automatic Speech Recognition Machine Translation
Code Code Available 2Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate May 30, 2023 Arithmetic Reasoning Machine Translation
Code Code Available 2BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages May 29, 2023 Machine Translation Translation
Code Code Available 2IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages May 25, 2023 All Machine Translation
Code Code Available 2HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation May 19, 2023 Hallucination Machine Translation
Code Code Available 2Exploring Human-Like Translation Strategy with Large Language Models May 6, 2023 Hallucination Machine Translation
Code Code Available 2Self-Supervised Multimodal Learning: A Survey Mar 31, 2023 Machine Translation Self-Supervised Learning
Code Code Available 2Stabilizing Transformer Training by Preventing Attention Entropy Collapse Mar 11, 2023 Automatic Speech Recognition image-classification
Code Code Available 2