Exploring the traditional NMT model and Large Language Model for chat translation

2024-09-24

Jinlong Yang, Hengchao Shang, Daimeng Wei, Jiaxin Guo, Zongyao Li, Zhanglin Wu, Zhiqiang Rao, Shaojun Li, Yuhao Xie, Yuanchang Luo, Jiawei Zheng, Bin Wei, Hao Yang

Abstract

This paper describes the submissions of Huawei Translation Services Center (HW-TSC) to the WMT24 chat translation shared task on English-German (en-de) in both directions. The experiments involved fine-tuning models on chat data and exploring various strategies, including Minimum Bayes Risk (MBR) decoding and self-training. The results show significant performance improvements in certain directions, with the MBR self-training method achieving the best results. The paper also discusses the challenges and potential avenues for further research in the field of chat translation.
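
For readers unfamiliar with MBR decoding, the following is a minimal sketch of the general idea, not the authors' implementation. Given several candidate translations of one source sentence, it picks the candidate that agrees most, on average, with the other candidates, which serve as pseudo-references. The utility function here (`unigram_f1`, a toy token-overlap F1) and the helper name `mbr_select` are assumptions made for illustration; the abstract does not state which utility metric or candidate pool the submission used.

```python
from collections import Counter


def unigram_f1(hyp: str, ref: str) -> float:
    """Toy utility (an assumption): token-level F1 between two strings."""
    h, r = Counter(hyp.split()), Counter(ref.split())
    overlap = sum((h & r).values())  # multiset intersection of tokens
    if overlap == 0:
        return 0.0
    precision = overlap / sum(h.values())
    recall = overlap / sum(r.values())
    return 2 * precision * recall / (precision + recall)


def mbr_select(candidates, utility=unigram_f1):
    """Return the candidate with the highest mean utility against the others."""
    if not candidates:
        raise ValueError("need at least one candidate")
    if len(candidates) == 1:
        return candidates[0]

    def expected_utility(i):
        # Every other candidate acts as a pseudo-reference for candidate i.
        others = [c for j, c in enumerate(candidates) if j != i]
        return sum(utility(candidates[i], ref) for ref in others) / len(others)

    best = max(range(len(candidates)), key=expected_utility)
    return candidates[best]


# Hypothetical candidate translations for a single chat turn.
candidates = [
    "thank you for your patience",
    "thanks for your patience",
    "thank you for waiting",
]
selected = mbr_select(candidates)
print(selected)
```

In this toy run the first candidate is selected because it shares the most tokens with the other two. One plausible reading of "MBR self-training" is that outputs selected this way are paired with their source sentences and reused as synthetic fine-tuning data, but the abstract does not give those details.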

Tasks

Language Modeling, Large Language Model, NMT, Translation
