Knesset-DictaBERT: A Hebrew Language Model for Parliamentary Proceedings
2024-07-30Unverified0· sign in to hype
Gili Goldin, Shuly Wintner
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
We present Knesset-DictaBERT, a large Hebrew language model fine-tuned on the Knesset Corpus, which comprises Israeli parliamentary proceedings. The model is based on the DictaBERT architecture and demonstrates significant improvements in understanding parliamentary language according to the MLM task. We provide a detailed evaluation of the model's performance, showing improvements in perplexity and accuracy over the baseline DictaBERT model.