arxiv:2412.04092

GEITje 7B Ultra: A Conversational Model for Dutch

Published on Dec 5, 2024

Abstract

The research enhances the GEITje model, derived from Mistral 7B, through supervised finetuning with synthetic conversational datasets and preference alignment to improve its capabilities in Dutch.

Generated by Qwen/Qwen2.5-Coder-32B-Instruct

Language models have evolved rapidly, predominantly focusing on English while often neglecting extensive pretraining in other languages. This has necessitated initiatives to adapt powerful, English-centric models to other linguistic contexts through finetuning. For Dutch, one such recent endeavour is "GEITje", a model originally derived from the English-based Mistral 7B. Building on this foundational work, the current research extends the capabilities of GEITje through supervised finetuning on newly created, high-quality synthetic conversational datasets, along with an additional preference alignment procedure on a synthetic feedback dataset. Both the developed models and the created datasets are openly available.