Multilingual Language Models
Multilingual Language Models
Norwegian health and care services use an intricate mix of Norwegian and international medical terminology. This requires advanced embedding spaces that can handle multiple languages simultaneously.
An embedding space is a multidimensional mathematical space where words, sentences, images, or other data are represented as vectors. Similar concepts are placed near each other based on semantic or contextual similarity.
Modern language models implement this through shared vector spaces for medical terminology, while maintaining separate representations for Bokmål and Nynorsk with a common medical vocabulary as foundation. Extensive testing has shown that this architecture, especially with dedicated medical embedding levels, results in a significant increase in precision in cross-linguistic medical communication.
Source: https://link.springer.com/article/10.1007/s10462-025-11162-5