Summary:
We first examined the representation of the 20 languages in M-BERT by deriving language
identity representations from 1000 labeled corpora. The high performance of the language
identification model in distinguishing the languages in M-BERT (mean F1 score 0.999)
indicated that BERT models encode strong language-specific information during
pretraining. We then tested M-BERT's capability of differentiating between pairs of
languages. By feeding the model prompts that pair the name of a language with a token
drawn from one of the two languages, we used the model's output probability to
determine which language the input token belonged to. This is effectively a language
disambiguation task, and it provides a measure of the model's ability to
differentiate between, and understand, pairs of languages. This simple disambiguation setup,
combined with the model's probability judgments, could serve as a test revealing which
distinctions the model is capable of making for any given pair of languages.
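The disambiguation decision rule described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the log-probability table is a hypothetical stand-in for the scores M-BERT would assign when given a prompt naming a language together with a candidate token, and the function names (`model_logprob`, `disambiguate`) are invented for this sketch.

```python
import math

# Hypothetical stand-in for M-BERT output probabilities. In the actual setup,
# each entry would come from feeding the model a prompt containing the language
# name and the token, then reading off the model's output probability.
TOY_LOGPROBS = {
    ("English", "house"): math.log(0.08),
    ("German",  "house"): math.log(0.002),
    ("English", "Haus"):  math.log(0.001),
    ("German",  "Haus"):  math.log(0.05),
}

def model_logprob(language: str, token: str) -> float:
    """Log-probability the model assigns to `token` under a prompt naming `language`."""
    return TOY_LOGPROBS[(language, token)]

def disambiguate(token: str, lang_a: str, lang_b: str) -> str:
    """Assign `token` to whichever of the two languages the model scores higher."""
    if model_logprob(lang_a, token) >= model_logprob(lang_b, token):
        return lang_a
    return lang_b

print(disambiguate("Haus", "English", "German"))   # → German
print(disambiguate("house", "English", "German"))  # → English
```

Aggregating the accuracy of such binary decisions over many tokens for a language pair yields the pairwise differentiation measure the task is meant to capture.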