GSLM-based Foreign Accent Simulation

Description

Converts any language(A) speech into one with the accent of a certain language(B). Train data is ONLY NATIVE SPEECH. No L2 speech data is needed. We're using GSLM architecture to generate accent. If the model is trained with Native-B speech, the model will recognize the input speech with Native-B phonological representation. Thus, the model will generate accent of Native-B.

Results

All the models are trained with k = 200.
A \ B accented Original English Japanese Chinese Spanish French
English
Japanese
Chinese
Spanish
French