Converts speech in any language (A) into speech that carries the accent of a target language (B).
The training data is ONLY NATIVE SPEECH. No L2 speech data is needed.
We use the GSLM (Generative Spoken Language Model) architecture to generate the accent.
If the model is trained on Native-B speech, it will interpret the input speech in terms of Native-B
phonological representations.
As a result, the generated speech carries a Native-B accent.
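The idea above can be illustrated with a toy sketch. This is not the project's code and does not use GSLM itself: it assumes 2-D random points stand in for speech frame features, and uses plain k-means as a stand-in for learning a discrete unit inventory. The names `fit_units`, `quantize`, `native_b`, and `input_a` are hypothetical. The point it shows is that when the unit inventory is learned only from Native-B data, any input frame is forced onto the nearest B unit.

```python
import numpy as np


def quantize(features, centroids):
    """Map each frame (row) to the index of its nearest centroid."""
    dists = np.linalg.norm(features[:, None, :] - centroids[None, :, :], axis=-1)
    return dists.argmin(axis=1)


def fit_units(features, n_units, n_iters=20, seed=0):
    """Plain k-means: learn a discrete unit inventory from frame features."""
    rng = np.random.default_rng(seed)
    # Fancy indexing returns a copy, so updating centroids leaves features intact.
    centroids = features[rng.choice(len(features), size=n_units, replace=False)]
    for _ in range(n_iters):
        units = quantize(features, centroids)
        for k in range(n_units):
            members = features[units == k]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[k] = members.mean(axis=0)
    return centroids


# Toy stand-in for Native-B speech features (ASSUMPTION: a real system would
# use learned speech representations, not 2-D random points).
rng = np.random.default_rng(0)
b_sounds = np.array([[0.0, 0.0], [5.0, 0.0], [0.0, 5.0]])
native_b = np.concatenate([c + 0.3 * rng.standard_normal((50, 2)) for c in b_sounds])

# The unit inventory is learned from Native-B data only.
centroids = fit_units(native_b, n_units=3)

# Toy Language-A input, including frames that fall between B's sound categories.
input_a = np.array([[2.0, 0.4], [4.6, 0.2], [0.5, 3.1]])
units_a = quantize(input_a, centroids)

# "Resynthesis" here is just a centroid lookup, so every output frame is a B sound.
resynth = centroids[units_a]
print(units_a)
print(resynth)
```

In this sketch the A-language frames never get units of their own. Each one is replaced by whichever B category lies closest, which is the toy analogue of hearing the input through Native-B phonology.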
A \ B accented | Original | English | Japanese | Chinese | Spanish | French |
---|---|---|---|---|---|---|
English | | | | | | |
Japanese | | | | | | |
Chinese | | | | | | |
Spanish | | | | | | |
French | | | | | | |