Converts any language(A) speech into one with the accent of a certain language(B).
Train data is ONLY NATIVE SPEECH. No L2 speech data is needed.
We're using GSLM
architecture to generate accent.
If the model is trained with Native-B speech, the model will recognize the input speech with Native-B
phonological representation.
Thus, the model will generate accent of Native-B.
| A \ B accented | Original | English | Japanese | Chinese | Spanish | French |
|---|---|---|---|---|---|---|
| English | ||||||
| Japanese | ||||||
| Chinese | ||||||
| Spanish | ||||||
| French |