Hi!
During preprocessing, when add_blank is True in hparams, the intersperse function (here) inserts an index that is out of vocabulary bounds (item=len(symbols)) between each pair of adjacent tokens.
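To make the behaviour concrete, here is a minimal sketch of what such an intersperse step does. It assumes the helper works like the function below; the symbol list and token ids are made up for illustration, and the repository's actual code may differ. Note that this version also places the blank before the first and after the last token, which is an assumption of the sketch.

```python
def intersperse(tokens, item):
    """Return a new list with `item` before, between, and after the tokens."""
    # Start with a list made only of `item`, then fill the odd slots with tokens.
    result = [item] * (len(tokens) * 2 + 1)
    result[1::2] = tokens
    return result


symbols = ["_", "a", "b", "c"]  # hypothetical vocabulary, not the real symbol set
blank_id = len(symbols)         # one past the last valid index into `symbols`
token_ids = [1, 2, 3]           # hypothetical encoded text

with_blanks = intersperse(token_ids, blank_id)
print(with_blanks)  # [4, 1, 4, 2, 4, 3, 4]
```

Since blank_id equals len(symbols), it does not correspond to any entry of the symbol list, so an embedding table over these ids would need len(symbols) + 1 rows.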
My first guess was that this token plays the role of pauses between tokens, since no pause token is present in the vocabulary. So during training, all pauses shift to this token.
Then, as its name states, I treated it as a blank token that is needed to absorb all the "noise" between adjacent tokens, so that the other tokens represent cleaner phonemes. I also thought it might be used to learn the transition from one phoneme to another, which is not part of either of the two adjacent phonemes but a separate segment. But if so, why is it a single common token for all gaps?
So, what is the real purpose of this blank token?
This question is mainly addressed to the authors, but any guesses are welcome.
Thanks in advance.