-
Notifications
You must be signed in to change notification settings - Fork 90
How to create pronunciation lexicon for Bengali? #23
Comments
Do you mean a program that can take in arbitrary words and output the transcription for that? |
Yes, how can I do it? |
The quick answer is no. |
Hope you will share if u find an idea. |
We merged lexicon words from [1] and [2]. The total number of unique lexicon is 64969. 4443 unique words of our corpus do not exist in merged lexicon. What can be the best procedure for transcribing 4443 words to lexicon? [1] https://github.com/google/language-resources/blob/master/bn/data/lexicon.tsv |
Note that [2] is generated from [1].
The transcription guide can be found at [3] [1] https://github.com/google/language-resources/blob/master/bn/data/lexicon.tsv |
For creating a pronunciation of a word, we have to do two task (phoneme finding and splitting into syllable). I think spitting into syllable is a big deal. Expected format [1]. How can I do it programmatically?
[1] https://github.com/google/language-resources/blob/master/bn/data/lexicon.tsv
The text was updated successfully, but these errors were encountered: