The Impact Of Non-Standard Words And Pronunciations On Text-To-Speech Quality
In English news wire text, on average one word out of twenty is non-standard, i.e. not simply made up of letters from the English alphabet. Examples are abbreviations, numbers, dates, times, and other measures. On top of these non-standard words comes an endless list of (foreign) names of people, places, products or companies, whose pronunciation is unknown and potentially irregular. When confronted with unknown words, modern text-to-speech engines try to guess the correct pronunciation based on the written form of the word, sometimes with weird results.
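To make the definition above concrete, here is a minimal sketch that flags tokens not made up solely of letters from the English alphabet. It is not how any particular speech engine works; the function name `non_standard_tokens`, the punctuation set and the sample sentence are our own assumptions for illustration. A letters-only test is also deliberately crude: it misses abbreviations and names that are spelled with plain letters.

```python
import re

# Assumption: a "standard" word consists only of the letters A-Z / a-z.
LETTERS_ONLY = re.compile(r"[A-Za-z]+")
# Assumption: punctuation stripped from both ends of a token before testing it.
EDGE_PUNCTUATION = ".,;:!?\"'()"

def non_standard_tokens(text):
    """Return the whitespace-separated tokens that are not purely alphabetic."""
    cores = (token.strip(EDGE_PUNCTUATION) for token in text.split())
    return [core for core in cores if core and LETTERS_ONLY.fullmatch(core) is None]

# Hypothetical sample sentence, not taken from real news wire text.
sample = "The meeting is at 10:30 on 4/11/2007 and costs $25."
flagged = non_standard_tokens(sample)
print(flagged)
```

On the sample sentence, the time, the date and the amount are flagged, while the seven ordinary words pass through.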
In this post we show two (admittedly contrived) example sentences, each with two pronunciations. The first pronunciation is what the engine produces out of the box, whereas the second one benefits from a manually edited pronunciation dictionary.
Here’s the first sentence:
Out of the box, it is pronounced like this: [audio: sentence1_orig]
With the abbreviations ‘Maj.’ and ‘NYTimes’ expanded and phonetic transcriptions for ‘Netanyahu’, ‘Paypal’ and ‘GMail’, the sentence becomes much more comprehensible. Judge for yourself: [audio: sentence1_enhanced]
Here’s a second sentence:
The out-of-the-box pronunciation goes like this: [audio: sentence2_orig]
By expanding the abbreviation ‘Jlem’ (= ‘Jerusalem’) and pronouncing ‘odiogo’ as a word rather than as an abbreviation, the overall quality is greatly enhanced: [audio: sentence2_enhanced]
Note that we decided not to expand the abbreviations ‘LLC.’ and ‘US’.
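The abbreviation part of such a dictionary can be pictured as a simple lookup applied to the text before it reaches the speech engine. The sketch below is only an illustration under our own assumptions, not Odiogo's actual implementation: the table `EXPANSIONS`, the function `expand_abbreviations` and the test sentence are hypothetical, and phonetic transcriptions are left out because their format depends on the engine.

```python
import re

# Hypothetical expansion table built from the abbreviations mentioned in this post.
# 'LLC.' and 'US' are intentionally absent, so they pass through unchanged.
EXPANSIONS = {
    "Maj.": "Major",
    "NYTimes": "New York Times",
    "Jlem": "Jerusalem",
}

def expand_abbreviations(text, expansions=EXPANSIONS):
    """Replace each known abbreviation that stands alone as a token."""
    # Try longer keys first so a short key never shadows a longer one.
    keys = sorted(expansions, key=len, reverse=True)
    alternatives = "|".join(re.escape(key) for key in keys)
    # The lookarounds prevent matches inside longer words.
    pattern = re.compile(r"(?<!\w)(" + alternatives + r")(?!\w)")
    return pattern.sub(lambda match: expansions[match.group(1)], text)

# Hypothetical input, not one of the two example sentences above.
expanded = expand_abbreviations("Maj. Smith read the NYTimes in Jlem, US.")
print(expanded)
```

Keeping the table separate from the matching logic means that an editor can add or remove entries without touching the code.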
With this post we wanted to show you a few very simple examples of how Odiogo enhances the out-of-the-box quality of its speech synthesis engine. If you want to learn about more advanced ways of improving text-to-speech quality, fetch the White Paper “Turning news & blog articles into high-fidelity computer-generated audio” from our download page.
November 4th, 2007 at 4:03 pm
This should really be in your FAQ. So should the white paper, or at least a link to the download page for it. Blogs are great, but I go to the FAQ first.