User:Manetta/scripts/python-translate-to-computer-phonemes
translating text into phonemes used by Sphinx
article on Wikipedia about the cmu07a.dic
using the CMU dictionary file from the software package Sphinx (cmu07a.dic)
for download here: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/
cmu07a.dic looks like:
abso AE B S OW absolom AE B S AH L AH M absolut AE B S AH L UW T absolut's AE B S AH L UW T S absolute AE B S AH L UW T absolutely AE B S AH L UW T L IY absoluteness AE B S AH L UW T N AH S absolutes AE B S AH L UW T S absolution AE B S AH L UW SH AH N absolutism AE B S AH L UW T IH Z AH M absolutist AE B S IH L UW T IH S T absolve AH B Z AA L V
import re
import os
with open('output.txt', 'w') as txt:
x = open('input.txt', 'r')
searchlines = x.readlines()
x.close()
print searchlines
search = searchlines[0].split(" ")
print search[0]
for i, searchitem in enumerate(search):
print searchitem
dic = open('cmu07a.dic', 'r')
for line in dic:
if re.match(searchitem, line):
print line
break
txt.write(line), "\n"
dic.close()
call K AO L me M IY echo EH K OW
my M AY wife W AY F is IH Z echo EH K OW
my M AY brother B R AH DH ER is IH Z echo EH K OW
echo EH K OW is IH Z my M AY mom M AA M
my M AY boss B AA S name N EY M is IH Z echo EH K OW
my M AY dad D AE D is IH Z echo EH K OW