User:Manetta/scripts/python-translate-to-computer-phonemes
Revision as of 10:38, 24 March 2015
translating text into computer phonemes
using the CMU dictionary file (cmu07a.dic) from the software package Sphinx,
available for download here: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/
cmu07a.dic looks like:
abso AE B S OW
absolom AE B S AH L AH M
absolut AE B S AH L UW T
absolut's AE B S AH L UW T S
absolute AE B S AH L UW T
absolutely AE B S AH L UW T L IY
absoluteness AE B S AH L UW T N AH S
absolutes AE B S AH L UW T S
absolution AE B S AH L UW SH AH N
absolutism AE B S AH L UW T IH Z AH M
absolutist AE B S IH L UW T IH S T
absolve AH B Z AA L V
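Each entry in the excerpt above is a word followed by its phonemes, separated by whitespace. A minimal sketch of reading such entries into a lookup table could look like the following. The names `SAMPLE`, `parse_entry` and `parse_dictionary` are made up for this illustration, and the sample text is copied from the excerpt rather than read from cmu07a.dic.

```python
# Sample entries copied from the excerpt above (assumed: one entry per line).
SAMPLE = """\
absolute AE B S AH L UW T
absolutely AE B S AH L UW T L IY
absolve AH B Z AA L V
"""


def parse_entry(line):
    """Split one dictionary line into (word, [phonemes])."""
    parts = line.split()
    return parts[0], parts[1:]


def parse_dictionary(lines):
    """Build a word -> phoneme-list mapping, skipping blank lines."""
    entries = {}
    for line in lines:
        if line.strip():
            word, phonemes = parse_entry(line)
            # Keep the first pronunciation if the same word appears again.
            entries.setdefault(word, phonemes)
    return entries


if __name__ == "__main__":
    entries = parse_dictionary(SAMPLE.splitlines())
    print(entries["absolve"])  # ['AH', 'B', 'Z', 'AA', 'L', 'V']
```

To use the real file, the lines of an opened cmu07a.dic could be passed to `parse_dictionary` instead of the sample.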
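import re

# read the first line of input.txt and split it into words
# (split() without arguments also drops the trailing newline)
with open('input.txt', 'r') as x:
    searchlines = x.readlines()
print(searchlines)
search = searchlines[0].split()
print(search[0])

with open('output.txt', 'w') as txt:
    for searchitem in search:
        print(searchitem)
        # scan the dictionary again for every word
        with open('cmu07a.dic', 'r') as dic:
            for line in dic:
                # match the whole word at the start of the line, not just a prefix
                if re.match(re.escape(searchitem) + r'\s', line):
                    print(line)
                    # the dictionary line already ends with a newline
                    txt.write(line)
                    break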
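The script above reopens and scans the dictionary once for every word. A possible alternative, sketched here under assumptions, is to hold the entries in memory and look each word up directly. `LEXICON` and `translate` are hypothetical names for this illustration, and the few pronunciations are copied from the output on this page instead of being loaded from cmu07a.dic.

```python
# Hypothetical in-memory lexicon; pronunciations copied from the output on this page.
LEXICON = {
    "call": "K AO L",
    "me": "M IY",
    "echo": "EH K OW",
    "my": "M AY",
    "is": "IH Z",
}


def translate(text, lexicon):
    """Return one 'word PHONEMES' string per word that is found in the lexicon."""
    lines = []
    for word in text.lower().split():
        # Words missing from the lexicon are skipped silently in this sketch.
        if word in lexicon:
            lines.append(word + " " + lexicon[word])
    return lines


if __name__ == "__main__":
    for entry in translate("call me echo", LEXICON):
        print(entry)
    # prints:
    # call K AO L
    # me M IY
    # echo EH K OW
```

Lowercasing the input is a choice made for this sketch, on the assumption that the dictionary words are lowercase as in the excerpt.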
example output (each word followed by its phonemes):
call K AO L me M IY echo EH K OW
my M AY wife W AY F is IH Z echo EH K OW
my M AY brother B R AH DH ER is IH Z echo EH K OW
echo EH K OW is IH Z my M AY mom M AA M
my M AY boss B AA S name N EY M is IH Z echo EH K OW
my M AY dad D AE D is IH Z echo EH K OW