User:Manetta/scripts/python-translate-to-computer-phonemes: Difference between revisions

From XPUB & Lens-Based wiki
No edit summary
Line 2: Line 2:


using the CMU dictionary file from the software package [http://cmusphinx.sourceforge.net/ Sphinx] (cmu07a.dic)<br>
using the CMU dictionary file from the software package [http://cmusphinx.sourceforge.net/ Sphinx] (cmu07a.dic)<br>
for download here: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/
for download here: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/ <br>
'''cmu07a.dic''' looks like: <br>


abso AE B S OW
absolom AE B S AH L AH M
absolut AE B S AH L UW T
absolut's AE B S AH L UW T S
absolute AE B S AH L UW T
absolutely AE B S AH L UW T L IY
absoluteness AE B S AH L UW T N AH S
absolutes AE B S AH L UW T S
absolution AE B S AH L UW SH AH N
absolutism AE B S AH L UW T IH Z AH M
absolutist AE B S IH L UW T IH S T
absolve AH B Z AA L V


[[File:Mb-echo-semantic-simulations-01-page005.png | 500px]]
[[File:Mb-echo-semantic-simulations-01-page005.png | 500px]]

Revision as of 11:38, 24 March 2015

translating text into computer phonemes

using the CMU dictionary file from the software package Sphinx (cmu07a.dic)
for download here: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/
cmu07a.dic looks like:

abso	AE B S OW
absolom	AE B S AH L AH M
absolut	AE B S AH L UW T
absolut's	AE B S AH L UW T S
absolute	AE B S AH L UW T
absolutely	AE B S AH L UW T L IY
absoluteness	AE B S AH L UW T N AH S
absolutes	AE B S AH L UW T S
absolution	AE B S AH L UW SH AH N
absolutism	AE B S AH L UW T IH Z AH M
absolutist	AE B S IH L UW T IH S T
absolve	AH B Z AA L V

Mb-echo-semantic-simulations-01-page005.png


import re
import os

with open('output.txt', 'w') as txt:

	x = open('input.txt', 'r')
	searchlines = x.readlines()
	x.close()
	print searchlines
	search = searchlines[0].split(" ")
	print search[0]

	for i, searchitem in enumerate(search):
		print searchitem
		dic = open('cmu07a.dic', 'r')
		for line in dic:
			if re.match(searchitem, line): 
				print line
				break		
				txt.write(line), "\n"
		dic.close()


call	K AO L
me	M IY
echo	EH K OW
my	M AY
wife	W AY F
is	IH Z
echo	EH K OW
my	M AY
brother	B R AH DH ER
is	IH Z
echo	EH K OW
echo	EH K OW
is	IH Z
my	M AY
mom	M AA M
my	M AY
boss	B AA S
name	N EY M
is	IH Z
echo	EH K OW
my	M AY
dad	D AE D
is	IH Z
echo	EH K OW