Speech recognition: Difference between revisions

Revision as of 12:17, 5 July 2018

Requirements: Python 3.3+

SpeechRecognition 3.8.1
- pip install SpeechRecognition
- To quickly try it out, run python -m speech_recognition after installing.
PyAudio (for microphone users):
- Install PyAudio using APT: execute sudo apt-get install python-pyaudio python3-pyaudio in a terminal.
Pocketsphinx:
1. One way:
  - install dependencies: sudo apt-get install gcc automake autoconf libtool bison swig python-dev libpulse-dev
  - Follow the instructions: https://bangladroid.wordpress.com/2017/02/16/installing-cmu-sphinx-on-ubuntu/
2. Another way (not sure if it is different or supplementary):
  - pip3 install pocketsphinx
3. Test+Errors:
  - Try this: pocketsphinx_continuous
  - If you find this error error while loading shared libraries: libpocketsphinx.so.3: cannot open shared object file: No such file or directory do this sudo nano /etc/ld.so.conf and add
  include usr/local/lib
  
  include usr/
4. Install other languages
Pocketsphinx Python:
pip install --upgrade pip setuptools wheel

pip install --upgrade pocketsphinx
Examples:
1. Transcribe an audio file: run python3 audio_transcribe.py
2. Save audio data to an audio file: run python3 write_audio.py

@@ Line 40: / Line 40: @@
 * * pip3 install termcolor
 * * pip3 install pyaudio
+* Install Windows SDC
+* Install Visual C++ 2015 Tools for Windows Desktop
 == Mac ==
 (haven't tried it)
 https://github.com/watsonbox/homebrew-cmu-sphinx