Speech recognition: Difference between revisions
(2 intermediate revisions by the same user not shown) | |||
Line 45: | Line 45: | ||
* Install GNU make: | * Install GNU make: | ||
*:''sudo apt-get install build-essential'' | *:''sudo apt-get install build-essential'' | ||
* ''sudo apt install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0 ffmpeg libav-tools'' | |||
* Follow the instructions of the Ubuntu version above | * Follow the instructions of the Ubuntu version above | ||
Latest revision as of 14:13, 6 July 2018
Ubuntu/ Mac
Requirements: Python 3.3+
https://pypi.python.org/pypi/SpeechRecognition
- SpeechRecognition 3.8.1
- pip install SpeechRecognition
- To quickly try it out, run python -m speech_recognition after installing.
- PyAudio (for microphone users):
- Install PyAudio using APT: execute sudo apt-get install python-pyaudio python3-pyaudio in a terminal.
- Pocketsphinx:
- One way:
- install dependencies: sudo apt-get install gcc automake autoconf libtool bison swig python-dev libpulse-dev
- Follow the instructions: https://bangladroid.wordpress.com/2017/02/16/installing-cmu-sphinx-on-ubuntu/
- Another way (not sure if it is different or supplementary):
- pip3 install pocketsphinx
- Test+Errors:
- Try this: pocketsphinx_continuous
- If you find this error error while loading shared libraries: libpocketsphinx.so.3: cannot open shared object file: No such file or directory do this sudo nano /etc/ld.so.conf and add
- include usr/local/lib
- include usr/
- Install other languages
- One way:
- Pocketsphinx Python:
- pip install --upgrade pip setuptools wheel
- pip install --upgrade pocketsphinx
- Examples:
- Transcribe an audio file: run python3 audio_transcribe.py
- Save audio data to an audio file: run python3 write_audio.py
Windows
A solution:
Install Ubuntu in Windows
https://tutorials.ubuntu.com/tutorial/tutorial-ubuntu-on-windows#0
- Open the start page and search for CMD (Command Prompt). Open it and type bash. Now you are in the Ubuntu environment.
- Install git: sudo apt install git
- Install python3:
- sudo apt-get update
- sudo apt-get install python3.6
- sudo apt-get update
- Install pip:
- sudo apt-get install python-pip python-dev build-essential
- sudo pip install --upgrade pip
- sudo pip install --upgrade virtualenv
- Install GNU make:
- sudo apt-get install build-essential
- sudo apt install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0 ffmpeg libav-tools
- Follow the instructions of the Ubuntu version above
B solution:
- Install python3: https://tutorial.djangogirls.org/en/installation/
- Install git: https://git-scm.com/download/win
- Install pocketsphinx: https://cmusphinx.github.io/wiki/tutorialpocketsphinx/#windows OR
- Install Windows SDC
- Install Visual C++ 2015 Tools for Windows Desktop
(...needs development)
Mac
(haven't tried it) https://github.com/watsonbox/homebrew-cmu-sphinx