RealTimeTranscription Demo

This demo is a server application consuming audio from Twilio Media Streams and using Google Cloud Speech to perform realtime transcriptions.

App sever setup

Select a Project
Enable or Manage
Choose Credentials
- Create a new Credential or make sure you have the JSON
- Copy JSON and save as google_creds.json in the root of this project

Create your virtualenv virtualenv -p python3 env
Run source env/bin/activate
Run pip install -r requirements.txt
Set the Google App Credentials from above for your env. export GOOGLE_APPLICATION_CREDENTIALS="./google_creds.json"
Run python ./server.py

You can setup your environment to run the demo by using the CLI (BETA) or the Console.

Find available phone number twilio api:core:available-phone-numbers:local:list --country-code="US" --voice-enabled --properties="phoneNumber"
Purchase the phone number (where +123456789 is a number you found) twilio api:core:incoming-phone-numbers:create --phone-number="+123456789"
Start ngrok ngrok http 8080
Edit the templates/streams file to replace <ngrok url> with your ngrok host.
Make the call where +123456789 is the Twilio number you bought and +198765432 is your phone number and abcdef.ngrok.io is your ngrok host. twilio api:core:calls:create --from="+123456789" --to="+198765432" --url="https://abcdef.ngrok.io/twiml"

Access the Twilio console to get a <TWILIO-PHONE-NUMBER>.
Run the server (listening in 8080 port)
Use ngrok to make the server publicly available: ngrok http 8080
Edit the streams.xml file in the templates directory and add your ngrok URL as wss://<ngrok url>
Run the curl command in order to make the proper call curl -XPOST https://api.twilio.com/2010-04-01/Accounts/<ACCOUNT-SID>/Calls.json -d "Url=http://<ngrok url>/twiml" -d "To=<PHONE-NUMBER>" -d "From=<TWILIO-PHONE-NUMBER>" -u <ACCOUNT-SID>:<AUTH-TOKEN>