To transcribe audio, I used the speech-to-text services from Amazon, Microsoft, and Google. The app can transcribe and test batches of audio files and stores the results locally.
To measure accuracy, each transcription is compared against its original transcript and the word error rate (WER) is calculated. The transcription time for each cloud service is also recorded to compare performance.
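As a rough illustration of the accuracy metric, WER is the number of word substitutions, deletions, and insertions needed to turn the reference transcript into the transcription, divided by the number of words in the reference. The sketch below is not the app's actual code: the function name `word_error_rate` is hypothetical, and the normalization (lowercasing and splitting on whitespace, with no punctuation handling) is an assumption made to keep the example short.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the WER of `hypothesis` measured against `reference`.

    Assumption: both strings are normalized by lowercasing and splitting on
    whitespace only. Punctuation is left attached to words.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the word-level edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or a match when cost is 0
            )
        prev = curr

    # Total edits divided by the reference length.
    return prev[-1] / len(ref)


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    hypothesis = "the cat sat on mat"
    print(f"WER: {word_error_rate(reference, hypothesis):.3f}")
```

Because insertions count as errors, WER can exceed 1.0 when a transcription is much longer than its reference, so it is best read as an error ratio rather than a percentage capped at 100.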
JavaScript, Node.js, Python, AWS S3, Boto3, Microsoft Speech SDK, and Google Speech-to-Text API