I am looking for an English-language analysis engine that works as follows: I will feed it three inputs via an API (a request ID, a text, and an audio clip of that text being spoken). The engine will accept the three inputs, analyze the spoken text in the audio clip, and map it to the text… (Budget: $30 – $250 USD, Jobs: Python)
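To make the request concrete, here is a minimal Python sketch of the mapping step only. It assumes the audio clip has already been transcribed by some speech-to-text component (not shown, and not specified in the posting). The names `AnalysisRequest`, `audio_path`, `tokenize`, and `compare_to_transcript`, as well as the shape of the returned result, are hypothetical and chosen for illustration.

```python
import difflib
import re
from dataclasses import dataclass


@dataclass
class AnalysisRequest:
    """The three inputs named in the posting (field names are assumptions)."""
    request_id: str
    text: str         # reference text the speaker is expected to say
    audio_path: str   # location of the audio clip; unused in this sketch


def tokenize(s: str) -> list[str]:
    # Lowercase and keep only word-like tokens so punctuation and
    # capitalization do not count as differences.
    return re.findall(r"[a-z']+", s.lower())


def compare_to_transcript(request: AnalysisRequest, transcript: str) -> dict:
    """Align the reference text with a transcript of the audio, word by word.

    `transcript` is assumed to come from a separate speech-to-text step.
    """
    expected = tokenize(request.text)
    spoken = tokenize(transcript)
    matcher = difflib.SequenceMatcher(a=expected, b=spoken, autojunk=False)
    # Keep only the spans where the spoken words differ from the text.
    differences = [
        {"op": tag, "expected": expected[i1:i2], "spoken": spoken[j1:j2]}
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]
    return {
        "request_id": request.request_id,
        "similarity": round(matcher.ratio(), 3),
        "differences": differences,
    }


if __name__ == "__main__":
    req = AnalysisRequest("req-1", "The quick brown fox", "clip.wav")
    print(compare_to_transcript(req, "the quick brown box"))
```

A word-level alignment like this reports which words were replaced, dropped, or added relative to the text. Whether the engine should also score pronunciation, timing, or anything else is not stated in the posting, so the sketch does not attempt it.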