Azure OpenAI Whisper Parser is a wrapper around the Azure OpenAI Whisper API, which uses machine learning to transcribe audio files to English text. The parser supports .mp3, .mp4, .mpeg, .mpga, .m4a, .wav, and .webm files. The current implementation follows LangChain core principles and can be used with other loaders to handle both audio downloading and parsing. As a result, the parser yields an Iterator[Document].
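As an illustration of the supported formats listed above, the following minimal sketch checks a file's extension before it is handed to the parser. The helper name `is_supported_audio` and the constant `SUPPORTED_EXTENSIONS` are hypothetical and not part of LangChain; the parser itself may apply its own checks.

```python
from pathlib import Path

# Extensions taken from the list of supported formats above.
# This constant is a hypothetical helper, not part of the LangChain API.
SUPPORTED_EXTENSIONS = frozenset(
    {".mp3", ".mp4", ".mpeg", ".mpga", ".m4a", ".wav", ".webm"}
)


def is_supported_audio(path: str) -> bool:
    """Return True if the file extension is one of the supported formats."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS
```

A check like this only inspects the file name, so it cannot tell whether the file content is valid audio.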
Prerequisites
The service requires Azure credentials, an Azure endpoint, and a Whisper model deployment, which can be set up by following the guide here. Furthermore, the required dependencies must be installed.
Example 1
The AzureOpenAIWhisperParser's .lazy_parse method accepts a Blob object that contains the path of the file to be transcribed.
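A minimal sketch of this usage might look as follows. The import paths and keyword arguments (`api_key`, `azure_endpoint`, `api_version`, `deployment_name`) reflect the `langchain-community` package as best understood here and should be checked against the installed version. The environment variable `AZURE_WHISPER_DEPLOYMENT` and the helper names `build_parser`, `transcribe`, and `main` are assumptions made for illustration.

```python
import os
from typing import Any


def build_parser() -> Any:
    """Construct the parser from environment variables (variable names are assumptions)."""
    # Imported lazily so this module loads even when langchain-community is absent.
    from langchain_community.document_loaders.parsers.audio import (
        AzureOpenAIWhisperParser,
    )

    return AzureOpenAIWhisperParser(
        api_key=os.environ["AZURE_OPENAI_API_KEY"],
        azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
        api_version=os.environ["OPENAI_API_VERSION"],
        # Hypothetical variable holding the name of the Whisper deployment.
        deployment_name=os.environ["AZURE_WHISPER_DEPLOYMENT"],
    )


def transcribe(parser: Any, blob: Any) -> str:
    """Consume the lazy iterator and join the text of each yielded Document."""
    return "\n".join(doc.page_content for doc in parser.lazy_parse(blob))


def main(audio_path: str) -> None:
    """Transcribe one local audio file and print the text."""
    from langchain_core.documents.base import Blob

    parser = build_parser()
    print(transcribe(parser, Blob(path=audio_path)))
```

Because `.lazy_parse` returns an iterator, no transcription work happens until the documents are consumed, which `transcribe` does when it joins their `page_content`.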
Example 2
The AzureOpenAIWhisperParser can also be used in conjunction with audio loaders, such as the YoutubeAudioLoader combined with a GenericLoader.
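The combination could be sketched roughly as below, assuming the `langchain-community` import paths shown and that `GenericLoader` takes a blob loader followed by a blob parser. The function name `build_youtube_loader` is hypothetical, and the YouTube loader needs its own download dependencies installed, which are not listed here.

```python
from typing import Any, List


def build_youtube_loader(urls: List[str], save_dir: str, parser: Any) -> Any:
    """Pair a YouTube audio downloader with an already constructed parser."""
    # Imported lazily so this module loads even when langchain-community is absent.
    from langchain_community.document_loaders.blob_loaders.youtube_audio import (
        YoutubeAudioLoader,
    )
    from langchain_community.document_loaders.generic import GenericLoader

    # The blob loader downloads audio into save_dir; the parser transcribes each blob.
    return GenericLoader(YoutubeAudioLoader(urls, save_dir), parser)
```

With a parser instance in hand, calling `build_youtube_loader(["<video url>"], "save/directory/", parser).load()` would download the audio and return the transcribed documents; the URL and directory are placeholders.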