History

tomkarho c0bb7c4cd3 Notes		2024-05-08 15:32:46 +03:00
..
Components	Transcribe entire files	2024-05-08 15:29:14 +03:00
note-resources	UI	2024-05-08 13:14:30 +03:00
Properties	Add project to test transcribe functionalities	2024-05-08 10:21:37 +03:00
Services	Transcribe entire files	2024-05-08 15:29:14 +03:00
wwwroot	Notes	2024-05-08 13:15:01 +03:00
.gitignore	Handle audio up to 30 seconds	2024-05-08 13:14:57 +03:00
appsettings.Development.json	Add project to test transcribe functionalities	2024-05-08 10:21:37 +03:00
appsettings.json	Add project to test transcribe functionalities	2024-05-08 10:21:37 +03:00
AzureAi.Transcriber.csproj	Handle audio up to 30 seconds	2024-05-08 13:14:57 +03:00
Program.cs	Handle audio up to 30 seconds	2024-05-08 13:14:57 +03:00
README.md	Notes	2024-05-08 15:32:46 +03:00

Azure Ai Transcribing

This project is meant to demonstrate and document my attempts to create a video transcribe service using Azure AI.

Creating Azure Ai Resource

I created just plain Azure AI Services resource from Azure Portal
I chose Sweden Central as location and standard S0 as pricing tier
I am using Visual Studio license attached Azure account so I have roughly 150€ of free credits
here's how the portal looked after

ffmpeg -i [INPUT].mp3 -acodec pcm_s16le -ac 1 -ar 16000 [OUTPUT].wav

    export SPEECH_KEY=your_key
    export SPEECH_REGION=your_region 
    
    dotnet run

I used three different files of varying lengths to try to transcribe each
It seems there is a limit as to how long the audio file can be
It might be that the silence detection is too strict
- Yeah documentation says as much

This example uses the RecognizeOnceAsync operation to transcribe utterances of up to 30 seconds, or until silence is detected.