Live transcription with automatic speaker identification using FastRTC audio streaming.
Lower = more sensitive