Speaker Mode Video in Azure Media Services

I have two hi-res local video recording files from a podcast interview.

I would like to merge them into one output file with the speaker showing at all times.

So we'd need to analyse the audio track and see who is speaking (guest has priority) and then create an array of timestamps of the speaker.

Volume analysis example using ffmpeg similar to what I'm describing

Then I'd like to use AMS to merge the video files based on the timestamps (eg. host.mp4 source for 20 seconds then guest.mp4 for 30 seconds, etc)

How would I go about this?

Solution

This sounds like the speaker enumeration feature in Azure Video Indexer https://learn.microsoft.com/en-us/azure/azure-video-indexer/video-indexer-overview#videoaudio-ai-features.

Get Input Asset or Output Asset from JobId or SmoothStreamingUrl - Azure Media Services
Azure Media Player controls missing css classes
Azure Media Services Live Video Streaming download stored video
How to get a published video URL
Azure Media Services V3 audio analyzer transcript timestamp incorrect
Azure Media Services Stream not displaying in Blazor App using Azure Media Player
Error when executing a sample code for Azure Media Service v3
How to get the data from Application Insights in a C# class (Azure Media Player Plugin)
How to get Average Visualization Time of an asset in Azure Media Services v3
How to get the duration of a video from the Azure media services?
Speaker Mode Video in Azure Media Services
Azure Media Services - Programmatically Transcode Blob into Asset
Azure media services video files virus scanning
Unable to upload to azure media service
Why does the Azure Media Services Verizon Premium CDN Streaming Endpoint actually create more latency when compared to the default non-CDN option
How to construct a streaming URL for Azure Media Services live stream event? Where is the streaming locator path on the rest api call?
amp azure playback speed python
ffmpeg: ffplay reading audio live stream from azure media services but no sound
Azure Media Service authentication type
How do I use Azure media service to apply watermark on video files which is of format mp4, OGG
TextTrack LanguageCode parameter in Tracks.BeginCreateOrUpdateAsync
Querying assets with filter returns too small subset of assets
Add an audio track to existing asset (via Azure Media Services)
Azure CLI "az ams job start" is returning "index out of range"
How to embed azure media services live stream to a personal website
Azure Media Player Video Shifted from Left on Android Devices Only
Azure Media Services - Live Streaming without Rewind / Timeshift
Azure Media services, create locator fails using .NET library
Can you use Azure AMS V3 for DRM Key Delivery only?
Streaming with live captions