question

SaurabhSinghSengar-7709 avatar image
0 Votes"
SaurabhSinghSengar-7709 asked AnilMurching-7279 answered

Need to do a Face detection on rtsp stream using Azure services

I would like to use some of the Azure service for my project. Idea is to stream encoded video stream (h264), expecting Azure to decode and perform face detection on it in real time. What are the best services to use it.
One option I explore is to use computer vision/face Api with media services, however computed vision API requires image in jpeg/bif format which requires reencoding of decoding stream, or I need to do screen grab, which I dont want to do as it will increase latency.
If there is service which does ML operations on raw frame decoded by media services would help here.

azure-machine-learningazure-media-servicesazure-computer-vision
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

romungi-MSFT avatar image
0 Votes"
romungi-MSFT answered

@SaurabhSinghSengar-7709 The service that offers the capability to enable identification of faces in real time with streaming video is Azure Video Analyzer. It should support HLS and other popular formats, you can take a look at the limitations and quotas from the documentation.

With computer vision or face API you need to capture the frame or a screen shot and pass the same to the API to get the results.


  • Please don't forget to click on 130616-image.png or upvote 130671-image.png button whenever the information provided helps you. Original posters help the community find answers faster by identifying the correct answer. Here is how

  • Want a reminder to come back and check responses? Here is how to subscribe to a notification

  • If you are interested in joining the VM program and help shape the future of Q&A: Here is how you can be part of Q&A Volunteer Moderators





5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

AnilMurching-7279 avatar image
0 Votes"
AnilMurching-7279 answered

In particular, review https://docs.microsoft.com/en-us/azure/azure-video-analyzer/video-analyzer-docs/use-intel-openvino-tutorial that shows you how you can run Intel's face detection model on an RTSP stream.

5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.