Getting started with the Video Indexer using Microsoft Cognitive Services


Microsoft introduced the public preview of Video Indexer as part of Cognitive Services. It replaces the earlier Video API. Video Indexer automatically extracts metadata from video and audio, which you can use to build intelligent applications.

In this article, I will show how to sign in to Video Indexer, upload a video, and extract its metadata and translation.

Create Account:

Before developing a Video Indexer application, you must sign in to Microsoft VI or create an account using one of the following: Azure Active Directory, a Microsoft school account, LinkedIn, Google, or Facebook.

Upload Video:

Next, upload a video to the Video Indexer portal. After signing in, select Upload and drag and drop your video file, or provide the web URL of the video. Enter the basic details (video name, language, and privacy setting) and click the Upload button.

After the video is uploaded, Microsoft VI starts analyzing and indexing it.

Once Video Indexer has finished the analysis, you will receive an email notification with a link to the video, a short description, and the people detected by face detection.

You can edit the privacy setting from the portal. Video Indexer returns the following analysis results.
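The same upload can also be requested from code instead of the portal. The sketch below only builds the request URL for uploading a video by web URL; it does not send anything. The base URL, the path layout, and the query parameter names (`accessToken`, `name`, `videoUrl`, `language`, `privacy`) are my assumptions about the Video Indexer REST API and are not taken from this article, so check them against the API reference for your account before relying on them. The account ID, token, and location values are placeholders.

```python
from urllib.parse import urlencode

# Assumed base URL of the Video Indexer REST API (verify in the API portal).
API_BASE = "https://api.videoindexer.ai"


def build_upload_url(location, account_id, access_token, name, video_url,
                     language="en-US", privacy="Private"):
    """Build the URL for a POST request that uploads a video by web URL.

    The path and parameter names are assumptions, not confirmed by the article.
    """
    query = urlencode({
        "accessToken": access_token,  # placeholder token obtained after sign-in
        "name": name,                 # display name of the video
        "videoUrl": video_url,        # publicly reachable video URL
        "language": language,         # spoken language of the video
        "privacy": privacy,           # "Private" or "Public"
    })
    return f"{API_BASE}/{location}/Accounts/{account_id}/Videos?{query}"


if __name__ == "__main__":
    # All values below are placeholders for illustration only.
    print(build_upload_url("trial", "my-account-id", "my-token",
                           "Demo video", "https://example.com/demo.mp4"))
```

Keeping the URL construction in a small function makes it easy to inspect the request before sending it with any HTTP client of your choice.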

Face identification:

Video Indexer detects faces in a video. Detected faces are matched against known celebrities; when a match is found, the name is shown, and you can manually label faces that do not match a celebrity.

Speech to Text:

Video Indexer has speech-to-text functionality that transcribes the spoken language in the video. It supports languages such as Tamil, English, and Hindi, and you can edit the resulting text. Video Indexer can also map which speaker spoke which words, and when.
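To make the "which speaker spoke which words and when" idea concrete, here is a minimal sketch that groups transcript lines by speaker. The `transcript` list and its field names (`speaker`, `start`, `text`) are a hypothetical shape I made up for illustration; the real Video Indexer output may be structured differently.

```python
from collections import defaultdict

# Hypothetical transcript: one entry per spoken line, with a start timecode.
transcript = [
    {"speaker": "Speaker #1", "start": "0:00:01", "text": "Welcome to the demo."},
    {"speaker": "Speaker #2", "start": "0:00:05", "text": "Thanks for having me."},
    {"speaker": "Speaker #1", "start": "0:00:09", "text": "Let us get started."},
]


def lines_by_speaker(entries):
    """Group (start, text) pairs under each speaker, keeping the original order."""
    grouped = defaultdict(list)
    for entry in entries:
        grouped[entry["speaker"]].append((entry["start"], entry["text"]))
    return dict(grouped)


if __name__ == "__main__":
    for speaker, lines in lines_by_speaker(transcript).items():
        print(speaker)
        for start, text in lines:
            print(f"  [{start}] {text}")
```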

Identify Objects:

Video Indexer identifies objects that appear in the video, based on a predefined set of 2,000 objects.

Keyword Extractions:

Keywords make it easier to search for a video in a large library. Video Indexer extracts keywords from the transcript of the spoken words and from the text recognized by the visual text recognizer.
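As a rough illustration of how extracted keywords help with searching a large library, the sketch below builds a simple keyword-to-video lookup. The `library` dictionary, the video IDs, and the keywords are all made-up sample data, and this is only one possible way to use the keywords, not how Video Indexer itself implements search.

```python
# Hypothetical library: video ID mapped to the keywords extracted for that video.
library = {
    "video-1": ["Azure", "Cognitive Services", "demo"],
    "video-2": ["Xamarin", "demo"],
    "video-3": ["azure", "storage"],
}


def build_keyword_index(videos):
    """Map each lowercased keyword to the set of video IDs that contain it."""
    index = {}
    for video_id, keywords in videos.items():
        for keyword in keywords:
            index.setdefault(keyword.lower(), set()).add(video_id)
    return index


def search(index, keyword):
    """Return the sorted video IDs tagged with the keyword (case-insensitive)."""
    return sorted(index.get(keyword.lower(), set()))


if __name__ == "__main__":
    index = build_keyword_index(library)
    print(search(index, "Azure"))
    print(search(index, "demo"))
```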

Sentiment analysis:

Video Indexer performs sentiment analysis on the text extracted by speech-to-text and optical character recognition, and reports the result as positive, negative, or neutral sentiment, along with timecodes.
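Because each sentiment comes with timecodes, you can summarize how much of the video falls under each sentiment. The sketch below assumes a hypothetical list of segments with `sentiment`, `start`, and `end` fields and timecodes in `h:mm:ss` form; the actual output format of Video Indexer may differ.

```python
# Hypothetical sentiment segments with start and end timecodes (h:mm:ss).
segments = [
    {"sentiment": "Positive", "start": "0:00:00", "end": "0:00:10"},
    {"sentiment": "Neutral", "start": "0:00:10", "end": "0:00:25"},
    {"sentiment": "Positive", "start": "0:01:00", "end": "0:01:05"},
]


def to_seconds(timecode):
    """Convert an h:mm:ss timecode (seconds may have a fraction) to seconds."""
    hours, minutes, seconds = timecode.split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


def seconds_per_sentiment(items):
    """Sum the duration in seconds of all segments for each sentiment label."""
    totals = {}
    for item in items:
        duration = to_seconds(item["end"]) - to_seconds(item["start"])
        totals[item["sentiment"]] = totals.get(item["sentiment"], 0.0) + duration
    return totals


if __name__ == "__main__":
    for label, total in seconds_per_sentiment(segments).items():
        print(f"{label}: {total:.0f} seconds")
```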


Translation:

Video Indexer can translate the audio transcript from one language to another. It supports multiple languages, such as Tamil, English, and Spanish.

Once Video Indexer has finished processing and analyzing the video, you can review, edit, delete, and publish the video in the Microsoft VI portal.


In this article, you learned how to sign in to Video Indexer, upload a video, and extract its metadata and translation. If you have any questions, feedback, or issues, please write in the comment box.

