Computer Vision API (Preview)

Computer Vision API (Preview)

Extract rich information from images to categorize and process visual data—and protect your users from unwanted content with this Azure Cognitive Service.

Throttling Limits

Name Calls Renewal Period
API calls per connection 1200 60 seconds

Creating a connection

To connect your account, you will need the following information:

Account Key
securestring
Cognitive Services Account Key
Site URL
string
Root site url (Example: https://westus.api.cognitive.microsoft.com ).If not specified site url will be defaulted to 'https://westus.api.cognitive.microsoft.com'.

Actions

Analyze Image

This operation extracts a rich set of visual features based on the image content.

Required Parameters

Image Source
string
Source of the image - either included or by reference url.
Image
dynamic

Optional Parameters

Language
string
The service will return recognition results in specified language.

Returns

Describe Image

This operation generates a description of an image in human readable language with complete sentences.

Required Parameters

Image Source
string
Source of the image - either included or by reference url.
Image
dynamic

Optional Parameters

Max Candidates
number
Maximum number of candidate descriptions to be returned.

Returns

Describe Image Content

This operation generates a description of image content in human readable language with complete sentences.

Required Parameters

Image Content
binary
Source of the image reference in body.

Optional Parameters

Max Candidates
number
Maximum number of candidate descriptions to be returned.

Returns

Describe Image URL

This operation generates a description of an image URL in human readable language with complete sentences.

Optional Parameters

Max Candidates
number
Maximum number of candidate descriptions to be returned.
Image URL
url
Source of image reference by URL.

Returns

Generate Thumbnail

This operation generates a thumbnail image with the user-specified width and height.

Required Parameters

Thumbnail Width
number
Width of the generated thumbnail - recommended is 50
Thumbnail Height
number
Height of the generated thumbnail - recommended is 50
Image Source
string
Source of the image - either included or by reference url.
Image
dynamic

Optional Parameters

Smart Cropping
boolean
Boolean flag for enabling smart cropping

Returns

Thumbnail
binary

Generated thumbnail image

Optical Character Recognition (OCR) to JSON

Optical Character Recognition (OCR) detects text in an image and extracts the recognized characters into a machine-usable JSON stream.

Required Parameters

Image Source
string
Source of the image - either included or by reference url.
Image
dynamic

Returns

JSON Response
OCRJsonResponse

Optical Character Recognition (OCR) to Text

Optical Character Recognition (OCR) detects text in an image and extracts the recognized characters into a text file.

Required Parameters

Image Source
string
Source of the image - either included or by reference url.
Image
dynamic

Returns

Recognize Domain Specific Content

Recognize celebrities or landmarks in an image.

Required Parameters

Domain Model
string
Supported domain-specific model to recognize in image.
Image Source
string
Source of the image - either included or by reference url.
Image
dynamic

Returns

Tag Image

Generates a list of words, or tags, that are relevant to the content of the supplied image.

Required Parameters

Image Source
string
Source of the image - either included or by reference url.
Image
dynamic

Returns

Definitions

DomainModelResponse

Celebrity Confidence
float
Confidence score that image is of celebrity
Celebrity Name
string
Name of the recognized celebrity
Landmark Confidence
float
Confidence score that image is of landmark
Landmark Name
string
Name of the identified landmark
celebrities
array of object
Recognized celebrities in image
landmarks
array of object
Recognized landmarks in image

TagResponse

Tag Confidence Score
float
Confidence score of the identified tag.
Tag Name
string
Name of the tag identified.
tags
array of object
Set of tags returned from the picture analysis.

OCRJsonResponse

Regions Array
array of object
Text regions returned.
Text Language
string
Detected language of the image text.

OCRTextResponse

Detected Text
string
Text detected in the image analyzed

AnalyzeResponse

Caption Confidence Score
float
Confidence score of the image caption
Caption Text
string
Text caption generated from the image
Captions
array of object
List of captions generated from the image
Category Confidence Score
float
Confidence Score for the image category
Category Name
string
Name of the category identified from the image
Tag Confidence Score
float
Confidence score for the identified tags.
Tag Name
string
Name of the tag identified.
Tag Names
array of string
Collection of tag names.
categories
array of object
Categories identified from the image
tags
array of object
Tags identified with confidence scores.

DescribeResponse

Caption Confidence Score
float
Confidence score of the image caption
Caption Text
string
Text caption generated from the image
Captions
array of object
List of captions generated from the image
Tag Names
array of string
Collection of tag names.