Text recognition prebuilt model

Article
01/10/2024

The text recognition prebuilt model extracts words from documents and images into machine-readable character streams. It uses state-of-the-art optical character recognition (OCR) to detect printed and handwritten text in images.

This model processes images and document files to extract lines of printed or handwritten text.

Use in Power Apps

The text recognition prebuilt model is available in Power Apps by using the text recognizer component. More information: Use the text recognizer component in Power Apps

Use in Power Automate

For information about how to use this model in Power Automate, see Use the text recognition prebuilt model in Power Automate.

Supported language, format, and size

The files you can scan with the text recognition model must have these characteristics:

Language for print text: Afrikaans, Albanian, Angika (Devanagiri), Arabic, Asturian, Awadhi-Hindi (Devanagiri), Azerbaijani (Latin), Bagheli, Basque, Belarusian (Cyrillic), Belarusian (Latin), Bhojpuri-Hindi (Devanagiri), Bislama, Bodo (Devanagiri), Bosnian (Latin), Brajbha, Breton, Bulgarian, Bundeli, Buryat (Cyrillic), Catalan, Cebuano, Chamling, Chamorro, Chhattisgarhi (Devanagiri), Chinese (Simplified), Chinese (Traditional), Cornish, Corsican, Crimean Tatar (Latin), Croatian, Czech, Danish, Dari, Dhimal (Devanagiri), Dogri (Devanagiri), Dutch, English, Erzya (Cyrillic), Estonian, Faroese, Fijian, Filipino, Finnish, French, Friulian, Gagauz (Latin), Galician, German, Gilbertese, Gondi (Devanagiri), Greenlandic, Gurung (Devanagiri), Haitian Creole, Halbi (Devanagiri), Hani, Haryanvi, Hawaiian, Hindi, Hmong Daw (Latin), Ho(Devanagiri), Hungarian, Icelandic, Inari Sami, Indonesian, Interlingua, Inuktitut (Latin), Irish, Italian, Japanese, Jaunsari (Devanagiri), Javanese, Kabuverdianu, Kachin (Latin), Kangri (Devanagiri), Karachay-Balkar, Kara-Kalpak (Cyrillic), Kara-Kalpak (Latin), Kashubian, Kazakh (Cyrillic), Kazakh (Latin), Khaling, Khasi, K'iche', Korean, Korku, Koryak, Kosraean, Kumyk (Cyrillic), Kurdish (Arabic), Kurdish (Latin), Kurukh (Devanagiri), Kyrgyz (Cyrillic), Lakota, Latin, Lithuanian, Lower Sorbian, Lule Sami, Luxembourgish, Mahasu Pahari (Devanagiri), Malay (Latin), Maltese, Malto (Devanagiri), Manx, Maori, Marathi, Mongolian (Cyrillic), Montenegrin (Cyrillic), Montenegrin (Latin), Neapolitan, Nepali, Niuean, Nogay, Northern Sami (Latin), Norwegian, Occitan, Ossetic, Pashto, Persian, Polish, Portuguese, Punjabi (Arabic), Ripuarian, Romanian, Romansh, Russian, Sadri (Devanagiri), Samoan (Latin), Sanskrit (Devanagari), Santali (Devanagiri), Scots, Scottish Gaelic, Serbian (Latin), Sherpa (Devanagiri), Sirmauri (Devanagiri), Skolt Sami, Slovak, Slovenian, Somali (Arabic), Southern Sami, Spanish, Swahili (Latin), Swedish, Tajik (Cyrillic), Tatar (Latin), Tetum, Thangmi, Tongan, Turkish, Turkmen (Latin), Tuvan, Upper Sorbian, Urdu, Uyghur (Arabic), Uzbek (Arabic), Uzbek (Cyrillic), Uzbek (Latin), Volapük, Walser, Welsh, Western Frisian, Yucatec Maya, Zhuang, Zulu
Language for handwritten text: English, Chinese (Simplified), French, German, Italian, Japanese, Korean, Portuguese, Spanish
Format:
- JPG
- PNG
- BMP
- PDF
Size: 20 MB maximum
For PDF documents, only the first 2,000 pages are processed.

Model output

If a document is detected, the text recognition model outputs the following information:

Results: A list of lines extracted from the input text.
Text: Strings containing the line of text detected.
BoundingBox: Four values representing the bounding box, described by using the top and left positions along with its width and height.

Limits

Action	Limit	Renewal period
Text recognition calls (per environment)	480	60 seconds