What is Document Translation (Preview)?

Document Translation is a cloud-based feature of the Azure Translator service and is part of the Azure Cognitive Services family of REST APIs. The Document Translation API translates documents to and from more than 70 languages while preserving document structure and data format.

Document Translation key features

Feature | Description
Translate large files | Translate whole documents asynchronously.
Translate numerous files | Translate multiple files to and from more than 70 languages.
Preserve source file presentation | Translate files while preserving the original layout and format.
Apply custom translation | Translate documents using general and custom translation models.
Apply custom glossaries | Translate documents using custom glossaries.
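
To make the features above more concrete, here is a minimal Python sketch of how a translation request body might be assembled. It only builds a JSON payload and sends nothing. The field names (inputs, sourceUrl, targetUrl, language, category, glossaries, glossaryUrl, format) reflect one reading of the preview REST API and are assumptions here, as are the function name and the placeholder URLs; check them against the API reference before relying on them.

```python
import json
from typing import Optional


def build_translation_request(
    source_url: str,
    target_url: str,
    target_language: str,
    source_language: Optional[str] = None,
    category: Optional[str] = None,
    glossary_url: Optional[str] = None,
    glossary_format: Optional[str] = None,
) -> dict:
    """Assemble a request body for one source and one target.

    Hypothetical helper: the payload shape is an assumption, not a
    confirmed contract of the Document Translation API.
    """
    source = {"sourceUrl": source_url}
    if source_language:
        source["language"] = source_language

    target = {"targetUrl": target_url, "language": target_language}
    if category:
        # Assumed to select a custom translation model instead of the general one.
        target["category"] = category
    if glossary_url:
        glossary = {"glossaryUrl": glossary_url}
        if glossary_format:
            glossary["format"] = glossary_format
        target["glossaries"] = [glossary]

    return {"inputs": [{"source": source, "targets": [target]}]}


if __name__ == "__main__":
    body = build_translation_request(
        source_url="https://example.invalid/source-container",  # placeholder
        target_url="https://example.invalid/target-container",  # placeholder
        target_language="fr",
        glossary_url="https://example.invalid/glossaries/terms.tsv",  # placeholder
        glossary_format="tsv",
    )
    print(json.dumps(body, indent=2))
```

Optional settings are left out of the payload when they are not supplied, so the same helper covers a plain translation and one that uses a custom model or a glossary.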

How to get started?

In our how-to guide, you'll learn how to quickly get started using Document Translation. To begin, you'll need an active Azure account. If you don't have one, you can create a free account.

Supported document formats

The following document file types are supported by Document Translation:

File type | File extension | Description
Adobe PDF | .pdf | Adobe Acrobat portable document format.
HTML | .html | Hypertext Markup Language.
Localization Interchange File Format | .xlf, .xliff | A parallel document format, export of Translation Memory systems. The languages used are defined inside the file.
Microsoft Excel | .xlsx | A spreadsheet file for data analysis and documentation.
Microsoft Outlook | .msg | An email message created or saved within Microsoft Outlook.
Microsoft PowerPoint | .pptx | A presentation file used to display content in a slideshow format.
Microsoft Word | .docx | A text document file.
Tab Separated Values/TAB | .tsv, .tab | A tab-delimited raw-data file used by spreadsheet programs.
Text | .txt | An unformatted text document.
Translation Memory Exchange | .tmx | An open XML standard used for exchanging translation memory (TM) data created by Computer Aided Translation (CAT) and localization applications.

Supported glossary formats

The following glossary file types are supported by Document Translation:

File type | File extension | Description
Localization Interchange File Format | .xlf, .xliff | A parallel document format, export of Translation Memory systems. The languages used are defined inside the file.
Tab Separated Values/TAB | .tsv, .tab | A tab-delimited raw-data file used by spreadsheet programs.
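
As a convenience, the extensions listed in the two tables above can be checked locally before files are uploaded. The following Python sketch does this by file extension only. The set and function names are hypothetical, and the check says nothing about whether a file's contents are valid for its format.

```python
from pathlib import PurePosixPath

# Extensions copied from the supported document formats table.
SUPPORTED_DOCUMENT_EXTENSIONS = {
    ".pdf", ".html", ".xlf", ".xliff", ".xlsx", ".msg",
    ".pptx", ".docx", ".tsv", ".tab", ".txt", ".tmx",
}

# Extensions copied from the supported glossary formats table.
SUPPORTED_GLOSSARY_EXTENSIONS = {".xlf", ".xliff", ".tsv", ".tab"}


def _extension(filename: str) -> str:
    """Return the final extension in lowercase, or '' if there is none."""
    return PurePosixPath(filename).suffix.lower()


def is_supported_document(filename: str) -> bool:
    """True if the file extension appears in the document formats table."""
    return _extension(filename) in SUPPORTED_DOCUMENT_EXTENSIONS


def is_supported_glossary(filename: str) -> bool:
    """True if the file extension appears in the glossary formats table."""
    return _extension(filename) in SUPPORTED_GLOSSARY_EXTENSIONS


if __name__ == "__main__":
    for name in ["report.DOCX", "terms.tsv", "photo.png"]:
        print(name, is_supported_document(name), is_supported_glossary(name))
```

The comparison is case-insensitive, so report.DOCX is treated the same as report.docx. Every glossary extension is also a document extension, which matches the tables: a .tsv or .xliff file can be submitted either as a document or as a glossary.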

Next steps