I am writing a Word to HTML converter. I am using the approach of saving Word file as Filtered HTML to carry out the conversion. I have automated this process using Microsoft.Office.Interop.Word for every Word document input that has to be converted. But, I see that the images are getting compressed regardless of the settings explained in office-docuemnt-image-quality-loss . The quality of screenshot images embedded in the word document becomes very poor after conversion to filtered html.
Unfortunately, manual replacement of the images with the original quality images cannot be done because of the sheer volume and size of the word documents. Zipping the docx file and accessing the native images does not help me, as these images are differently numbered from the filtered html images. The order is especially affected when shapes and icons are present in the word file.
Is there any way to remove such compression and have the image quality same as the original embedded images? Or, is there a better way to convert Word documents to HTML retaining the image quality?