FOSS Content Creator - Extract images from Word* docx with One Simple Command on GNU Linux* or use the proprietary application, WinZip*, for Windows*

If you use proprietary applications for documentation, you may not be aware of this but a docx is really an archive.

Often, these *.docx documents are used to transfer content to web developers to be placed for online content in Content Management Systems, wikis or blogs. Often, Word documents are the only resource for the images.

Example: 26 images

This example includes 26 images within a docx document. How, when using a proprietary solution, would you extract these images?

Option 1. Click each image and "Save As Picture" for all 26 images.

Option 2. Unzip in GNU Linux*

Option 3. Unzip in Windows*

Extract Images using GNU Linux

To extract images from a *.docx file, run the command in a GNU Linux terminal:

unzip NameofFile.docx

In this case, the document is 26images.docx so the command would be:

unzip 26images.docx

The docx unzips and extracts the xml and images.

click icon  Click the images below to open a larger display in a new window.

shell

 

Extracting

Extracted files and directories

Location of images:

/extractiondir/word/media

Proprietary Solution

If you must use proprietary software, WinZip* will extract the images. Click File > Open and select the file. The location of the images is:

word\media\allthepngsarehere.png

proprietary way

Conclusion

Extracting image files instead of the manual process of saving each image one at at time can save time and dollars for your projects. Streamline your projects using GNU Linux and you can potentially save man hours and dollars.

All images included in this blog post are GFDL. A sample docx is attached so you can try this for yourself. 

AnexoTamanho
Arquivo 26images.docx3.8 MB
standard
Para obter informações mais completas sobre otimizações do compilador, consulte nosso aviso de otimização.