Microsoft Office Tutorials and References
In Depth Information
Understanding the Office 2010 Default Formats: Office Open XML
Understanding the Office 2010 Default Formats:
Office Open XML
By default, the three document-centric programs in Office—Word, Excel, and PowerPoint—
use documents stored in the Office Open XML (OOXML) format. This XML-based format,
which was introduced with Office 2007, offers several advantages over the binary formats
used in Office 2003 and earlier versions. Most notably:
● Smaller files Each document file (a .docx, .xlsx, or .pptx file, for example) is
actually a container file in the ZIP archive format. The document parts—including the
document text, embedded images, formatting commands, and so on—are saved as
individual files within the container. Zip compression technology reduces the file size
as much as 75 percent. Office automatically unzips the container when you open a
document and rezips it when you’re done.
This not only means you need less disk space to store documents, but it facilitates
sending documents as e-mail attachments and reduces the time needed to move
files to and from network or Internet locations.
● More robust files Because the document file contains discrete files within a ZIP
archive, you should be able to open and work with the document even if some
component parts are missing or damaged.
● Better control over personal information With the older binary formats, some
notoriously embarrassing incidents arose in which snoops were able to find personal
information and earlier edits to a document. The Open XML formats support the use
of the document inspector (see Figure 4-1), which identifies potential privacy
problems (and other issues) and lets you remove personally identifiable information such
as author names, document revisions, file paths, and so on. For more information
about the document inspector and privacy concerns, see “Inspecting and Removing
Personal and Confidential Information” on page 821.
INSIDE OUT Dig into an Open XML document
If you’re curious about the structure and content of an Open XML document, it’s easy
to take a look inside. Start by changing the file name extension to .zip, and then open
the file as a compressed folder in Windows Explorer. In the root of the compressed
folder, you’ll see a document named [Content_Types].xml. This file, which you can view
in a web browser, identifies the parts of the document. The files that make up these
parts are located within subfolders of the compressed folder.
“How To: Manipulate Office Open XML Formats Documents” provides an introductory
tour of the structure and content of the files within a document file; you can find this
article in the MSDN library at w7io.com/10402 .