Upload
robert-m
View
215
Download
1
Embed Size (px)
Citation preview
PARALLEL PROCESSING tools for today's optiker
Electronic Information
Metadata: What You Don't Know Might
Hurt You BY ROBERT M. JOPSON
Metadata are data about data. Here we discuss metadata that
describe the contents of a word processor document. Because a document can contain information of which you are unaware, if you're not careful, you will be providing this information to others when you give them the document. The most notorious case involves the hacker who wrote the Melissa virus, which was a Microsoft Word document. The metadata it contained was used to track down the unsuspecting suspect.
We concentrate in this article on metadata produced by Microsoft Word 97 Service Release 2. Although Microsoft Word generates documents in a proprietary format, this format is one commonly accepted for technical publications and is used extensively. Documents written using any popular word processor are likely to contain some metadata similar to the types described here.
Document metadata often include the name of the author and perhaps an address. A virus writer would want to eliminate this information, for obvious reasons, but anonymity is also desirable in more innocent pursuits, such as refereeing a paper. OSA does not forward referee reports by e-mail, but some organizations do. If the organization sends your report to the author without modification, the author
can find out who wrote it. You can eliminate this "feature" from future documents by clicking "Tools," then "Options..." When you click on "User Information" you will see boxes containing the information the word processor has for your name, initials and address. The name information seems to be populated automatically, perhaps when Word is installed, so it is usually present. The other information is often absent. Modify or delete any information as desired and then click "OK." All newly created documents will have the new information. However, if you do not change the file name, existing documents retain the "User" information with which they were created even when modified and saved.
A comment contains the name and initials of the user who created it. Comments in your document may be visible or not depending on your view options (Click "Tools," then "Options...." then "View"). To see if any comments are present, put check marks next to "Hidden Text" and "Screen Tips" by clicking on the boxes if they are empty and then click OK. A yellow comment mark will appear in the location of each comment. You can write anonymous comments by modifying your User Information as described above. Remove an existing comment by right-clicking on the associated comment mark and clicking "Delete Comment."
A record is also created when you work on your document with revision tracking enabled (click "Tools," then "Track Changes," then "Highlight Changes..." put a check mark next to "Track changes while editing" by clicking the box, and click "OK"). Again, your "User Information" is used so if you have "anonymized" it, your name will not appear. However, a record will still be made of the revision history
if you allow changes to be tracked. To avoid this, remove the checkmark next to "Track changes while editing" using the procedure described above. This will put a halt to future revision tracking. To see if a document contains a revision history, click "Tools," then "Track Changes," then "Highlight Changes...," and put a checkmark next to "Highlight changes on screen." To cleanse any revision history you find, click "Tools," then "Track Changes," then "Accept or Reject Changes..." and follow the prompts for each separate revision.
Undesired information may be saved as a "property" of the document. To view and modify this information, click "File" and then "Properties." Information contained in the "Summary" tab can be changed. This can become difficult when you are connected to a network. The system may inject your network user name into the document when you open or save the document. If your system allows you to do so, modify the "Summary" information as desired while not logged into the network. Then log on and transfer the file as desired without opening it.
A simple technique for stripping off some types of metadata is to save the file in rich-text format (rtf), open it, and then save it as a MSWord document.
MSWord documents contain more types of metadata than can be described here. A more complete description of them together with methods of elimination can be found at http://support.microsoft.com/support/kb/articles/Q223/7/90.asp.
However, if you wish to know exactly what you are sending, convert your document to a text file and send the text file.
Bob Jopson works on lightwave systems at Bell Laboratories. He can be reached at [email protected].
40 Optics & photonics News/ October 2000