United States-English

Mostly Color

The end of PDF

Published 02 July 2007, 05:47 PM

This post echoes my 8 March 2007 post The end of JPEG, this time pointing to the standardization of print (or paper) specifications. I am reacting to last Friday's post on Andy Updegrove's Standards blog about Microsoft's submission of its XML Paper Specification (XPS) to ECMA International.

PDF is a direct descendant of Interpress, a device independent page description language originally designed at PARC and released in 1980. Although I am linking into Wikipedia, the information there is inaccurate: JaM was indeed based on Forth, and PostScript is a direct descendant of JaM, but Interpress is not really a direct descendant of JaM and PDF is not really a direct descendant of PostScript.

What do I mean by that? Although the same people where involved, JaM and PostScript are philosophically very different from Interpress and PDF. The former was a top notch and very elegant engineering effort by John Warnock to create a real world device independent page description language to replace the device dependent Press page description language.

After JaM was successful, PARC's computer language gurus that were behind Cedar did a clean-sheet design of a device independent page description language carried out with the methodology of specifying a computer language. Although is took advantage of the JaM experience it's author's collaboration, Interpress is much cleaner, more powerful, and more efficient. The main driving forces behind this effort were Butler Lampson and Bob Sproull.

In my view, PDF is more the sibling of Interpress than the son of Postscript, and I believe it is mostly the merit of Ed Taft that PDF has remained a pure and clean language to these days.

Those were the Seventies. If PDF would be specified today, it would very likely be as one of the XML languages. In fact, if one considers the evolution of PDF over the releases, one can say that it is somehow converging to XML.

When they were done with Interpress, what problem did PARC's computer language gurus tackle? In those days there where three computer systems in use at PARC — Cedar, Interlisp, and Smalltalk — and each had their own incompatible document preparation system. Therefore, there was a need for a system independent editable document description language, which would do for WYSIWG editors what Interpress did for printers. The outcome was Interscript and the main authors were Bob Ayers, Jim Horning, Butler Lampson, and Jim Mitchell.

Unfortunately, 1983, when Interscript was finished, was a very turbulent year at PARC and Interpress fell between the cracks. It lingered along in the Spinnaker project and at INRIA Sophia Antipolis, eventually strongly influencing ODA (Open Document Architecture), but there have not been pure-bred descendants with immaculate pedigree.

Although there was no "Ed Taft" for Interscript, through the years it has left its mark on SGML and XML. And this is where I would like to make my point. If a new standard language is proposed today, the inspiration should not come from Interpress but from Interscript.

Useful links:

Technorati tags: ,
Social bookmarking: DiggDigg del.icio.usdel.icio.us NowPublicNowPublic redditreddit NewsvineNewsvine GoogleGoogle

Posted By GiordanoBeretta | 2 Comments | Trackbacks | Permalink
Filed under:


Comments

You write, very correctly I think, that "If PDF would be specified today, it would very likely be as one of the XML languages." This is so clearly the direction the industry is moving in, and Microsoft, bless their sometimes off-target hearts, demonstrate with XPS (and their whole new graphics architecture) just what can be done with XML at the core of a new graphic-oriented file format. You then write, "In fact, if one considers the evolution of PDF over the releases, one can say that it is somehow converging to XML." Upon this I disagree. Adobe has made rudimentary efforts to add XML-style tagging functionality into the core of PDF over the last few releases, but have found, to no one's surprise that (again, as you've suggested) it really is not meant to be a horse of a different color. And so they have Mars, a half-hearted attempt to throw an XML wrapper around PDF (although this is not in the 1.7 spec they've offered to ISO). Too little, too late. We needed a fresh start, as Microsoft has done, and as others will do. That is the future of graphic file formats and their kin.
# Tuesday, July 03, 2007 01:43 AM by thadmc
thadmc, thank you for your insightful comment. I believe for the past eight or so years there has been an increasing chasm between what Adobe's technologists state publicly (e.g., at conferences like Seybold) and what we see in Adobe's products. I wonder what the strategists think about this. At ACM's conferences on digital publishing there are always interesting papers bridging or marrying PDF and XML, but these papers are mostly from European academics, not from the two largest software companies.
# Thursday, July 05, 2007 05:22 PM by Giordano Beretta

Leave a Comment

(required)  
(optional)
(required)  


Type the digits above:
Information disclosed in this community becomes public. Exercise caution when deciding to disclose your personal information. HP reserves the right, but is not obligated to, edit or remove your comment if it contains personally identifiable information or other content HP deems unacceptable.  Opinions expressed are your personal opinions or those of the original authors, and not of HP. Please see HP's web Terms of Use for more details.