Metadata is everywhere. We have to deal with it each and every day; sometimes it's critical for us, sometimes it's in our way. Although metadata is so ubiquitous, the term itself is not widely known in our society. Ask your friends what metadata is all about and, if they are not IT specialists, you might not get an answer. But you'll find metadata on every milk bottle, every pharmaceutical product and on the passport in your wallet. So what would happen if there were no metadata at all? Try to find an answer on Google and you'll notice an interesting article written by R. Todd Stephens, Ph.D. In it, he points out three scenarios:
This is an interesting perspective because it highlights not only the scenario where metadata is absent, but also the fact that a lack of context or an information overload can make metadata useless.
The first scenario is obvious. For example, if you were in the supermarket and there were no information on any of the cans, you'd be in trouble. Without data about the ingredients it is impossible to pick the right products, and the missing price makes it a challenge for the cashier to total up the amount you have to pay. The same holds for digital assets: how useful would a collection of MP3 sound files be without any further information such as title, album and artist?
In the second scenario the metadata has no context or has lost its context. Usually, metadata is closely attached to the content. In other words, there's no semantic difference between the label on a can of beans and the metadata properties stored within an MP3 file. However, whenever metadata loses its context it becomes useless. For example, when the label gets peeled off and you can't find the corresponding can on the shelf, or when descriptive metadata is stored only in a separate database and the assets are transferred out of that environment, you're lost.
Finally, the third scenario covers cases where there's either too much information available or the information is not well understood, which makes it difficult to identify the content just by its metadata. Beyond the examples in the referenced article, this might also be true for the package insert of a pharmaceutical product, or for a long-term archival document if there's no meta-metadata (for example, a schema description) available by which a future user can judge the meaning of the embedded metadata.
So the overall goal is to determine the right level of metadata: descriptive but not overwhelming, allowing the user in any situation to find and/or understand the content.
Interestingly enough, the European Parliament Elections 2009 campaign is currently running the following advertisement:
How much labeling do we need? European Parliament Elections 2009
But let's have a closer look at the world of digital asset management, an area where you'll usually find a lot of metadata-savvy people. Here's an interesting statement about the challenge of living with metadata as part of asset management products:
From Matt Kloskowski, "Confessions of a Lightroom Addict":
I hate metadata. I'm sorry, I had to say it. I can't stand it when I look at feature lists for Lightroom (or any other product for that matter) and I see anything with the word metadata listed as a feature. It's important stuff, I know, but it's also very boring. I just assume it should be there but don't try to sell it to me as a feature.
But actually in the next paragraph he says:
I love the benefits of metadata. The Metadata panel rocks and the benefits of good metadata support is very important. That's what makes me feel bad about the other confession above. It's an inner struggle I deal with daily ;)
Get the point?
On the one hand, metadata is the glue that holds our world and processes together; on the other hand, it's often so cumbersome to deal with. Let me tell you a quick story:
A couple of years ago, when my kids finished elementary school, the parents got together and the question came up of how to reward the teacher. A friend of mine had the idea of producing a small video as a giveaway for everybody, and he also suggested who should do the work…
As a result, I took my laptop and visited some parents, asking them for analog and digital photo and video material they had taken during school events. Whenever we tried to find suitable material stored on a computer (or on backup CDs), a journey began that took me through years of private family events and vacations before we actually found the relevant material showing the kids at school events. The only descriptive metadata were the names of the folders containing the images. Searching for "school photos" or "class trip 2006" was not possible. Yet each of the parents already had a serious number of images on disk, and I expect that searching through their images will get more and more painful as time goes on.
What is true for personal use sometimes applies to professionals as well. IT-savvy people know that an early investment in metadata pays off in the future, so what actually makes it so difficult to make this investment early in the cycle?
As a starting point let me ask the following questions:
The following list highlights some of the important themes. It's not meant to be complete but should give an impression of the landscape and some of its challenges:
| Manual input of information | Rich metadata through automatic creation |
| No awareness | Long term knowledge |
| No immediate benefit | Faster access to digital content |
| Bad quality and lack of trust | Enabling a lot of workflows |
Manual metadata entry is time-consuming and error-prone. That said, metadata works best whenever it's generated automatically (and accurately) and the user experience is adequate. In the domain of digital images, the Exif standard is a good example of metadata being generated automatically and put into context with an asset. For example, the information about when a picture was taken is supported on all cameras, and modern software products honor these properties correctly in subsequent workflows. Going forward, devices and tools will include smarter content analysis techniques to automatically add metadata to an asset. This ranges from automatic geotagging to face detection and recognition, along with more advanced analysis such as smile detection, age grouping, predominant colors, etc. As an example, Adobe supports automatic speech-to-text transcription in Adobe Premiere Pro CS4, built on top of Adobe's Extensible Metadata Platform (XMP), to enable enhanced metadata workflows across the production value chain.
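To make the idea of automatically generated metadata a bit more tangible, here is a minimal Python sketch. It does not read Exif or XMP; as a stand-in it only uses what the file system already knows about a file (name, size, modification time) plus a media type guessed from the extension. The function name `auto_metadata` and the dictionary keys are my own illustrative assumptions, not part of any standard.

```python
import mimetypes
import os
from datetime import datetime, timezone


def auto_metadata(path):
    """Collect metadata that can be derived without any manual input.

    Hypothetical helper: the keys are illustrative, not an Exif or XMP schema.
    """
    info = os.stat(path)
    # The media type is guessed from the file extension only, not from the content.
    media_type, _ = mimetypes.guess_type(path)
    return {
        "file_name": os.path.basename(path),
        "size_bytes": info.st_size,
        # Modification time as an ISO 8601 string in UTC.
        "modified": datetime.fromtimestamp(info.st_mtime, tz=timezone.utc).isoformat(),
        "media_type": media_type,  # None if the extension is unknown
    }
```

Nobody has to type anything for these properties to exist, which is exactly why they tend to be present and consistent. A real workflow would read the richer properties a camera or application has embedded in the asset rather than rely on file system data alone.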
Although people live with metadata, it's often not obvious to them how to use it to manage their environment effectively. The school example above shows that personal use of metadata is often restricted to the metadata generated by devices. But even within business, metadata is often not seen as a separate area of investment with its own budget and resources. Metadata is not a feature in itself, but it enables features and allows workflows to connect to each other. In particular, metadata is critical for digital media: we cannot put a sticker on a digital photo or turn it over to write on the back, so it is all the more important that metadata be embedded in the media.
One huge advantage of metadata attached early in the workflow is the ability to manage digital assets effectively across the various production steps and, for example, to find an asset quickly at any time. The return on investment (ROI) of metadata becomes more obvious if you compare a digital asset library with and without adequate metadata assigned to its assets. In the end, metadata is one of the most important production time-savers and will reduce your costs in the long term.
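The comparison between a library with and without metadata can be sketched in a few lines of Python. Everything here is made up for illustration: the `Asset` class, the two search functions, and the sample paths and keywords are assumptions, loosely modeled on the school video story above.

```python
from dataclasses import dataclass, field


@dataclass
class Asset:
    path: str
    keywords: set = field(default_factory=set)  # descriptive metadata, if any


def search_by_path(assets, term):
    """Without metadata: folder and file names are all we can match against."""
    term = term.lower()
    return [a.path for a in assets if term in a.path.lower()]


def search_by_keyword(assets, term):
    """With metadata: match the term against the keywords attached to each asset."""
    term = term.lower()
    return [a.path for a in assets if term in {k.lower() for k in a.keywords}]


# Hypothetical sample library.
library = [
    Asset("photos/2006/summer/img_0412.jpg", {"class trip 2006", "school"}),
    Asset("photos/2006/summer/img_0413.jpg", {"vacation"}),
    Asset("backup_cd_3/misc/dsc_0087.jpg", {"school", "graduation"}),
]

print(search_by_path(library, "school"))     # [] because no folder or file is named "school"
print(search_by_keyword(library, "school"))  # finds the two photos tagged "school"
```

The keyword search only works because somebody (or something) attached the keywords in the first place, which is the investment the ROI argument is about.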
Beyond that, content is still king. In other words, you often wouldn't need metadata if the content could be interpreted quickly and conveniently enough in real time. For example, detected faces would not have to be stored as metadata if every system and tool were capable of calculating them on the fly; speech-to-text metadata could be ignored if spoken words could be searched directly in the audio content.
This is related to the approach of search engines, which mainly try to gather information about assets by analyzing their content. Because metadata can be wrong or can simply be changed, some workflows and systems do not consider it a trustworthy source of information.
In the end, we will have to deal with metadata and respect it within our workflows as seamlessly as possible.
European Parliament Elections 2009
Life Without Metadata (R. Todd Stephens, Ph.D.)
Adobe Lightroom Killer Tips: Confessions of a Lightroom Addict (Matt Kloskowski)