DPCI Press Releases

DPCI helps launch Breakingnews.com, an aggregated news site > more

DPCI In The News

Bachana interviewed by Lana Gates of Software Magazine for article on content management > more

DPCI Events

Bachana to speak at Gilbane Conference, December 2-4, 2008 > more


In order to remain the first choice of healthcare professionals as well as pharmaceutical companies, Thomson Reuters Healthcare knew it had to reduce the amount of time between when information became available and when the publication would be ready for consumers. Thomson Reuters Healthcare selected DPCI to implement Typefi Publish, an automated publishing platform that supports the development of richly formatted, brand-compliant documents. > more

All case studies

August 01, 2008

Speak the Metadata

Ivan Mironchuk

We do quite a bit of consulting around metadata and taxonomy strategies for organizations, and one thing we find is always a topic of discussion is "How much metadata is too much?" The answer from us usually is: "The only metadata you should consider is information that is really relevant for searching, relating, or repurposing content."

Once the metadata strategy is determined, there is a fine line drawn to whittle down the actual number of metadata fields you'd like to have, vs. how many fields you think will actually be filled out. Filling out metadata can be a time-consuming task. 50 fields of information is most cases would be way too much! Some organizations will hire on additional staff to help fill out metadata, or police and control the quality of metadata entered by others.


While limiting down the number of metadata fields required for users to fill out is smart, what if there were an easy way to get more metadata information without more work?


I'll leave Text Mining Engines (used for metadata and content entity extraction) for another blog post, but I had an idea the other day. I was talking to an old college friend who regularly uses speech to text software to help write journal articles, and I thought "What if we could speak the metadata?" Speech to text software has improved quite a bit over the years, so I can't image that it would be that hard to integrate speech to text to help someone fill out key taxonomy terms for a piece of content or image. I know that I could describe 50 different properties of an image in just a matter of seconds where it could take many minutes to fill out the same 50 properties on a metadata form.


So what do you think? Worth exploring or waste of time? Let me know in the comments.

Posted at 06:15 pm by Ivan Mironchuk

Add comment


More Blogs From Author: