<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule">

<channel>
	<title>Digital Collections Blog &#187; metadata</title>
	<atom:link href="http://library.duke.edu/blogs/digital-collections/tag/metadata/feed/" rel="self" type="application/rss+xml" />
	<link>http://library.duke.edu/blogs/digital-collections</link>
	<description>Notes from the Digital Collections Team at Duke</description>
	<lastBuildDate>Wed, 04 Nov 2009 21:32:13 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/us/</creativeCommons:license>		<item>
		<title>First Wednesday Jan 7 &#8211; A Metadata Tool Project</title>
		<link>http://library.duke.edu/blogs/digital-collections/2009/01/06/first-wednesday-jan-7-a-metadata-tool-project/</link>
		<comments>http://library.duke.edu/blogs/digital-collections/2009/01/06/first-wednesday-jan-7-a-metadata-tool-project/#comments</comments>
		<pubDate>Tue, 06 Jan 2009 17:42:55 +0000</pubDate>
		<dc:creator>wsexton</dc:creator>
				<category><![CDATA[Trident]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://library.duke.edu/blogs/digital-collections/?p=331</guid>
		<description><![CDATA[Slides for my First Wednesday presentation tomorrow on the metadata tool project, including the exciting new project &#8220;code name&#8221; (which has been determined, pending a better idea) &#8230;

]]></description>
			<content:encoded><![CDATA[<p><em>Slides for my First Wednesday presentation tomorrow on the metadata tool project, including the exciting new project &#8220;code name&#8221; (which has been determined, pending a better idea) &#8230;</em></p>
<p><iframe src='http://docs.google.com/EmbedSlideshow?docid=ddpbx8xn_221cx4gwjd8' frameborder='0' width='410' height='342'></iframe></p>
]]></content:encoded>
			<wfw:commentRss>http://library.duke.edu/blogs/digital-collections/2009/01/06/first-wednesday-jan-7-a-metadata-tool-project/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>American Song Sheets</title>
		<link>http://library.duke.edu/blogs/digital-collections/2008/12/19/american-song-sheets/</link>
		<comments>http://library.duke.edu/blogs/digital-collections/2008/12/19/american-song-sheets/#comments</comments>
		<pubDate>Fri, 19 Dec 2008 19:17:56 +0000</pubDate>
		<dc:creator>Jill Katte</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[American Song Sheets]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://library.duke.edu/blogs/digital-collections/?p=301</guid>
		<description><![CDATA[
American Song Sheets, another new digital collection we published in October, includes approximately 1,800 broadsides and song sheets from nineteenth-century America. For this collection, we provide the song sheet images, as well as the searchable full text of the song lyrics. Will processed the full text to generate a collection-level &#8220;term cloud&#8221; based on commonly [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://library.duke.edu/digitalcollections/songsheets.bsvg301065/pg.1/"><img src="http://library.duke.edu/digitalcollections/images/bsv/thm/bsvg3010650010-thm.jpg" alt="" width="150" height="236" align="right" /></a></p>
<p><a href="http://library.duke.edu/digitalcollections/songsheets/">American Song Sheets</a>, another new digital collection we published in <a href="http://library.duke.edu/blogs/digital-collections/2008/10/24/changelog-2008-oct-24/">October</a>, includes approximately 1,800 broadsides and song sheets from nineteenth-century America. For this collection, we provide the song sheet images, as well as the searchable full text of the song lyrics. Will processed the full text to generate a collection-level &#8220;<a href="http://library.duke.edu/digitalcollections/songsheets/">term cloud</a>&#8221; based on commonly occurring words within the lyrics. This technique has proven useful for other collections, such as the Sidney Gamble Photographs of China <a href="http://library.duke.edu/digitalcollections/gamble/browse/">term clouds</a> (in two languages!) and the Americans in the Land of Lenin collection <a href="http://library.duke.edu/digitalcollections/esr/">term cloud</a>.</p>
<p>For the Song Sheets, Will also used full-text processing to enhance the <a href="http://library.duke.edu/digitalcollections/songsheets.bsvg301427/">metadata for each item</a> with &#8220;more frequent words&#8221; and &#8220;less frequent words.&#8221; These approaches allow us to support additional browsing pathways for our users without the costs of hand-crafted metadata.</p>
]]></content:encoded>
			<wfw:commentRss>http://library.duke.edu/blogs/digital-collections/2008/12/19/american-song-sheets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why digital collections aren&#8217;t just a big pile of stuff</title>
		<link>http://library.duke.edu/blogs/digital-collections/2008/10/17/why-digital-collections-arent-just-a-big-pile-of-stuff/</link>
		<comments>http://library.duke.edu/blogs/digital-collections/2008/10/17/why-digital-collections-arent-just-a-big-pile-of-stuff/#comments</comments>
		<pubDate>Fri, 17 Oct 2008 18:51:10 +0000</pubDate>
		<dc:creator>Rich</dc:creator>
				<category><![CDATA[metadata]]></category>
		<category><![CDATA[introduction]]></category>

		<guid isPermaLink="false">http://library.duke.edu/blogs/digital-collections/?p=171</guid>
		<description><![CDATA[
Continuing the introductions &#8230; I&#8217;m Rich Murray, and I&#8217;m one of two Metadata Librarians in the Digital Collections Program.  I&#8217;m based in the Cataloging &#38; Metadata Services Department, and I work with Noah Huffman (in the Rare Book, Manuscript, &#38; Special Collections Library) to plan and create metadata for our digital collections.
What does that [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://library.duke.edu/digitalcollections/adaccess.T1572/pg.1/"><img src="http://library.duke.edu/digitalcollections/images/adaccess/T/T15/T1572/T1572-med.jpeg" alt="" width="214" height="305" /></a></p>
<p>Continuing the introductions &#8230; I&#8217;m Rich Murray, and I&#8217;m one of two Metadata Librarians in the Digital Collections Program.  I&#8217;m based in the Cataloging &amp; Metadata Services Department, and I work with Noah Huffman (in the Rare Book, Manuscript, &amp; Special Collections Library) to plan and create metadata for our digital collections.</p>
<p>What does that mean, exactly?  Basically, Noah and I –- and the rest of the metadata team –- work to describe, organize, and allow users to discover the cool stuff in our digital collections.  Metadata is &#8220;data about data,&#8221; and without it, a 5000-item digital collection is like 5000 photographs thrown into a big pile.  You might be able to find what you want by going through them all one at a time, but it will probably take forever, and you may get to the end and discover that what you were looking for wasn&#8217;t in the big pile anyway.</p>
<p>With good metadata, though, you can find what you&#8217;re looking for much more efficiently and painlessly.  We group objects into categories based on subject, format, time period, or anything else that makes sense.  We apply captions to images, keywords to advertisements, plot summaries to videos, and anything else we think will help you find what you&#8217;re looking for.  And if we&#8217;re doing our job right, the metadata we provide might even lead you to really, really cool stuff that you didn&#8217;t even know you were looking for.</p>
<p>Fortunately, Noah and I don&#8217;t have to do all this on our own.  We work with a great group of folks, including the rest of the Digital Collections Implementation Team, the Metadata Advisory Group, and other staff throughout the libraries.  It&#8217;s a team effort, and as our Digital Collections Program grows, more and more of us are involved in making it happen.</p>
<p>The other part of my job, which may sound completely unrelated at first but really isn&#8217;t, is serving as the Catalog Librarian for Spanish &amp; Portuguese (and Catalan and Galician) Languages.  Both parts of my job involve describing, arranging, and providing access to the library&#8217;s collections so you can find what you need.</p>
<p>Metadata isn&#8217;t something new –- it&#8217;s what librarians and archivists have been doing all along, even if we called it something else.  Connecting people and ideas is what do best, and as part of the Digital Collections Program, I get to spend my days bringing some truly remarkable resources to an audience around the world.  It doesn&#8217;t get much better than that.</p>
]]></content:encoded>
			<wfw:commentRss>http://library.duke.edu/blogs/digital-collections/2008/10/17/why-digital-collections-arent-just-a-big-pile-of-stuff/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>A metadata tool that scales</title>
		<link>http://library.duke.edu/blogs/digital-collections/2008/10/10/a-metadata-tool-that-scales/</link>
		<comments>http://library.duke.edu/blogs/digital-collections/2008/10/10/a-metadata-tool-that-scales/#comments</comments>
		<pubDate>Fri, 10 Oct 2008 12:22:07 +0000</pubDate>
		<dc:creator>wsexton</dc:creator>
				<category><![CDATA[Trident]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://library.duke.edu/blogs/digital-collections/?p=111</guid>
		<description><![CDATA[In January of 2007 I sent a post to the Web4lib list titled &#8220;Metadata tools that scale.&#8221; At Duke we were seeking opinions about a software platform to capture metadata for digital collections and finding databases.  The responses to that inquiry suggested that what we were looking for didn&#8217;t exist.
About a year ago, an OCLC [...]]]></description>
			<content:encoded><![CDATA[<p>In January of 2007 I sent a post to the Web4lib list titled <a href="http://lists.webjunction.org/wjlists/web4lib/2007-January/042719.html">&#8220;Metadata tools that scale.&#8221;</a> At Duke we were seeking opinions about a software platform to capture metadata for digital collections and finding databases.  The responses to that inquiry suggested that what we were looking for didn&#8217;t exist.</p>
<p>About a year ago, an OCLC report on a survey of 18 member institutions, <a href="http://www.oclc.org/programs/publications/reports/2007-03.pdf">&#8220;RLG Programs Descriptive Metadata Practices Survey Results,&#8221;</a> supported that basic conclusion.  When asked about the tools that they used to &#8220;create, edit and store metadata descrptions&#8221; of digital and physical resources, a sizable majority responded &#8220;customized&#8221; or &#8220;homegrown&#8221; tool.</p>
<p>Since my initial inquiry, we launched <a href="http://library.duke.edu/digitalcollections/">a new installation of our digital collections</a>.  Yet we still lack a full-featured software platform for capturing descriptive metadata.<br />
<span id="more-111"></span><br />
We did our own informal survey of peer institutions building digital collections, which further reinforced that familiar conclusion &#8212; there are lots of Excel spreadsheets, Access and FileMaker databases, etc.,<br />
out there, but no enterprise-level solution.</p>
<p>We also articulated a detailed series of specifications for a metadata tool.  The library has committed to hiring two programmers each to a two-year appointment for this purpose.  The job description is <a href="http://library.duke.edu/jobs/systemsprogrammer.html">here</a>, and there are two openings for it.</p>
<p>When we thought through through the specifications for a metadata tool, we went on and envisioned a full-featured platform to support digital collections, including digitization, asset management and a front end.   I&#8217;m going to list in some detail the specifications from what we refer to as the Digitization and Description use cases.</p>
<h3><span style="color: #000000;">Digitization</span><strong> </strong></h3>
<p><strong>* Supports identification</strong> – an itemized list representing each scan in the collection. This list includes 1st pass descriptive metadata, structural metadata (how the components of an item are related to  one<br />
another) and location metadata (box #, folder #, album, chapter, etc) identifiers, item type and dimensions.</p>
<p><strong>* Check out systems</strong> – The ability to create an arbitrary grouping of consecutive components that can be &#8220;checked out&#8221; by a scanner operator during the scanning/QC process. These units include information about the physical location of the materials and any pertinent information needed to scan. Needs to enable multiple users at the same time.</p>
<p><strong>* Reporting interface</strong> – In conjunction with the checkout system above the system needs to be able to report statistics such as total number of items in a collection, number of items left to scan and/or qc, the average amount of time it take to scan a &#8220;unit.”</p>
<p><strong>* Pulls technical metadata from the image header</strong> – Must be able to extract technical metadata from image files.</p>
<p><strong>* Student worker login access</strong> – Restricted read/write access through login credentials</p>
<p><strong>* Generates image derivatives according to dimension specification</strong> – Generation of derivatives via batch processing with the option to change dimensions to fit changing web displays. Must use color profiles and have options to control compression quality.</p>
<p><strong>* Generates checksums</strong> – Creates checksums of files that move through the system to ensure that the files have not become altered.</p>
<h3><span style="color: #000000;">Description</span><strong> </strong></h3>
<p><strong>* Supports Duke Core (multiple metadata schema) metadata creation</strong> – Duke Core, a modified version of qualified Dublin Core, is the standard metadata schema developed for digital collections at the Duke libraries by the Metadata Advisory Group.</p>
<p><strong>*  Authority lists</strong> &#8212; including sharing authority lists between similar projects, setting default dropdowns for all projects/items (e.g. Type), etc.</p>
<p><strong>* Set mandatory fields and cardinality constraints.</strong></p>
<p><strong>* Assign values in mass to every item in a collection</strong> – Collections often have particular metadata that needs to be applied to every item in the collection (e.g., subject terms, creator, etc.).</p>
<p><strong>* Find and edit existing records easily.</strong></p>
<p><strong>* Integrates with digitization workflow.</strong></p>
<p><strong>* See digital object while editing metadata.</strong> Users should be able to see the digital object while they are creating or editing the corresponding metadata.  Does not have to be the highest-resolution image, but a working version.</p>
<p><strong>* Displays record status</strong> – Allows catalogers to specify the state of a record.  System should allow catalogers to specify this status and list records in a way that provides at-a-glance overview of work remaining within a collection.</p>
<p><strong>* Handle item-level metadata-only records</strong> – Some of our digital collections are metadata only. The tool must allow users to create and edit metadata records that do not have an attached digital object.</p>
<p><strong>* User Interface simple and intuitive, distributed system.</strong> Could be web-based – The interface should be simple and intuitive, and should allow multiple users to work at the same time, though they should not be able to edit the same record at the same time.  A web-based tool would allow users to work on digital collections from anywhere and would not require them to use a computer with particular software installed.</p>
<p><strong>* Supports UTF-8 universal character sets</strong> – Metadata for Duke&#8217;s digital collections often includes special characters (diacritics, non-Roman characters, etc.). The tool must accommodate UTF-8 character sets.</p>
]]></content:encoded>
			<wfw:commentRss>http://library.duke.edu/blogs/digital-collections/2008/10/10/a-metadata-tool-that-scales/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Megadata, an Introduction</title>
		<link>http://library.duke.edu/blogs/digital-collections/2008/10/09/megadata-an-introduction/</link>
		<comments>http://library.duke.edu/blogs/digital-collections/2008/10/09/megadata-an-introduction/#comments</comments>
		<pubDate>Thu, 09 Oct 2008 15:22:10 +0000</pubDate>
		<dc:creator>wsexton</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://library.duke.edu/blogs/digital-collections/?p=91</guid>
		<description><![CDATA[I&#8217;m Will Sexton, Metadata Analyst / Programmer here at Duke University Libraries.  My job focuses on technical support for the metadata-heavy stuff:  finding aids, finding databases (like this one) and digital collections.  I&#8217;m part of a great team that includes Sean Aery, who designed the front end for our digital collections platform.  Sean and I [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m Will Sexton, Metadata Analyst / Programmer here at Duke University Libraries.  My job focuses on technical support for the metadata-heavy stuff:  <a href="http://library.duke.edu/digitalcollections/rbmscl/inv/">finding aids</a>, finding databases (<a href="http://library.duke.edu/digitalcollections/road/">like this one</a>) and <a href="http://library.duke.edu/digitalcollections/">digital collections</a>.  I&#8217;m part of a great team that includes Sean Aery, who designed the front end for our digital collections platform.  Sean and I will present on that project next week at the <a href="http://www.ala.org/ala/mgrps/divs/lita/litaevents/litaforum2008/litaforum2008.cfm">LITA National Forum</a>.</p>
<p>When I took this position six years ago my job title was the only &#8220;metadata&#8221; anything in the library.  Now we have two Metadata Librarians, a committee called the Metadata Advisory Group, and an internal metadata &#8220;standard&#8221; named Duke Core (derived from <a href="http://dublincore.org/">that other core</a>).  What used to be our Cataloging Department is now known as Cataloging &amp; Metadata Services.  Yes (rubs hands together, cackles) my plan is working beautifully.  Next I will change my job title to &#8220;Toll House Cookies for Everyone Analyst.&#8221;</p>
<p>Anyway, the first time I told a friend of mine outside of the library field about my job, she said, &#8220;Huh?  Megadata?  What&#8217;s megadata?&#8221;  This particular friend was in law school at the time, so now when people ask her, &#8220;What do you do?&#8221; she says &#8220;I&#8217;m a lawyer.&#8221;  I gave up answering that question directly; now I just say I&#8217;m a computer programmer (partially true) or a librarian (nominally untrue, though true in the sense of &#8220;a person who works in a library on library stuff&#8221;).</p>
<p>But at least now I have to explain my job less when I&#8217;m IN the library than I did six years ago.</p>
<p>Anyway, this Word Press thingie for digital collections has been sitting up on cinder blocks on the side of the house for a while, and I thought I&#8217;d take it for a spin.  Wheeee!  Before I move on to a subject other than &#8220;me me me&#8221; I&#8217;ll add that I contribute 6-to-8 hundred words of topical observation to the <em>Chapel Hill News</em>&#8216; &#8220;My View&#8221; feature every 7 weeks or so.  My <a href="http://www.chapelhillnews.com/opinion/my_view/story/17040.html">most recent column</a> attempts to make issues relating to &#8220;megadata&#8221; and library technology seem like the kind of thing you talk about in a newspaper.</p>
<p>Coming soon &#8230; a post about metadata.</p>
]]></content:encoded>
			<wfw:commentRss>http://library.duke.edu/blogs/digital-collections/2008/10/09/megadata-an-introduction/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
