Guidelines for provision of metadata for digital collections at the Duke University Libraries, including the Duke Core element sets. Created and maintained by the Metadata Advisory Group.
Drafted in support of the Strategic Plan Goal 2 initiative
The primary purpose of descriptive metadata is to assist researchers in the discovery of resources relevant to their research objectives. The library assembles a large number of digital objects in a repository system that researchers can access in a wide variety of scenarios. Two principles come into play when approaching description for these digital objects: interoperability and specificity. Interoperability ensures that the end user can search across collections in a unified repository setting. Specificity, on the other hand, highlights the uniqueness of a collection and its objects. It is important to keep both principles in mind when analyzing materials for digital collections.
In addition to resource discovery, interoperability, and specificity, metadata allows us to express relationships between digital objects, organize digital collections, and manage the digitization workflow.
The Perkins Library system employs a core set of metadata elements for all objects in its digital collections repository system. In many cases, this Duke Core set will be sufficient as-is to describe all of the items in a collection. In other cases, items may be described using the core set as described below with some refinements. For some unique digital resources, however, we may use distinct, domain-specific schemas. Since one of our goals for digital collections is to guarantee interoperability, we require that the deviations from the Duke Core schema be crosswalked or transformed semantically to Duke Core.
The Duke Core metadata standard includes guidelines for describing digital collections at the collection, category, and item levels. Collection-level description is required for all digital collections, but not all collections require category- or item-level description. For example, very large collections generally benefit from being broken into categories based on subject, material type/format, or other characteristics, but this may not be necessary for small collections. Decisions about the level and type of description for a digital collection should be made in conjunction with the Metadata Librarians and the Metadata Advisory Group, who will discuss options with and make recommendations to the project proposer. Among the questions to be considered when developing a metadata plan for a digital collection are:
While the initial inclination may be to provide item-level metadata for every item in every digital collection, this is not always desirable or feasible. Good metadata at the category level can often provide the same quality of access to a digital collection with much less expenditure of resources than item-level description.
“Found metadata” should be used whenever possible. Collections are rarely digitized with no existing metadata whatsoever. Found metadata may include slide labels, logbooks, MARC records, notes jotted on the back of photographs, or other similar forms. It is usually easier to massage existing metadata than to create new metadata from scratch. The Metadata Librarians and Metadata Advisory Group can advise collection sponsors on the best way to use, reshape, or supplement existing metadata.
A three-pass approach should be used for digital collections:
1st pass: Selection, Identification, & Work Plan
Phase of analysis before digitization and description. Identify items in the system; perform analysis on structures to establish item-level records and item parts. Assign identifier, working title, sufficient identifying metadata to assist with digitization and to set up description. Create a work plan for the project that describes how the work will be done, identifies who will do it, and estimates the time required to do the work.
2nd pass A: Description
Metadata providers take over and describe materials following guidelines set out by the Metadata Advisory Group.
2nd pass B: Digitization
The Digital Production Center produces technical metadata to accompany digital objects.
3rd pass: Quality Control
Review for quality and accuracy.
The three element sets (collection-level, category-level, and item-level) of Duke Core are presented below. Within each element set, required elements are presented first and are then followed by non-required elements in alphabetical order.
*Required element
Collection-level metadata provides information about a collection’s existence and availability to interested users. It describes the subject, context, and nature of the materials reflected in the unit being digitized. This information can be used for the indexing, discovery, management, and identification of the digital resource.
Collection Title*
Description: Name or label given to the collection by the sponsor.
Guidelines:
Required: yes
Repeatable: no, but element Collection.Short.Title may be used as a shortened form of long titles
Crosswalks: MARC 245
EAD <archdesc><did><unittitle>
VRA Core <vra: title>
Examples:
Historic American Sheet Music
The Urban Landscape Digital Image Access Project
Medicine and Madison Avenue
Construction of Duke University, 1924-1932
Collection Abstract*
Definition: This element provides brief biographical or historical information about the creator of the collection and information about the nature of the materials so that users can judge the collection’s potential relevance.
Guidelines:
Crosswalks: MARC 5XX
EAD <abstract>
Required: Yes
Repeatable: No
Examples:
William Gedney Photographs and Writings
From the mid 1950s through the early 1980s, William Gedney (1932-1989) photographed throughout the United States, in India, and in Europe. From the commerce of the street outside his Brooklyn apartment to the daily chores of unemployed coal miners, from the indolent lifestyle of hippies in Haight-Ashbury to the sacred rituals of Hindu worshippers, Gedney was able to record the lives of others with remarkable clarity and poignancy. These photographs, along with his notebooks and writings, illuminate the rare vision of an intensely private man who, as a writer and photographer, was able to reveal the lives of others with striking sensitivity. Included here are selections from Gedney's finished prints, work prints, contact sheets, notes, notebooks, handmade photographic books, book dummies, and correspondence. The material ranges in date from the 1950s-1989.
James Cannon III worked as an educator and administrator at Trinity College and Duke University from 1919 to 1960, serving as Dean of the Divinity School from 1951 to 1958. The Canon papers contain correspondence, manuscript drafts of sermons and publications, course materials, subject files, clippings, photographs, ephemera, and scrapbooks relating to the life and work of James Cannon III. Major subjects include biblical literature and the Bible as literature, religious education, missionary training, history of religion, Christianity, Buddhism, comparative religion, and the Cannon family, especially James Cannon, Jr. and James Cannon IV. The material ranges in date from 1912-1974.
The University Task Force on Child Care was appointed by President Keith Brodie in 1988 and chaired by Dr. Paula Burger, Vice Provost for Academic Services. It was dissolved in 1990. The collection includes minutes, correspondence, reports, audiocassettes, and printed material documenting the creation and organization of the task force and its study of child care issues at Duke University. It ranges in date from 1985-1990.
Collection Capsule*
Description: A single-sentence condensation of the Collection Abstract, written in a standard format. The Collection Capsule may be displayed on web pages in conjunction with the Collection Short Title and Collection Icon to provide information about the content of a collection.
Guidelines:
Required: yes
Repeatable: no
Crosswalks: N/A
Examples:
Images of over 7,000 advertisements printed in U.S. and Canadian newspapers and magazines between 1911 and 1955.
75 photos, letters, and scrapbooks related to the construction of the Duke campus, dated 1924-1932.
Collection Icon*
Description: An image used to brand objects as part of a collection. The Collection Icon stands in as visual shorthand for that collection.
Guidelines:
Required: yes
Repeatable: no
Crosswalks: N/A
Examples:
See images available at:
http://library.duke.edu/digitalcollections/imgs/adaccessthumb.jpg
http://library.duke.edu/digitalcollections/imgs/hmp-thumb.jpg
Collection Rights*
Description: Information on copyright or restrictions-related conditions regarding use of the collection.
Guidelines:
Required: yes
Repeatable: no
Crosswalks: MARC 540
EAD <userestrict>
Examples:
The materials in this collection are made available for use in research, teaching and private study. Texts and images from this collection may not be used for any commercial purpose without prior permission from Duke University.
All copyrights that exist in this material have not been transferred to Duke University. When use is made of these texts and images, it is the responsibility of the user to obtain additional permissions as necessary and to observe the stated access policy, the laws of copyright and the educational fair use guidelines.
The materials in this collection are made available for use in research, teaching and private study. Texts and images from this collection may not be used for any commercial purpose without prior permission from Duke University. Although these texts and images are made publicly accessible for the limited uses described above, they are not all in the public domain. Where copyright persists in this material, that right is owned either by Duke University or by the creators of the object or their descendents. When use is made of these texts and images, it is the responsibility of the user to secure any necessary permissions and to observe the stated access policy, the laws of copyright and the educational fair use guidelines.
Owner Name*
Description: The name of the organization or administrative body that provides oversight of the collection.
Guidelines:
Required: yes
Repeatable: no
Crosswalks: MARC 710, 791
EAD <publisher>
Examples:
Duke University Archives
Public Documents and Maps
Owner Phone*
Description: The phone number of the organization or administrative body that provides oversight of the collection.
Guidelines:
Required: yes
Repeatable: no
Crosswalks: N/A
Examples:
919-660-5822
Owner URL*
Description: The URL of the organization or administrative body that provides oversight of the collection.
Guidelines:
Required: yes
Repeatable: no
Crosswalks: N/A
Examples:
http://library.duke.edu/uarchives/
http://docs.lib.duke.edu/
Collection Short Title
Description: A shortened version of the collection title used for display purposes to prevent formatting problems.
Guidelines:
Required: no, use only when the collection title will cause complications in display
Repeatable: no
Crosswalks: MARC 246
EAD <unittitle>
VRA Core <vra: title>
Examples:
Collection Title: The Urban Landscape Digital Image Access Project
Collection Short Title: The Urban Landscape
*Required element
Category-level metadata can be used to provide structure and organization to a collection that facilitates browsing. Categories can make a large digital collection more manageable and navigable for end users. Additional description and access rights can be applied to categories of materials. Each item in a collection can belong to more than one category.
Category Title*
Description: Name or label given to the category. Usually the Category Title relates to common attributes of the digital objects, such as subject, format, provenance, or special copyright restrictions.
Guidelines:
Required: yes
Repeatable: no; each category should have only one Category Title
Examples:
Composers, 1962-1968
Scrapbooks
Kentucky, 1964 and 1971
Category Abstract
Definition: This element provides brief information about the category and the nature of the materials so users can judge the category’s potential relevance.
Guidelines:
Required: No
Repeatable: No
Examples:
Administrative Records, 1925-1930
Administrative records pertaining to Duke Construction. Series includes payroll records and invoices for dormitory items and lumber.
Television (1939 - 1957)
Advertisements featuring television sets, as well as a few other miscellaneous electronics, are included in this category. An interesting feature of many of these ads is their promotion of individual television programs.
Category Subject
Definition: What the content of the category is about or what it is, expressed by topical, personal, corporate, or geographic terms for significant people, places, organizations, events, and topics reflected.
Guidelines:
Required: no, unless category subjects need to be applied to item-level records
Repeatable: yes
Examples:
Business records
Duke University -- Aerial views
Beauty and Hygiene -- Shaving Supplies
Category Rights
Description: Information on copyright or restrictions-related conditions regarding use of the material in the category.
Guidelines:
Required: only when specific categories have special rights conditions
Repeatable: no, each category can only have one rights statement
Examples:
Most of the photographs in this category were taken by Dixon or other photographers working for the Durham Herald and are copyrighted by the Durham Herald-Sun newspaper. They are presented here for the purposes of scholarly research and study. Permission for further use and reproduction of these photographs must be obtained from the Herald-Sun.
Essay Link
Definition: The Essay Link element provides a connection to further information about the category that may be too lengthy for the Category Abstract element.
Guidelines:
Required: No
Repeatable: No
Examples:
Category Title: Beauty and Hygiene
Category Abstract: This category includes products (such as cosmetics, soaps, and shaving supplies) that were targeted for personal hygiene and enhancing one's attractiveness.
Essay Link: http://library.duke.edu/digitalcollections/adaccess/cosmetics.html
Parent Category
Definition: The larger category to which a subcategory belongs.
Guidelines:
Required: Yes, when applicable
Repeatable: No
Examples:
Transportation (1938 - 1957)
[Parent Category of Airlines: 1951]
Photographs
[Parent Category of Kentucky, 1964 and 1971]
*Required element
Title*
Description: Name or label given to the resource by the creator or publisher; may also be identifying phrase or name of the resource supplied by the cataloger. Title can be used for captions, advertising headlines, or other distinguishing information.
Guidelines:
Required: yes
Repeatable: no, but refinement Title.Alternative may be used to record other title information, such as a caption title, former title, spine title, artist's title, object name, translation of title, or other variations of the title. Apply the same guidelines for both Title and Title.Alternative.
Crosswalks: MARC 245
EAD <unittitle>
VRA Core <vra: title>
Examples:
North wall of columns of Temple of Bacchus
What is Riot Grrrl?
Tonight! No large pores need mar loveliness
Duke Men’s Basketball vs. South Carolina
Two women at Norton State Hospital, Kansas
Diary entries for March 27-April 2, 1925, from the diary of B. M. Hall, concerning the layout of Duke University
Type*
Definition: A term drawn from a controlled vocabulary that describes the genre or nature of the resource. At least one term should be drawn from a vocabulary supplied by the Library; additional terms may be drawn from vocabularies specific to the domain from which the materials are drawn.
Guidelines:
Required: yes
Repeatable: yes
Crosswalks: MARC 655 (MARC)
EAD <genreform>
VRA Core: <vra: worktype> in <vra: work> or <vra: collection>
Examples:
Broad terms:
Sheet music
Advertising
Photographs
Examples of more specific terms for photographs: Documentary photographs, Portraits.
Identifier*
Definition: URL required to access the resource in its digital format.
Guidelines:
Required: yes
Repeatable: no, provide only one “actionable identifier” or URL pointing to the digital object.
Crosswalks: MARC: 856
EAD: <dao>
Examples:
DC.Identifier= http://library.duke.edu/digitalcollections/gedney.KY0178/
Creator
Definition: Entity or entities primarily responsible for creating the intellectual content of the resource, including individuals, families, and corporate bodies. Examples include authors of written documents, artists, illustrators, photographers, collectors, organizations, etc.
Guidelines:
Creator role = “illustrator”
Required: Yes, if known/available
Repeatable: Yes
Crosswalks: MARC 1XX, 7XX
EAD <author> in <eadheader>
<name>, <origination> <persname>, <origination>, <famname> in <archdesc>
VRA Core <vra: agent> <display> in <vra: work> or <vra: collection>
Examples:
Few, William Preston, 1867-1940
Nicholas School of the Environment and Earth Sciences (Duke University)
Blackwell family
Hicks, Henrietta Virginia
Tomlinson Store (Randolph County, N.C.)
<creator role=“composer”>Berlin, Irving, 1888-</creator>
<creator role=“performer”>Carus, Emma</creator>
<creator role=“performer”>Marks, Clarence</creator>
<creator role=“illustrator”>Frew</creator>
Date
Definition: Creation or modification dates for the original resource; may include span dates for collections of material.
Guidelines:
Required: if applicable
Repeatable: no
Crosswalks: MARC 260$c, 245$f, 008/07-14
EAD: <unitdate>; <date>
Examples:
|
Date Type |
Display Date |
Normalized ISO 8601 Equivalent |
|
Single dates |
2003 July 4 |
2003-07-04 |
|
2003 July |
2003-07 |
|
|
2003 |
2003 |
|
|
circa 1995 |
1995 |
|
|
2002? |
2002 |
|
|
|
|
|
|
Range dates |
2003 July 4 - 2003 July 10 |
2003-07-04/2003-07-10 |
|
2003 July - 2003 August |
2003-07/2003-08 |
|
|
1912 September 26 - 1919 April 16 |
1912-09-26/1919-04-16 |
|
|
2003 - 2004 |
2003/2004 |
|
|
1970s |
1970/1979 |
|
|
13th century |
1201/1300 |
|
|
13th or 14th century? |
1201/1400 |
|
|
late 15th century |
1450/1500 |
|
|
mid-1950s |
1952/1958 |
|
|
|
1985 or 1990? |
1985/1990 |
Description
Definition: A textual description of the content of the resource, such as an abstract, table of contents, or a free-text account of the object. Use to provide information to supplement the Title, if necessary.
Guidelines:
Required: no
Repeatable: no
Crosswalks: MARC 520, 505
EAD <abstract>, <scopecontent>
VRA Core <vra: description> in <vra: work> or <vra: collection>
Examples:
Children with young couple on ferry; pier and water in background.
Photograph of an architectural rendering of the library looking toward the Chapel.
Includes list of rental properties for workers with addresses and weekly rent rates. In the letter, Powe offers Lee the rental of a large house near the Erwin Mills offices.
Extent
Description: Physical description of the original object that has been digitized, such as number of sheets/pages, dimensions of the physical object, etc.
Guidelines:
Required: no
Repeatable: noDouble-sided ; 10 cm x 8 cm
4 pages ; 11 cm x 6 cm
Identifier.Location
Definition: A call number or location code required to access the original resource.
Guidelines:
Required: if applicable, i.e., if digital resource is a surrogate of an accessible resource in Duke University Libraries.
Repeatable: yes, you may reflect multiple “holdings” or physical items from which a digital item may have been derived.
Crosswalks: MARC: 05X-09X
EAD: <container>, <physloc>
Examples:
Identifier.Callnumber.Box: Box 7
Identifier.Callnumber.Folder: Folder 23, Duke's Chapel
Identifier.Callnumber.Box-Folder: Box 22, Folder 147
Identifier.Callnumber.Call Number: AC75 .M87 2003
Identifier.CallnumberPam: E Pam 778
Illustrated
Description: This field indicates whether an item that is primarily textual also has one or more illustrations.
Guidelines:
no
Repeatable:
noLanguage
Description: The language(s) of the intellectual content of the resource. This is the language(s) in which a text is written or the spoken language(s) of audio or video resources. Still images do not usually have a language unless there is significant text in a caption or in the image itself.
Guidelines:
if applicable
Repeatable:
yes
Examples:eng
chi
glg
Rights
Description: Contains a rights management or usage statement, a URL that links to a rights management statement, or a URL that links to a service providing information on rights management for the resource. A rights management statement may contain information concerning accessibility, reproduction of images, copyright holder, restrictions, securing permissions for use of text or images, etc.
Guidelines:
Required: yes, if applicable when certain digital objects need item-specific rights statements. Otherwise, most digital objects will inherit rights statements from collection-level metadata.
Repeatable: yes
Crosswalks: MARC 506, 540, 561
EAD <accessrestrict>, <userestrict>
VRA Core
Examples:
This work may be used freely by individuals for research, teaching and personal use as long as this statement of availability is included in the text. See our statement on use and reproduction at http://scriptorium.lib.duke.edu/eaa/copyright.html for more information.
Source
Definition: Name of the collection or resource from which the digital object was derived.
Guidelines:
Required: if applicable
Repeatable: if applicable
Crosswalks: MARC 786, 534
Examples:
Horace Trumbauer Architectural Drawings
J. Walter Thompson Company Archives
Public Documents and Maps
ISBN: 0970022506
ISBN: 9781591583646
Subject
Definition: What the content of the resource is about or what it is, expressed by topical, personal, corporate, or geographic terms for significant people, places, organizations, events, and topics reflected.
Guidelines:
Crosswalks: MARC 6XX
EAD <subject>, <corpname>, <famname>, <genreform>, <geogname>, <persname>, <title>
VRA Core <vra: stylePeriod>, <vra :location type="creation">,
<vra: subject><term>
Required: no
Repeatable: yes
Examples:
Title: Certificate authorizing acquisition of new adult bicycles
Description: Three part certificate for distribution, inventory and acquisition of new adult bicycles. Part A is the approval form and inventory control. Part B authorizes acquisition of new stock. Part C is the receipt for the consumer. Parts are ink color coded.
Subject:
Rationing -- United States -- History -- 20th century xsi:type="LCSH"
Bicycles xsi:type="LCSH"
Title: Some illustrations of the importance of Duke University to the city of Durham
Description: Speech given on September 29, 1930, by Duke University President William Preston Few to the [Durham] Rotary Club, titled "Some illustrations of the importance of Duke University to the City of Durham." These illustrations include the medical facilities, the international diversity of the Graduate School of Arts and Sciences, and the College for Women.
Subject:
Duke University -- History xsi:type="LCSH"
Durham (N.C.) -- History xsi:type="LCSH"
A project involving documents and pictures from Civil War battles could use a list of the LC subject headings for the battles and campaigns, beginning:
Aiken, Battle of, Aiken, S.C., 1865
Allatoona Pass, Battle of, Ga., 1864
Antietam, Battle of, Md., 1862
Appomattox Campaign, 1865
Asheville, Battle of, Asheville, N.C., 1865
Atlanta Campaign, 1864
Ball's Bluff, Battle of, Va., 1861
Beaver Dam Creek, Battle of, Va., 1862
Belmont, Battle of, Belmont, Mo., 1861
Title.Alternative
Description: Additional name or label given to the resource by the creator or publisher. Title can be used for captions, advertising headlines, or other distinguishing information, such as caption title, former title, spine title, artist's title, object name, translation of title, or other variations of the title.
Guidelines:
Required: no
Repeatable: yes
Crosswalks: MARC 246
EAD <unittitle>
VRA Core <vra: title>
Examples:
Tientsin What You Doing
<title>Alexander's ragtime band</title>
<title.alternative role= “firstline”>Oh, ma honey, oh, ma honey</title>
<title.alternative role= “refrain”>Come on and hear, come on and hear, Alexander's ragtime band</title>