| |
|
|
| |
Glossary of Terms for Metadata, Taxonomies
and Digital Libraries
If you have comments or
contributions, I would be pleased to hear them: wason@mindspring.com.
A glossary is a set of defined terms. Each term
has a single definition. The definition is relative to the
domain, in this case metadata, taxonomies and digital libraries.
Some terms may have multiple definitions according to a standard
dictionary. A glossary will use one of these definitions or will create
a separate definition.
- Aggregate
- To collect or gather together.
- Assessment
- "Any systematic method of obtaining evidence
from tests, examinations, questionnaires, surveys and collateral
sources used to draw inferences about characteristics of people,
objects, or programs for a specific purpose." from the Testing and
Assessment Glossary of Terms of QuestionMark: http://www.questionmark.com/us/glossary.htm.
(Contributed by Eric Shepherd).
- Authority List
- A controlled list of terms, names, phrases or
similar entries relative to a specific domain or scope. An authority
list may or may not contain defintions or other information about each
item. A glossary
or dictionary
contain defintions. Typically an authority list provides the standard
spelling and form of terms. Authority lists have defined managers,
e.g., the British Library's Name Authority List (http://www.bl.uk/services/bibliographic/authority.html).
Authority Lists may have structure, such as the hierarchy of the
Library of Congress' Subject Headings: http://www.loc.gov/cds/lcsh.html
- Category
- A selection, subset or subdivision of a classification.
See also Dr.
Tom’s Classification Guide
- Catalog
- "A database which contains descriptions of
information resources and their locations," Pipher,
Hayes & Davis (1998).
- Classification
- "1 : the act or process of classifying
2 a : systematic arrangement in groups or categories
according to established criteria; specifically : TAXONOMY
b : CLASS, CATEGORY "* Classification is both the systematic
arrangement of labels and the application of those labels. Dr. Tom’s
Classification Guide http://diicoe.disa.mil/coe/index.html
- COE
- Common Operating Environment: http://diicoe.disa.mil/coe/index.html.
See also Robin Cover's U.S. Federal CIO Council XML Working Group
Issues XML Developer's Guide: http://xml.coverpages.org/ni2002-01-16-a.html.
- Concept
- 2 : an abstract or generic idea generalized
from particular instances * See Note
on Subjects and Concepts.
- Collection
- A collection is an aggregation
of resources.
It may—or may not—all be packaged in one resource or content
package. It has no inherent navigational structure. It may have an
index and/or table of contents. Items
may be accessed on an individual basis. An example is a collection of
images. A collection may be housed in a library
or repository.
Most collections have a theme.
- Community
- A group with one or more common interests. The European SchoolNet has many learning communities supported by tools.
- Content
- A specific kind of resource
that is packaged into a usable state. A document is content.
- Content Package
- An assemblage of content items
and possible support files or headers. Support files may include a manifest
and/or sequencing files or sections.
-
- Data Set
- An organized collection
of data.
- Data Base
- A software structure that contains one or more data
sets. A data base that holds one data set may make the two seem to
be one and the same. A data base is an application or tool, it is not
its contents.
Question: do we consider the technical metadata to refer to the data
set or the data base?
- Dictionary
- An organized collection
of definitions and uses of terms. A term may have more than one
definition and more than one use. Contrast with glossary,
thesaurus
and taxonomy.
- Digital Library
- "[N]etworked information in the research and
education communities."
"[Digital] Libraries... [are] ...technological and social developments
that are fueled by information technology, bioinformatics, and
networked information." http://www.cni.org/,
http://www.diglib.org/dlfhomepage.htm.
DLESE is an example of a
digital library within the NSF's
NSDL.
- Directory
- "[An] alphabetical or classified list (as of
names and addresses)".* A directory is simple and flat, usually
providing a set of name - location pairs.
- DTD
- Document Type Definition
for describing an XML
structure: http://xml.coverpages.org/XMLSpecDTD.html.
XML-Schemas
allow richer descriptions at the price of complexity.
- Dublin Core (DC)
- The Dublin Core Metadata: http://dublincore.org/. A core set
of 15 elements or fields that are used to describe resources. It is a
widely used basic system of metadata
good for wide area searching. Often expressed as HTML metatags.
Sometimes "qualifiers" are added to refine the definitions of each
field. When in doubt, DC is a good starting place for metadata, as it
is widely used. DC is implemented with a variety of technical formats.
DC is compatible with Z39.50,
as DC defines descriptive labels and Z39.50 describes technical formats
and transmission protocols.
- Element
- A component in a metadata
structure. Each element has a token.
- Extension
- To go beyond the original form. For instance
for XML see Dr.
Tom’s Guide to IMS XML Extensions and Incorporations
-
- Field
- A hole in a metadata
structure into which appropriate values are placed. This is not to be
confused with an element.
The simplest metadata field is a metatag.
- Glossary
- A list of terms with single definitions
with respect to a particular domain. "[A] collection
of ... specialized terms with their meanings".* Contrast with dictionary,
thesaurus
and authority
list.
- Granularity
- Level of graininess or size.
- The IEEE
LTSC
- Institute
of Electrical and Electronic Engineers' Learning
Technology Standards Committee
: http://ltsc.ieee.org/
- IMS Global Learning
Consortium
- "IMS Global Learning Consortium, Inc. (IMS) [is
a non-profit corporation that] is developing and promoting open specifications
for facilitating online distributed learning activities such as
locating and using educational content, tracking learner progress,
reporting learner performance, and exchanging student records between
administrative systems. " These are busy people. http://www.imsglobal.org/
- ISO 11179
- Standard for describing data elements used in
databases and documents: http://www.diffuse.org/meta.html#ISO11179.
Sponsoring body: ISO/IEC JTC1/SC32. As an example implementation see
the Australian Institute of Health and Welfare Data Standards: http://wason.home.mindspring.com/TDW/Australian%20Institute%20of%20Health%20and%20Welfare.
- Item
- A unit or resource of arbitrary size.
-
Knowledge
Management (KM)
- An organizational process for converting
information into knowledge and making that knowledge accessible. It is
more a state of mind and a committment than a specific set of tools.
Tools are used, however. See Brint: http://www.brint.com/km/.
KM and online learning are merging in some enterprises.
- Learning Object
- The term "learning object" has a variety of
definitions. I prefer a broad definition: a package of one or more
resources that have educational utility. I am not going to try to
specify "educational utility"; that is in the eye of the beholder, so
to speak. Sometimes the definition states that the object must be
specific for learning. Sometimes there is a requirement for an
assessment component. The variety of definitions makes use of the term
"learning object" unreliable without reference to a specific defintion.
- LOM
- The Learning Object Metadata
standard of the IEEE
LTSC. The IEEE LTSC working group is #12: http://ltsc.ieee.org/wg12/index.html.
- Manifest
- A packing slip for a content
package. A manifest in a content package may contain or
point to metadata, define the resources
contained in the content package and define an intended organization of
the resources.
- Memetics
- The study of evolution in information
systems, c.f. Journal
of Memetics - Evolutionary Models of Information Transmission, http://pespmc1.vub.ac.be/MEMES.html,
http://aleph.se/Trans/Cultural/Memetics/or
An
Evolutionary Model of a Data System (Wason)
- Metadata
(bibliographic)
- Information that catalogs
or describes a resource. This may include metadata defining the
resource subject, format, location, ownership, authorship and so forth.
Some may validly consider metadata a resource. See also Dr. Tom’s Metadata
Guide, Wason, T., and Wiley, D. (2001). "Structured
Metadata Spaces".
- Metatag
- A named field.
Typically metatags are not structured, but are used as a simple
name-value pair datum. Metatags a normally associated with HTML as
<meta> elements in the header. See for example the Meta Tag Tutorial at WebDeveloper.com. Some
web search
engines use the metatags, some do not.
- MIME
- Multipurpose Internet Mail Extensions.
Generally accepted designations of file format types. MIME is an
example of a recommended standard
that has not yet been approved but is widely
used. This is an example of how the world works at "Web speed.". If you
wait for the formal standard before proceeding, you may be left at the
back of the pack. On the other hand, if you select a recommendation
that is either not approved or undergoes significant change before
approval, you may have a lot of expensive revision to do. Enterprises
participate in standards groups to try to discern the future.
- NSDL
- The National Science Foundation's National
SMET Digital Library
Initiative: NSF
NSDL.
- OAI
- Open Archives Initiative:
http://www.openarchives.org/
The OAI has developed a metadata harvesting protocol. This is used in
the NSDL
and elsewhere.
- OKI
- Open Knowledge Initiative: http://web.mit.edu/oki/
- Ontology
- 1 : a branch of metaphysics concerned with the
nature and relations of being * See Note
on Subjects and Concepts.
- Perceptual
Coupling
- Matching the nature or structure of information
to the precognitive processing of the human perceptual system to promote the rapid
recognition or meaning of information. Perceptual coupling has a
variety of applications generally involving situation management.
- Qualifier
- Refines or limits the definition of a metatag.
DC
provides qualifiers.
-
- RDF
- Resource Description Framework
of the W3C: http://www.w3.org/RDF/.
Uses XML
for a self-describing data structure system.
- Registry
- Information that defines what is stored in one
or more repositories.
It may contain specialized vocabularies and taxonomies. A "registered"
repository may have to adhere to certain rules in order to be
registered. See for example: http://diides.ncr.disa.mil/xmlreg/user/index.cfm.
- Repository
- A storage site for resources,
including metadata.
A repository may house a collection.
Contrast with a registry
or a collection.
- Resource
- Any asset that can be bounded in some
manner. It may be a data stream, but its parameters are well defined. Content,
data
sets and data streams are resources. A tool may be considered a
resource. A source of supply or support.* Stuff.
- SCORM
- Sharable Content Object Reference
Model http://www.adlnet.org/
- Semantics
- The study of meanings: a : the historical and
psychological study and the classification of changes in the
signification of words or forms viewed as factors in linguistic
development.*
- Semantic Web
- The gleam in Tim Berners-Lee's eye
for a unified Web without metadata: http://www.w3.org/2001/sw/
- SMET
- An acronym for Science, Mathematics,
Engineering and Technology,
sometimes with "
Education" included at the end.
- Specification
- "a detailed precise presentation of something
or of a plan or proposal for something -- usually used in plural".* A
specification is created by an enterprise, as opposed to a standard,
which is maintained by an official body. For example the IMS
Global Learning Consortium and Intel
develop specifications. Examples are the IMS Meta-Data Specification
and the Intel
Audio Codec Specifications.
- Standard
- "[S]omething set up and established by authority
as a rule for the measure of quantity, weight, extent, value, or
quality ".* IEEE
and ISO
develop standards. An example is the IEEE LTSC
Learning Object Metadata standard (under ballot). Recommended, but
not yet approved, standards such as MIME
are widely used. Not yet approved standards may be designated RFC (Request for Comments).
- Subject
- 3 a : a department of knowledge or learning *
See Note
on Subjects and Concepts.
-
- Tag
- See Metatag.
- Tautology
- A self referential definition or system of
logic. See tautology.
- Taxonomy
- A structured vocabulary that embodies
relationships among terms. Each term is contained in a taxon.
The most common relationship is a hierarchy (tree structure). Other
forms may include cross linking and poly-hierarchical structure. A vocabulary
is a single level taxonomy. See also Dr. Tom’s
Taxonomy Guide. Some major taxonomies are: MARC,
LCSH, MeSH. For a long list of many taxonomies in many
domains see: Controlled vocabularies, thesauri and classification
systems available in the WWW. DC Subject (http://www.lub.lu.se/metadata/subject-help.html)
Originally referred to biological classifications: "[O]rderly classification
of plants and animals according to their presumed natural
relationships".* Compare with thesaurus.
- Taxon
- A node in a taxonomy.
A taxon may contain a term, and reference label and links to other
taxons.
- Thesaurus
- A collection
of terms, their definitions (that may be multiple) and relationships to
other terms. Contrast with taxonomy,
dictionary
and glossary.
" [A] list of subject headings or descriptors usually with a
cross-reference system for use in the organization of a collection of
documents for reference and retrieval".*
- Token
- An element's
name or set of characters that are the logical equivalent of a
name.
- Vocabulary
- A list of terms, sometimes with definitions. A
single level taxonomy.
A vocabulary can be considered a very localized glossary. Or not.
-
- XML
- eXtensible Markup Language
of the World Wide Web Consortium: http://www.w3.org/TR/REC-xml.
A method for serializing structured data for use on the Internet. This
is not yet an official standard, but is in a "recommended" state. See
also http://www.xml.org/.
- XML-Schema
- A control system for XML: http://www.w3.org/XML/Schema.
This is not yet an official standard,
but is in a "recommended" state. See Dr. Tom’s Guide to
XML-Schema , IMS XML bindings, DTDs and Examples.http://www.imsglobal.org/xsd/.
- XSL
- The Extensible Stylesheet Language
for creating different views of XML documents: http://www.w3.org/Style/XSL/.
- XSLT
- The Extensible Stylesheet Language
for Translating XML
documents to different formats: http://www.w3.org/Style/XSL/.
This is part of the XSL
specification.
- Z39.50
- " A technical data format and
transmission protocol specification. "Z39.50" refers to the
International Standard, ISO 23950: ’Information Retrieval (Z39.50):
Application Service Definition and Protocol Specification‘, and to
ANSI/NISO Z39.50. The Library of Congress is the Maintenance Agency and
Registration Authority for both standards, which are technically
identical (though with minor editorial differences).
"The standard
specifies a client/server-based protocol for searching and retrieving
information from remote databases.
" http://www.loc.gov/z3950/agency/
This glossary has been prepared by
Thomas D. Wason. Some definitions are taken from the Merriam-Webster
Online Dictionary at http://www.m-w.com/dictionary.htm. If
multiple definitions are available, the most appropriate one is
selected. Entries from the Merriam-Webster Online Dictionary are
designated with an asterisk (*).
|
|
|
| |
|
|
|