Asset Description Metadata Schema Specification

ADMS - Asset Description Metadata Schema

Draft Specification v0.6a for Community Consultation, 2011-03-17

Authors:
Adam Arndt (National IT and Telecom Agency, Denmark)
Renke Fahl-Spiewack (]init[ AG, Berlin, Germany)
Nikos Loutas (DERI - Digital Enterprise Research Institute, NUI Galway, Ireland)
Vassilios Peristeras (Interoperability Solutions for European Public Administrations (ISA), Directorate-General for Informatics, European Commission)
Sebastian Sklarß (]init[ AG, Berlin, Germany)
Sofia Zapounidou (University of Macedonia, Greece)

Reviewers:
Christoph Böhm (Hasso-Plattner Institut, Universität Potsdam, Germany)
Jane Coxon (Cabinet Office, OGCIO & SIRO, United Kingdom)
Makx Dekkers (DCMI - Dublin Core Metadata Initiative, Spain)
Josef El-Rayes (]init[ AG, Berlin, Germany)
Dieter Fensel (University of Innsbruck, Austria)
Aldo Gangemi (Head of the Semantic Technology Lab, Institute of Cognitive Sciences and Technologies, ISTTC-CNR, Italy)
Michael Hausenblas (DERI - Digital Enterprise Research Institute, NUI Galway, Ireland)
Tommi Karttaavi (Association of Finnish Local and Regional Authorities, Finland)
Johannes Keizer (FAO/UN, Rome, Italy)
Horst Krämer (]init[ AG, Brussels, Belgium)
Lars Lämmerhirt (BIT 7 / Bundesverwaltungsamt, Köln, Germany)
Christian Lange (BIT 7 / Bundesverwaltungsamt, Köln, Germany)
Thomas Lee (University of Hong Kong, Hong Kong)
Stephan Meyer (]init[ AG, Berlin, Germany)
Andras Micsik (CORES Registry, Hungary)
Marc Musen (Stanford University, United States)
Felix Naumann (Hasso-Plattner Institut, Universität Potsdam, Germany)
H. Shah Nigam (Stanford University, United States)
Richard Parent (Ministère des services gouvernementaux, Canada)
Priit Parmakson (Estonian Informatics Centre, Estonia)
Klaus Reichling (]init[ AG, Berlin, Germany)
Peter Reichstädter (Bundeskanzleramt, Austria, Austria)
Thomas Rössler (W3C, Luxembourg)
Felix Sasaki (DFKI, W3C, Germany)
Konstantinos Tarabanis (University of Macedonia, Greece)
Mike Thacker (ESD Limited Standards, United Kingdom)
Uuno Vallner (Ministry of Economic Affairs and Communications, Estonia)
Emile van de Maas (Stelselcatalogus, The Netherlands)
Andrew Waters (CLMS - Code List Management Service, United Kingdom)
Ralph Westenberg (Stelselcatalogus, Logius, The Netherlands)
Interoperability Framework Coordination Group (Interoperability Framework Coordination Group, Hong Kong)

back to top


Abstract

The Asset Description Metadata Schema (ADMS) tackles the issue of a federation of repositories for semantic assets to make decentralised resources available through a single point of access. ADMS can be used as a common language to describe semantic artefacts stored in separate systems. It can also provide some directions for new initiatives to create similar repositories. In this way, ADMS will facilitate the federated use of semantic resources, especially of those used for and by public administrations.


Table of Contents

1 Introduction
2 Scope and Definitions
    2.1 Scope and intended usage
    2.2 ADMS Architecture
    2.3 Definitions
3 ADMS - Roadmap
4 ADMS - Design principles
    4.1 ADMS development: Keep it as simple as possible
    4.2 Guidelines of ADMS development
    4.3 ADMS - Use cases
5 ADMS - Asset Description Metadata Schema
    5.1 ADMS Overview
    5.2 ADMS Core Elements
    5.3 ADMS Extension Elements
6 Acknowledgements


1 Introduction

Data models, taxonomies, ontologies, code lists and semantic data exchange formats are a key resource for creating interoperable data exchange. At the time being several national repositories of those "semantic assets" do exist. These repositories differ in scope and in target group they are addressing. They are implemented using different technologies and expose different user interfaces to the end user.

However the semantic content they include can often be reused even bypassing the domain they were originally designed for - once these solutions can be found and be identified as relevant. To enable interoperability of different repositories this document specifies the Asset Description Metadata Schema (ADMS).

The Asset Description Metadata Schema (ADMS) is being developed to enable the federation of repositories while maintaining unique functionalities and strengths of these repositories.

The core of the ADMS contains metadata to enable a federated search of assets.

The focus is in particular on assets that can enable data exchange in cross border or cross domain eGovernment applications. The metadata schema has to be restrictive enough to avoid major semantic conflicts and to allow for the aim of federating distributed resources. The same, the concept of such a schema for describing assets has to be flexible and extensible enough to allow various repositories to use and to participate in the federation of repositories.

ADMS is not designed to be a data model standard for repositories. The main reason to set up ADMS is to establish widespread accepted asset descriptions and agreement on a common metadata concept allowing for the combination of existing metadata schemas and to enable a federated search.

back to top


2 Scope and Definitions

2.3 Definitions

Asset
A container dedicated to group artefact types that ease up data exchange by declaring semantics. Asset may contain a large variety of these solutions such as data models, code lists, taxonomies, dictionaries, thesauruses, ontologies
Asset Provider
The natural person being the maintainer of an asset in the federated repository
Asset Owner
not yet commonly agreed upon in community
Federated Search
not yet commonly agreed upon in community
Vocabulary
Following definitions declared in "Library terminology informally explained" vocabularies for ADMS’s purpose can be divided into
(a) Metadata Element Set that provides elements to be used by others to describe elements (like dc:creator, foaf:organisation) while
(b) Value Vocabulary may represent a "closed list" of allowed values for an element (refered to as Codelist in many cases). In contrast to values from value vocabularies, a metadata ele-ment may also contain literals or identifiers (including URIs) as values.
Repository
not yet commonly agreed upon in community
Repository Federation
not yet commonly agreed upon in community
Registry
not yet commonly agreed upon in community
Search
not yet commonly agreed upon in community

back to top


back to top


4 ADMS - Design principles

These design principles had been sent out for comments and have been revised accordingly.

4.1 ADMS development: Keep it as simple as possible

The following list describes main criteria to be considered by ADMS development.

  1. ADMS has to be clearly defined to access the federation with low barriers of entry.
  2. ADMS has to be flexible enough to cover various types of repositories and future requirements.
  3. ADMS has to enclose metadata to describe different types of artefacts (assets).
  4. ADMS has to contain a small number of elements and attributes to enable a mapping for almost any existing repository.
  5. ADMS has to define sufficient elements and attributes to enable mappings to other available repositories’ description schemas.
  6. ADMS development has to be based on existing metadata standards and vocabularies like DC, DCT, FOAF, GEONAMES, DOAP, voiD, DCAT etc.
  7. ADMS is a semantic concept with different ways of technical implementations. Technical implementations of the semantic concept may comprise RDF-Schema, XML-Schema and a UML class diagram representation.
  8. ADMS has to be designed in a way that changes (if necessary) will not affect current state of integration and the functionality of the federated search. Costs of maintenance and change have to be minimized.
  9. Future modifications to ADMS take place guided and documented by an open and transparent change management process.
  10. In a common spirit of willingness to share results and deliverables of this work, the Asset Development Metadata Schema will be published free to use by anyone under the European Union Public Licence version 1.1.

4.3 ADMS - Use cases

At the moment we see the use case of "federated search". Please feel free to provide further use cases.

ADMS derived from the business need of being able to do a "federated search" in existing repositories.

The scenario can be described as follows:

Working on a new eGovernment project, a user is in the need of a specific code list on - let’s say - a list of delicts for the European Arrest Warrant project.

Without ADMS:
The user will perform a search engine query and/or visit the Danish, Greek and European Repository scouting for "Internal Security", "Justice", "Codelist", "authority table", or "name value pairs".

While browsing the different repositories, the user will be faced with a variety of

  1. different user interface designs,
  2. different classification schemas for domains
  3. different user logins, passwords and usage rights
  4. different metadata

With ADMS:
The user enters one of the federated ADMS-enabled repositories. He will use the commonly agreed taxonomy to navigate to the "Justice" domain. He then defines the artefact he is looking for - e.g. "codelist".

After a few seconds, the repository of his choice provides a result list, based on findings due to a local and a federated search in participating repositories.

The user will be able to do with one initial effort several queries into a number of repositories

  1. using the same domain taxonomies
  2. getting back a standardised set of results

ADMS hereby defines

  1. what to expose to federation partners as a predefined subset of metadata (what is the commonly agreed core?)
  2. how to expose the metadata (which vocabularies to choose).

Metadata that will form the ADMS minimum subset, the so called ADMS Core will only contain elements (classes and attributes) that are needed for most frequent search cases.

back to top


5 ADMS - Asset Description Metadata Schema

5.1 ADMS Overview

Figure 4 visualises the initial choice of vocabularies that are elaborated more in detail in the tables in the following sections.

Visualisation of ADMS Asset and Release
Figure 4: Visualisation of ADMS Asset and Release

Taking into account the draft status and the feedback provided so far by the ADMS community, only a few elements lack of coverage by already existing vocabularies. Currently we think that the ADMS community will be in charge of developing the following vocabularies:

  • adms:ArtefactType
    The type of artefact included in an assets' release
    Examples might be "UML Specification", "Ontology", "Taxonomy", "Codelist", etc.
  • adms:Status
    The current status of an asset or release
    Examples might be "under development", "published", or "withdrawn"
  • adms:Domain
    Domain(s) which are covered by the asset
    Examples might be "Justice", "Internal Security", "Economy->Trade", "Social Affairs->Education", etc.
  • adms:QualityLevel
    The quality level achieved by the asset
    Examples might be "draft", "registered", "final", etc.

For some elements an appropriate vocabulary could not yet be found, needs your initial suggestions or cannot be represented by existing vocabularies:

  • RepositoryOrigin
    The name of the repository this asset is stored in
  • ReleaseNotes
    Information regarding a specific release (changes to previous versions, etc.)
  • FileFormat
    The file extension / file format files of a release are containing e.g. XML, PDF, OWL, etc.
  • AssetImage
    An image to represent the asset (something like an icon or logo)
  • Tags
    comma-separated keywords to make the asset traceable

The following sections are used to define the elements of ADMS and its assignments towards existing or to-be-built vocabularies.

The following defines on a conceptual level

  1. Metadata of repositories to be exposed for federation in general,
  2. their definition (semantic statement) and
  3. Its suggested assignment to ADMS Core and ADMS Extension.
  4. Multiplicity of elements (optional, mandatory, multiple occurrences allowed?)

The element order within the ADMS specification follows the logic:

  1. mandatory Core elements
  2. optional Core elements
  3. Extension elements

Additional information is given for each element

  1. Whether it will be designed as an attribute (A) or a class (C)
  2. What vocabulary the metadata element can be assigned to
    1. Name of the vocabulary
    2. Definition of the term as defined in its vocabulary
    3. URI (if applicable)
  3. What values for this metadata element may be allowed (value vocabulary)
    1. Name of the value vocabulary
    2. Definition of the term as defined in its vocabulary
    3. Data type: URI (if applicable) / Literal / Code list

LOD vocabularies that will be reused in ADMS:

DC:spatial information on an asset's provenance or coverage might be coded using GEONAMES Ids.

5.2 ADMS Core Elements

No. 1.3
Metadata name Asset:Description
Metadata: Semantic Statement An English description of the asset for example comprising its content, use cases, structure, etc.
(Core / Extension) Core
Multiplicity 1
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set description
Definition An account of the resource. Description may include but is not limited to: an abstract, a table of contents, a graphical representation, or a free-text account of the resource.
URI http://purl.org/dc/terms/description

back to top


No. 1.6
Metadata name Asset:Domain
Metadata: Semantic Statement Domains which are covered by the asset (e.g. health, justice, economy, etc.)
(Core / Extension) Core
Multiplicity 1..*
Data Element
Attribute (A) or Class (C) C
Prefix dc:
Title of Data Element Set subject
Definition The topic of the resource.
URI http://purl.org/dc/terms/subject

back to top


No. 1.8
Metadata name Asset:License
Metadata: Semantic Statement The name of a license specifying the terms of use for an asset
(Core / Extension) Core
Multiplicity 1
Data Element
Attribute (A) or Class (C) C
Prefix dc:
Title of Data Element Set license
Definition A legal document giving official permission to do something with the resource.
URI http://purl.org/dc/terms/license

back to top


No. 1.2
Metadata name Asset:Name
Metadata: Semantic Statement The name/title of the asset / a headline
(Core / Extension) Core
Multiplicity 1
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set title
Definition A name given to the resource.
URI http://purl.org/dc/terms/title

back to top


No. 1.5
Metadata name Asset:Owner
Metadata: Semantic Statement The intellectual property right owner for this asset (person or organisation)
(Core / Extension) Core
Multiplicity 1..*
Data Element
Attribute (A) or Class (C) C
Prefix foaf:
Title of Data Element Set agent
Definition An agent (eg. person, group, software or physical artifact).
URI http://xmlns.com/foaf/spec/#term_Agent

back to top


No. 1.9
Metadata name Asset:RepositoryOrigin
Metadata: Semantic Statement The name of the repository this asset is stored in
(Core / Extension) Core
Multiplicity 1
Data Element
Attribute (A) or Class (C) A
Prefix ?
Title of Data Element Set ?
Definition ?
URI -

back to top


No. 1.4
Metadata name Asset:RepresentedCountry
Metadata: Semantic Statement Countries involved in the development and/or countries in the scope of the asset
(Core / Extension) Core
Multiplicity 1..*
Data Element
Attribute (A) or Class (C) C
Prefix dc:
Title of Data Element Set spatial
Definition Spatial characteristics of the resource.
URI http://purl.org/dc/terms/spatial

back to top


No. 1.7
Metadata name Asset:Status
Metadata: Semantic Statement Any information about the asset's publication state
(Core / Extension) Core
Multiplicity 1
Data Element
Attribute (A) or Class (C) A
Prefix adms:
Title of Data Element Set status
Definition The current state of an asset or release
URI http://www.semic.eu/ns/adms1/status

back to top


No. 1.1
Metadata name Asset:URI
Metadata: Semantic Statement A URI referring to an Asset.
(Core / Extension) Core
Multiplicity 1
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set identifier
Definition An unambiguous reference to the resource within a given context
URI http://purl.org/dc/terms/identifier

back to top


No. 2.1
Metadata name Release:AccessURL
Metadata: Semantic Statement an URL where the current release of an asset can be accessed.
(Core / Extension) Core
Multiplicity 1
Data Element
Attribute (A) or Class (C) A
Prefix dcat:
Title of Data Element Set accessURL
Definition describes availability of a dataset. This can be a direct download link, an HTML page containing a link to the actual data, Feed, Web Service etc. the semantic is determined by its domain (Distribution, Feed, Web-Service, Download).
URI http://www.w3.org/ns/dcat#accessURL

back to top


No. 2.2
Metadata name Release:DocumentationLanguage
Metadata: Semantic Statement The language(s) in which the asset is documented
(Core / Extension) Core
Multiplicity 1..*
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set language
Definition A language of the resource.
URI http://purl.org/dc/terms/language

back to top


No. 2.3
Metadata name Release:Type
Metadata: Semantic Statement e.g. code list, ontology, taxonomy, etc.
(Core / Extension) Core
Multiplicity 1..*
Data Element
Attribute (A) or Class (C) A
Prefix adms:
Title of Data Element Set artefacttype
Definition The type of artefact included in an assets' release
URI http://www.semic.eu/ns/adms1/artefacttype

back to top


No. 1.10
Metadata name Asset:HasRelease
Metadata: Semantic Statement A pointer to an adms:release object. There might exist none, one or multiple releases for an asset
(Core / Extension) Core (Optional)
Multiplicity 0..*
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set relation
Definition A related resource.
URI http://purl.org/dc/terms/relation

back to top


No. 1.18
Metadata name Asset:Tags
Metadata: Semantic Statement One or more keywords to make the asset traceable
(Core / Extension) Core (Optional)
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix ?
Title of Data Element Set ?
Definition ?
URI -

back to top


No. 2.5
Metadata name Release:ReleaseDate
Metadata: Semantic Statement The date the release of an asset has been published
(Core / Extension) Core (Optional)
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set dateSubmitted
Definition Date of submission of the resource.
URI http://purl.org/dc/terms/dateSubmitted

back to top


No. 2.8
Metadata name Release:ReleaseName
Metadata: Semantic Statement a repository-independent release reference provided by the owner of an asset (eg. FR_Person V1.2)
(Core / Extension) Core (Optional)
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set title
Definition A name given to the resource.
URI http://purl.org/dc/terms/title

back to top


No. 2.4
Metadata name Release:ReleaseNotes
Metadata: Semantic Statement Information regarding a specific release (changes to previos versions, etc)
(Core / Extension) Core (Optional)
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) ?
Prefix ?
Title of Data Element Set ?
Definition ?
URI -

back to top


No. 2.7
Metadata name Release:Status
Metadata: Semantic Statement Publication status of a release (published, withdrawn, etc)
(Core / Extension) Core (Optional)
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix adms:
Title of Data Element Set status
Definition The current state of an asset or release
URI http://www.semic.eu/ns/adms1/status

back to top


No. 2.6
Metadata name Release:UpdateDate
Metadata: Semantic Statement The date of last change of a release
(Core / Extension) Core (Optional)
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set modified
Definition Date on which the resource was changed.
URI http://purl.org/dc/terms/modified

back to top


5.3 ADMS Extension Elements

No. 1.12
Metadata name Asset:Abstract
Metadata: Semantic Statement A short summary of the asset's content. Might be in native language (beside English description)
(Core / Extension) Extension
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set abstract
Definition A summary of the resource.
URI http://purl.org/dc/terms/abstract

back to top


No. 1.14
Metadata name Asset:Image
Metadata: Semantic Statement An image to represent the asset (something like an icon or logo)
(Core / Extension) Extension
Multiplicity 0..*
Data Element
Attribute (A) or Class (C) ?
Prefix ?
Title of Data Element Set ?
Definition ?
URI -

back to top


No. 1.11
Metadata name Asset:LicenseURI
Metadata: Semantic Statement A URI refering to the text of a underlying version of a license
(Core / Extension) Extension
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix ?
Title of Data Element Set ?
Definition ?
URI -

back to top


No. 1.17
Metadata name Asset:LinkToWebsite
Metadata: Semantic Statement A link to another (maybe original) web resource of the asset
(Core / Extension) Extension
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix foaf:
Title of Data Element Set homepage
Definition A 'homepage' in this sense is a public Web document, typically but not necessarily available in HTML format. The page has as a topic the thing whose homepage it is. The homepage is usually controlled, edited or published by the thing whose homepage it is; as such one might look to a homepage for information on its owner from its owner. This works for people, companies, organisations etc.
URI http://xmlns.com/foaf/spec/#term_homepage

back to top


No. 1.16
Metadata name Asset:Publisher
Metadata: Semantic Statement The person or organisation in charge of managing an asset's representation in the repository
(Core / Extension) Extension
Multiplicity 0..*
Data Element
Attribute (A) or Class (C) C
Prefix dc:
Title of Data Element Set publisher
Definition An entity responsible for making the resource available.
URI http://purl.org/dc/terms/description

back to top


No. 1.13
Metadata name Asset:QualityLevel
Metadata: Semantic Statement Any indicator for the quality for an asset (e.g. current level in a clearing process)
(Core / Extension) Extension
Multiplicity 0..1
Data Element
Attribute (A) or Class (C) A
Prefix adms:
Title of Data Element Set quality level
Definition
URI http://www.semic.eu/ns/adms1/qualitylevel

back to top


No. 1.15
Metadata name Asset:RelatedAsset
Metadata: Semantic Statement Any connection to related assets
(Core / Extension) Extension
Multiplicity 0..*
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set relation
Definition A related resource.
URI http://purl.org/dc/terms/relation

back to top


No. 2.9
Metadata name Release:FileFormat
Metadata: Semantic Statement The file extension / file format files of a release are containing e.g. XML, PDF, OWL, etc.
(Core / Extension) Extension
Multiplicity 0..*
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set format
Definition The file format, physical medium, or dimensions of the resource.
URI http://purl.org/dc/terms/format

back to top


No. 2.10
Metadata name Release:RelatedRelease
Metadata: Semantic Statement A release related to this release (previous, following)
(Core / Extension) Extension
Multiplicity 0..*
Data Element
Attribute (A) or Class (C) A
Prefix dc:
Title of Data Element Set relation
Definition A related resource.
URI http://purl.org/dc/terms/relation

back to top


6 Acknowledgements

Thanks to all members of the SEMIC.EU ADMS Community.

Special thanks to all those who reviewed the first draft of this document.

back to top