W3C DCAT (and DCAT-AP) defines a property to indicate the thematic scope of datasets. This is, dcat:theme (sub-class of dcterms:subject). The DCAT-AP recommends the use of the 13-term Data Theme Taxonomy defined by the EU Publications Office: http://publications.europa.eu/resource/authority/data-theme
|AGRI||Agriculture, fisheries, forestry and food|
|ECON||Economy and finance|
|EDUC||Education, culture and sport|
|GOVE||Government and public sector|
|JUST||Justice, legal system and public safety|
|REGI||Regions and cities|
|SOCI||Population and society|
|TECH||Science and technology|
A dataset may include none or several dcat:theme properties, so a dataset composed of different data topics may be described using more than one dcat:theme properties. For instance, a registry of persons and vehicles would be classified in the categories TRAN and SOCI.
No discussion on the use of dcat:theme and the 13-term taxonomy for datasets, but this high-level classification seems not being enough to classify master data.
Proposals/options to discuss:
- Use only this 13-term taxonomy
- Create a controlled fine-grained scheme to classify those topics (aligned with Eurovoc)
- Use Eurovoc terms directly to classify the topics.
- Use dcterms:type to classify Datasets directly using other schemes.
- Create a new sub-property of dcterms:subject/dcat:theme to classify the fine-grained elements.
This issue included also the discussion of how to describe the elements of dasets. So, another issue was created to deal with this topic. See also [Issue #05] Content of Datasets (DataElement).
Comments are welcomed!