DEFINITION:
Data Publication Component ABB is an Application Component implementing the functionality of making data available for common use.
Source: DAMA
(http://www.dama.org/)
INTEROPERABILITY SALIENCY:
IoP Dimension: Structural IoP
The Data Pubblication Component ABB is salient for technical interoperability because it provides the implementation of the functionalities to make public data freely available for use and reuse by others unless restriction aoply as stated in the EIF recommendation n.2: "Publish the data you own as open data unless certain restrictions apply."
EXAMPLES:
The following implementation is an example on how this specific Architecture Building Block (ABB) can be instantiated as a Solution Building Block (SBB):
CKAN
CKAN is a data management system that makes data accessible by providing tools to streamline publishing, sharing, finding and using data.
This is a tool for making open data websites. It helps you manage and publish collections of data. It is used by national and local governments, research institutions, and other organizations who collect a lot of data.
Once your data is published, users can use its faceted search features to browse and find the data they need, and preview it using maps, graphs and tables – whether they are developers, journalists, researchers, NGOs, citizens, or even your own staff.
CKAN is open source and free software, with an active community of contributors who develop and maintain its core technology. CKAN is modified and extended by an even larger community of developers who contribute to a growing library of CKAN extensions.
Source:
(https://ckan.org/)
The following implementation is an example on how this specific Architecture Building Block (ABB) can be instantiated as a Solution Building Block (SBB):
Globus
Globus is a leading provider of research data management software application and platform services.
Globus publication capabilities are delivered through a hosted service. Published data is stored on campus, institutional, and group resources that are often managed and operated by different administrators. To associate storage resources with a data collection simply use Globus shared endpoints and associate them with the data repository to publish. Published datasets are organized by "communities" and their member "collections". Globus users can create and manage their own communities and collections through the data publication service. A collection enables the submission of datasets with policies regarding access.
A dataset comprises data and metadata. Policies can be set on communities or collections to manage:
- Metadata (schema, requirements)
- Access control (user and group based)
- Curation workflow
- Submission and distribution licenses
- Storage
Datasets undergo curation based on a workflow defined by the community that will publish the data. Workflows may be customized by each community to capture their specific metadata and to reflect the community's review process. After the dataset is published, it is discoverable using a faceted search that allows the researcher to progressively filter results and rapidly focus in on the data of interest. The data may then be transferred to a Globus endpoint where the investigator can inspect and further process the data.
Data publication is a premium feature available with a Globus Subscription
Source: (https://www.globus.org/data-publication)
|
|
ID | ABB209 |
dct:type | eira:DataPublicationComponent |
dct:publisher | |
dct:modified | |
eira:status | [ Exists | Development planned ] |
eira:data_quality_level | [ Excellent (90-100%) | Very good (75-89,9%) | Fair (50-74,9%) | Poor (0-49,9%) ] |
eira:data_quality_score | |
eira:reusability_level | [ Excellent (90-100%) | Very good (75-89,9%) | Fair (50-74,9%) | Poor (0-49,9%) ] |
eira:reusability_score | |
eira:iop_level | [ Excellent (90-100%) | Very good (75-89,9%) | Fair (50-74,9%) | Poor (0-49,9%) ] |
eira:iop_score | |
eira:actual_reuse | [ Already reused | Reuse planned | No] |
eira:view | Technical view - Infrastructure |