Information technology — Metadata registries (MDR) — Part 7: Metamodel for data set registration

This document specifies an extension to a Metadata Registry (MDR), as specified in ISO/IEC 11179-3:2013, Clauses 5 to 11, in which metadata describing data sets (collections of data available for access or download in one or more formats) can be registered. Since a set can contain a single element, this document also enables the recording of metadata about a single data value. The registered metadata provides information about the data set, including its provenance and quality.

French title: Technologies de l'information — Registres de métadonnées (RM) — Partie 7: [title missing]

General Information

Status: Withdrawn
Publication Date: 18-Dec-2019
Current Stage: 9599 - Withdrawal of International Standard
Completion Date: 16-Jan-2023

ISO/IEC 11179-7:2019 - Information technology -- Metadata registries (MDR)
English language
39 pages

Standards Content (Sample)

INTERNATIONAL STANDARD
ISO/IEC 11179-7
First edition
2019-12
Information technology — Metadata registries (MDR) —
Part 7:
Metamodel for data set registration
Reference number
ISO/IEC 11179-7:2019(E)
© ISO/IEC 2019

COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2019
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting
on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address
below or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Fax: +41 22 749 09 47
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland

Contents
Foreword
Introduction
1 Scope
2 Normative references
3 Terms, definitions and abbreviated terms
3.1 Terms and definitions
3.2 Abbreviated terms
4 Conformance
4.1 Overview of conformance
4.2 Conformance by clause
4.3 Standard profiles
4.3.1 General
4.3.2 Data Set Registry profile
4.3.3 Extended Data Set Registry profile
5 Data Set package
5.1 Data Set metamodel region
5.1.1 Overview of the Data Set metamodel region
5.1.2 Classes in the Data Set metamodel region
5.1.3 Association Classes in the Data Set metamodel region
5.1.4 Associations in the Data Set metamodel region
6 Relationship to other parts of the ISO/IEC 11179 series
6.1 Relationship to ISO/IEC 11179-3
6.1.1 General
6.1.2 Use of the common facilities types specified in ISO/IEC 11179-3
6.2 Relationship to ISO/IEC 11179-6
Annex A (informative) Alphabetical list of terms and designations
Annex B (informative) Examples of data set registration
Annex C (informative) Additional standard conformance profiles
Annex D (informative) A complete view of the concept of “provenance”
Bibliography

Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that
are members of ISO or IEC participate in the development of International Standards through
technical committees established by the respective organization to deal with particular fields of
technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other
international organizations, governmental and non-governmental, in liaison with ISO and IEC, also
take part in the work.
The procedures used to develop this document and those intended for its further maintenance are
described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for
the different types of document should be noted. This document was drafted in accordance with the
editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives).
Attention is drawn to the possibility that some of the elements of this document may be the subject
of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent
rights. Details of any patent rights identified during the development of the document will be in the
Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents) or the IEC
list of patent declarations received (see http://patents.iec.ch).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to the
World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT), see
www.iso.org/iso/foreword.html.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 32, Data management and interchange.
A list of all parts in the ISO/IEC 11179 series can be found on the ISO website.
Any feedback or questions on this document should be directed to the user’s national standards body. A
complete listing of these bodies can be found at www.iso.org/members.html.

Introduction
Many data sets are now generally available. These may be part of a government-led open data initiative,
or may be data sets that are used within and across organizations for commercial, scientific or academic
purposes.
There is a requirement for metadata about these data sets to be readily available to enable the consistent
and appropriate use of data and information, and to prevent duplication of work. Having an enhanced
Metadata Registry where metadata that describes data sets is registered will facilitate the discovery of
appropriate data sets.
In ISO/IEC 11179-3:2013, Clauses 5 to 11, the structure of a Metadata Registry is specified in the form
of a conceptual data model. The Metadata Registry is used to keep information about data elements
and associated concepts, such as “data element concepts”, “conceptual domains” and “value domains”.
Generically, these are all referred to as “metadata items”. Such metadata are necessary to clearly
describe, record, analyse, classify and administer data. This document provides a specification of the
extensions to the registry metamodel specified in ISO/IEC 11179-3:2013, Clauses 5 to 11 to enable the
registration of metadata about datasets.
When considering data and metadata, it is important to distinguish between types of data/metadata,
and instances of these types. ISO/IEC 11179-3:2013, Clauses 5 to 11 specifies the types of metadata
objects that form the structure of a basic Metadata Registry. Clause 5 of this document specifies the types of metadata
objects that form an extension to that structure so that the whole structure provides facilities to keep
information about data sets. A Metadata Registry will be populated with instances of these metadata
objects (metadata items), which in turn define, for example, types of data in an application database,
or, in the case of this document, data sets that are held elsewhere. Such data sets could, for example, be
made available over the internet or be included as a table within a word processing document.
The facilities described in this document, together with those described in ISO/IEC 11179-3:2013,
Clauses 5 to 11, provide the ability to record the following data set metadata:
— one or more unique identifiers for the data set;
— the designation or title of the data set;
— a definition or description of the data set that provides sufficient detail to enable a user to quickly
understand whether this data set is of interest;
— the date the data set was issued and, if appropriate, the date that subsequent versions of the data set
were, or will be, issued;
— the access level and rights associated with the data set;
— the provenance of the data set, i.e., information about the place and time of the origin of the data set,
its ownership and the method of the generation of the set;
— a set of keywords or tags that help to explain the data set;
— the language or languages used to describe the data set;
— the temporal and spatial coverages of the data set;
— the accrual periodicity of the data set, i.e., the frequency at which new, revised or updated versions
of the data set are made available;
— the details of the distributions of the data set, including the identifier, the title, a description, the
media type or file format, the size, the issue date, languages, access level and rights and access and
download URLs;
— annotations drawn from a concept system, such as an ontology, to describe the theme or category of
the data set or the collection of data sets;
— the details of any contexts, such as a programme, project or business area that use the data set;
— the details of any quality assessments made in respect of the data set;
— any additional descriptions of the data set, including:
— any data elements that are already registered that are included in the data set;
— any information models that describe the structure of the information in the data set;
— any documents which describe aspects of the data set, such as technical information about the
data set and/or developer documentation such as a graphical representation of the data model
of the data set;
— the details of any superset/subset hierarchies containing the data set;
— the details of any replacement data set if this data set is superseded;
— the details of any collection of data sets of which this data set is a part, including the identifiers,
the designation or title, a definition or description, issue dates, languages, access level, rights, the
spatial coverage, the provenance and any quality assessments of the collection.
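As a rough illustration, some of the metadata items listed above could be gathered into a record like the one in the following Python sketch. The class names, field names and sample values (DataSetRecord, DistributionRecord, the example.org URL and so on) are hypothetical. They are not the classes or attributes defined in Clause 5, and only a subset of the listed items is shown.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DistributionRecord:
    """One available form of a data set (hypothetical field names)."""
    identifier: str
    title: str
    media_type: str                      # media type or file format
    download_url: Optional[str] = None


@dataclass
class DataSetRecord:
    """A subset of the data set metadata listed above (hypothetical names)."""
    identifiers: List[str]               # one or more unique identifiers
    title: str
    description: str
    issued: str                          # issue date, kept as ISO 8601 text
    access_level: Optional[str] = None
    keywords: List[str] = field(default_factory=list)
    languages: List[str] = field(default_factory=list)
    accrual_periodicity: Optional[str] = None
    distributions: List[DistributionRecord] = field(default_factory=list)

    def __post_init__(self) -> None:
        # The list above calls for "one or more unique identifiers".
        if not self.identifiers:
            raise ValueError("a data set needs at least one identifier")


# Invented sample values, for illustration only.
example = DataSetRecord(
    identifiers=["example-0001"],
    title="Example air quality readings",
    description="Hourly readings from a fictional monitoring network.",
    issued="2019-12-18",
    access_level="public",
    keywords=["air quality", "example"],
    languages=["en"],
    accrual_periodicity="monthly",
    distributions=[
        DistributionRecord("example-0001-csv", "CSV export", "text/csv",
                           "https://example.org/data/air-quality.csv"),
    ],
)
```

A registry built on this document would hold such information as instances of the metamodel classes rather than as a flat record; the sketch is only meant to make the list of items concrete.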
This document was prepared taking into account concepts described in the following documents:
— Data Catalog Vocabulary (DCAT) [1] (published by the World Wide Web Consortium (W3C));
— The PROV Ontology (Prov-O) [2] (published by the World Wide Web Consortium (W3C));
— The PROV Data Model (Prov-DM) [3] (published by the World Wide Web Consortium (W3C));
— Project Open Data Metadata Schema v1.1 [4] (published by the US Government).
Supplementary material is provided in Annexes as follows:
— Annex A provides an alphabetical list of the terms used in this document;
— Annex B provides two examples of the registration of data sets using the facilities specified in this
document;
— Annex C summarizes conformance profiles specified in this document that are additional to those
specified in ISO/IEC 11179-3:2013, Clause 4;
— Annex D provides a complete description of how the concept of provenance can be captured using
the facilities specified in this document.

INTERNATIONAL STANDARD ISO/IEC 11179-7:2019(E)
Information technology — Metadata registries (MDR) —
Part 7:
Metamodel for data set registration
1 Scope
This document specifies an extension to a Metadata Registry (MDR), as specified in
ISO/IEC 11179-3:2013, Clauses 5 to 11, in which metadata describing data sets (collections of data
available for access or download in one or more formats) can be registered. Since a set can contain a
single element, this document also enables the recording of metadata about a single data value.
The registered metadata provides information about the data set, including its provenance and
quality.
2 Normative references
The following documents are referred to in the text in such a way that some or all of their content
constitutes requirements of this document. For dated references, only the edition cited applies. For
undated references, the latest edition of the referenced document (including any amendments) applies.
ISO/IEC 11179-3:2013, Information technology — Metadata registries (MDR) — Part 3: Registry
metamodel and basic attributes
ISO/IEC 11179-6, Information technology — Metadata registries (MDR) — Part 6: Registration
3 Terms, definitions and abbreviated terms
3.1 Terms and definitions
For the purposes of this document, the terms and definitions given in ISO/IEC 11179-3 and the
following apply.
ISO and IEC maintain terminological databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https://www.iso.org/obp
— IEC Electropedia: available at http://www.electropedia.org/
3.1.1
access level
level of authority required from an entity to access a protected resource
Note 1 to entry: In the context of this document, items to which an access level may be specified are limited to a
data set, a data set distribution and a data set collection.
Note 2 to entry: For the public, the level of authority might describe the degree of public availability of a dataset.
EXAMPLE Public, restricted public and non-public.
3.1.2
accrual periodicity
frequency with which items are added to a collection
Note 1 to entry: In the context of this document, it is assumed that an updated version of a data set is issued or
made available whenever new items are added to that data set.
[SOURCE: ISO 2146:2010, 7.2.6, modified – Note 1 to entry added and EXAMPLE deleted]
3.1.3
data element collection
collection of data elements that may be unordered or ordered
Note 1 to entry: Examples of unordered collections are a set or a bag (or multiset). An example of an ordered
collection is a list.
3.1.4
data set
dataset
identifiable collection of data available for access or download in one or more formats
Note 1 to entry: A data set can be a smaller grouping of data which, though limited by some constraint such as
spatial extent or feature type, is located physically within a larger data set. Theoretically, a data set can be as
small as a single feature or feature attribute contained within a larger data set.
Note 2 to entry: A data set may be presented in tabular form and stored and distributed in tables in
word-processed documents, spreadsheets or databases. It could also be presented in any one of a number of
alternative formats, including AVRO, JSON, RDF and XML.
3.1.5
data set collection
curated collection of one or more data sets (3.1.4)
3.1.6
data set distribution
specific available form of a data set (3.1.4) or data set collection (3.1.5)
Note 1 to entry: Each data set might be available in different forms and each of these forms represents a different
format of the data set or a different endpoint.
Note 2 to entry: Examples of distributions include a downloadable CSV file, an API or an RSS feed. This represents
a general availability of a data set. It implies no information about the actual access method of the data, i.e.
whether it is a direct download, API, or through accessing a Web page.
3.1.7
data set specification
additional formal information to describe a data set
Note 1 to entry: A data set specification may consist of zero, one or more ISO/IEC 11179-3 Reference_Documents,
zero, one or more ISO/IEC 19763-12 Information_Models and/or zero, one or more sets of ISO/IEC 11179-3
Data_Elements.
3.1.8
information model
graphical and textual representation of entities and the relationships between them
Note 1 to entry: An information model can exist as, at the conceptual or logical level, an entity relationship model
or an object class diagram, and, at the physical level, a database schema definition.
[SOURCE: ISO/IEC 19763-12:2015, 4.2.24, modified – Note 1 to entry amended]
3.1.9
ontology
specification of concrete or abstract things, and the relationships among them, in a prescribed domain
of knowledge
Note 1 to entry: The specification should be computer processable, such as an ontology registered as metadata
using the ISO/IEC 11179-3 Concept Region specification.
[SOURCE: ISO/IEC 19763-3:2010, 3.1.1.1, modified – Note 1 to entry amended]
3.1.10
provenance
information on the place and time of origin, derivation or generation of a resource or a record or proof
of authenticity or of past ownership
3.1.11
rights
information regarding access or restrictions based on privacy, security or other policies
Note 1 to entry: This information may explain why a “non-public” or “restricted public” data set, data set
distribution or data set collection is not “public”.
Note 2 to entry: This definition is derived from Project Open Data Metadata Schema v1.1
(https://project-open-data.cio.gov/v1.1/schema/#dataQuality)
3.1.12
spatial coverage
geographical area which is the subject of a data set (3.1.4) or data set collection (3.1.5)
3.1.13
temporal coverage
period for which a data set (3.1.4) is applicable
3.2 Abbreviated terms
URL uniform resource locator
4 Conformance
4.1 Overview of conformance
Conformance rules for a Metadata Registry are specified in ISO/IEC 11179-3:2013, Clause 4. The
following subclauses extend these rules.
4.2 Conformance by clause
Conformance claims may also be limited to Clause 5. This clause is also dependent upon one or more
of ISO/IEC 11179-3:2013, Clauses 5 to 11, so conformance to this clause shall be understood to imply
conformance also to relevant provisions specified in one or more of ISO/IEC 11179-3:2013, Clauses 5 to 11.
Conformance may, therefore, be claimed for a set of data structures and/or datatypes for Clause 5.
4.3 Standard profiles
4.3.1 General
This document specifies the following standard profiles in addition to those specified in
ISO/IEC 11179-3:2013, 4.4.2.
4.3.2 Data Set Registry profile
This profile implements ISO/IEC 11179-3:2013, Clauses 7, 8 and 9, and Clause 5 of this document.
4.3.3 Extended Data Set Registry profile
This profile implements ISO/IEC 11179-3:2013, Clause 11, and also satisfies all provisions of the Data
Set Registry profile.
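The two profiles can be pictured as sets of required clauses, with the extended profile being a strict superset of the basic one. The Python sketch below is only an illustration: it reads "Clause 5" in 4.3.2 as Clause 5 of this document, and the constant names, the (document, clause) pair representation and the satisfied_profiles helper are all assumptions made for the example.

```python
# Clauses required by each profile, as stated in 4.3.2 and 4.3.3.
# The representation as (document, clause) pairs is illustrative only.
PART3 = "ISO/IEC 11179-3:2013"
PART7 = "ISO/IEC 11179-7:2019"

DATA_SET_REGISTRY = {(PART3, 7), (PART3, 8), (PART3, 9), (PART7, 5)}
# The extended profile adds Part 3 Clause 11 to everything above.
EXTENDED_DATA_SET_REGISTRY = DATA_SET_REGISTRY | {(PART3, 11)}


def satisfied_profiles(implemented):
    """Return the names of the profiles whose clauses are all implemented."""
    profiles = {
        "Data Set Registry": DATA_SET_REGISTRY,
        "Extended Data Set Registry": EXTENDED_DATA_SET_REGISTRY,
    }
    return [name for name, clauses in profiles.items()
            if clauses <= set(implemented)]
```

Because the extended profile is built by set union, any implementation that satisfies it necessarily satisfies the Data Set Registry profile as well, which mirrors the wording of 4.3.3.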
5 Data Set package
5.1 Data Set metamodel region
5.1.1 Overview of the Data Set metamodel region
Figure 1 shows the metamodel for the registration of metadata about data sets.
Instances of each of the classes shown in Figure 1 shall be extended by one or more of an identified
item (with its subtypes of registered item, administered item or attached item), a designatable item or a
classifiable item, as specified in ISO/IEC 11179-3:2013, 5.5. See subclause 6.1.2 for details of the possible
types for instances of each of the classes shown in Figure 1.
Figure 1 — Data Set metamodel region
5.1.2 Classes in the Data Set metamodel region
5.1.2.1 Concept class
The Concept class is described in ISO/IEC 11179-3:2013, 9.1.2.1.
In this document, the instances of the Concept class are used to model constructs within a concept
system (which may or may not be an ontology (3.1.9)) that are used to annotate a data set or a data set
collection so as to describe the theme or category of the data set or data set collection.
A Concept may participate in the following additional associations specified in this document:
— data_set_collection_concept_annotation (5.1.4.3) with zero, one or more
concept_annotated_data_set_collection Data_Set_Collections (5.1.2.6) in which this Concept is a concept_annotation.
— data_set_concept_annotation (5.1.4.8) with zero, one or more concept_annotated_data_set Data_Sets
(5.1.2.5) in which this Concept is a concept_annotation.
The Concept class has no additional attributes specified in this document.
5.1.2.2 Context class
The Context class is described in ISO/IEC 11179-3:2013, 7.3.2.5.
In this document, the instances of the Context class are used to model a particular programme, project
or business area that uses a data set (or collection of data sets), or a project or business area that is used
as the context for a quality assessment of a data set (or collection of data sets).
A Context may participate in the following additional associations specified in this document:
— data_set_context (5.1.4.9) with zero, one or more used_data_set Data_Sets (5.1.2.5) in which this
Context is a data_set_usage.
— data_set_collection_context (5.1.4.4) with zero, one or more used_data_set_collection
Data_Set_Collections (5.1.2.6) in which this Context is a data_set_collection_usage.
— quality_assessment_context (5.1.4.20) with zero, one or more contextualised_quality_assessment
Data_Set_Quality_Assessments (5.1.2.9) in which this Context is a quality_assessment_context.
The Context class has no additional attributes specified in this document.
5.1.2.3 Data_Element class
The Data_Element class is described in ISO/IEC 11179-3:2013, 11.1.2.4.
In this document, the instances of the Data_Element class are used to model any registered data element
that is referenced in a data set specification.
A Data_Element may participate in the following additional associations specified in this document:
— unordered_data_element_collection_data_element (5.1.4.21) with zero, one or more
containing_data_element_collection Unordered_Data_Element_Collections (5.1.2.13) in which this Data_Element is a
contained_data_element.
— ordered_data_element_collection_data_element (5.1.4.19) with zero, one or more
containing_data_element_collection Ordered_Data_Element_Collections (5.1.2.12) in which this Data_Element is a
contained_data_element.
The Data_Element class has no additional attributes specified in this document.
5.1.2.4 Data_Element_Collection class
Data_Element_Collection is a class each instance of which models a particular data element collection
(3.1.3), which is either an unordered collection of data elements or an ordered collection of data
elements.
A data element collection brings together the data elements that are referenced in the specification of a
particular data set.
A Data_Element_Collection shall be either an Unordered_Data_Element_Collection (5.1.2.13) or an
Ordered_Data_Element_Collection (5.1.2.12) but not both.
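The "either ... or ... but not both" rule can be pictured as two disjoint subtypes. The following Python sketch is one possible reading, not the metamodel itself: the class names, the use of string identifiers to stand in for Data_Element instances, and the choice of frozenset and tuple as containers are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import FrozenSet, Tuple, Union


@dataclass(frozen=True)
class UnorderedDataElementCollection:
    # Data elements are represented here by identifiers (an assumption).
    # A frozenset models a set; a bag (multiset) would need another container.
    contained_data_elements: FrozenSet[str]


@dataclass(frozen=True)
class OrderedDataElementCollection:
    # A tuple preserves the order of the contained data elements.
    contained_data_elements: Tuple[str, ...]


# A collection is exactly one of the two subtypes, never both.
DataElementCollection = Union[UnorderedDataElementCollection,
                              OrderedDataElementCollection]


def is_ordered(collection: DataElementCollection) -> bool:
    """Report which of the two mutually exclusive subtypes this is."""
    return isinstance(collection, OrderedDataElementCollection)
```

Keeping the two subtypes as separate classes with no common instance makes it impossible to construct a collection that is both ordered and unordered.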
A Data_Element_Collection may participate in the following associations:
— data_element_collection_hierarchy (5.1.4.1) with:
— zero or one superset_data_element_collection Data_Element_Collection in which this
Data_Element_Collection is a subset_data_element_collection.
— zero, one or more subset_data_element_collection Data_Element_Collections in which this
Data_Element_Collection is the superset_data_element_collection.
— data_set_specification_data_element_collection (5.1.4.14) with zero, one or more
data_element_collection_data_set_specification Data_Set_Specifications (5.1.2.10) in which this
Data_Element_Collection is a partial_data_set_specification.
The Data_Element_Collection class has no attributes.
5.1.2.5 Data_Set class
5.1.2.5.1 Description of Data_Set
Data_Set is a class each instance of which models a particular data set (3.1.4), which is a collection of
data available for access or download in one or more formats.
A Data_Set may participate in the following associations:
— data_set_context (5.1.4.9) with zero, one or more data_set_usage Contexts (5.1.2.2) in which this
Data_Set is a used_data_set.
— data_set_data_set_distribution (5.1.4.11) with zero, one or more data_set_distribution
Data_Set_Distributions (5.1.2.7) in which this Data_Set is the distributed_data_set.
— data_set_data_set_collection (5.1.4.10) with zero, one or more containing_collection
Data_Set_Collections (5.1.2.6) in which this Data_Set is an included_data_set.
— data_set_data_set_specification (5.1.4.12) with zero, one or more data_set_specification
Data_Set_Specifications (5.1.2.10) in which this Data_Set is a specified_data_set.
— data_set_hierarchy (5.1.4.13) with:
— zero, one or more subset_data_set Data_Sets in which this Data_Set is the superset_data_set.
— zero or one superset_data_set Data_Set in which this Data_Set is a subset_data_set.
— data_set_concept_annotation (5.1.4.8) with zero, one or more concept_annotation Concepts (5.1.2.1)
in which this Data_Set is a concept_annotated_data_set.
— data_set_provenance (5.1.4.14) with zero, one or more publication_record Data_Set_Provenances
(5.1.2.8) in which this Data_Set is the published_data_set.
— data_set_quality_assessment (5.1.4.15) with zero, one or more data_set_quality_assessment
Data_Set_Quality_Assessments (5.1.2.9) in which this Data_Set is the quality_assessed_data_set.
— data_set_supersession (5.1.4.16) with:
— zero or one replacement_data_set Data_Set in which this Data_Set is the superseded_data_set.
— zero or one superseded_data_set Data_Set in which this Data_Set is the replacement_data_set.
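The cardinalities of the data_set_hierarchy and data_set_supersession associations listed above can be sketched as follows. This is a minimal Python illustration under stated assumptions: the DataSet class, its name field and the add_subset and supersede_with helpers are hypothetical, and the other associations in the list are omitted.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)  # compare data sets by identity, not by field values
class DataSet:
    name: str
    # data_set_hierarchy: zero or one superset, zero, one or more subsets.
    superset_data_set: Optional["DataSet"] = None
    subset_data_sets: List["DataSet"] = field(default_factory=list)
    # data_set_supersession: zero or one data set in each direction.
    replacement_data_set: Optional["DataSet"] = None
    superseded_data_set: Optional["DataSet"] = None

    def add_subset(self, subset: "DataSet") -> None:
        # A data set may have at most one superset.
        if subset.superset_data_set is not None:
            raise ValueError(f"{subset.name} already has a superset")
        subset.superset_data_set = self
        self.subset_data_sets.append(subset)

    def supersede_with(self, replacement: "DataSet") -> None:
        # Each side of the supersession allows at most one partner.
        if self.replacement_data_set is not None:
            raise ValueError(f"{self.name} already has a replacement")
        if replacement.superseded_data_set is not None:
            raise ValueError(f"{replacement.name} already supersedes a data set")
        self.replacement_data_set = replacement
        replacement.superseded_data_set = self
```

For example, calling `national.add_subset(regional)` records both ends of the hierarchy link, and a second attempt to give `regional` another superset raises an error, which reflects the "zero or one superset_data_set" constraint.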
The attributes of the Data_Set class are summarized here and specified more formally in 5.1.2.5.2:
— A Data_Set may have zero or one access_level (5.1.2.5.2.1) of type String, which is the degree to
which this data set could be made publicly available, regardless of whether it has actually been made
available. If there is an instance of Data_Set_Collection associated with this instance of Data_Set and
the access_level of the associated instance of Data_Set_Collection differs from the access_level of this
instance of Data_Set then it is assumed that the access_level of this instance of Data_Set overrides
the access_level of
...
