SIST EN ISO/IEC 23053:2023
(Main)Framework for Artificial Intelligence (AI) Systems Using Machine Learning (ML) (ISO/IEC 23053:2022)
Framework for Artificial Intelligence (AI) Systems Using Machine Learning (ML) (ISO/IEC 23053:2022)
This document establishes an Artificial Intelligence (AI) and Machine Learning (ML) framework for describing a generic AI system using ML technology. The framework describes the system components and their functions in the AI ecosystem. This document is applicable to all types and sizes of organizations, including public and private companies, government entities, and not-for-profit organizations, that are implementing or using AI systems.
Framework für Systeme der Künstlichen Intelligenz (KI) basierend auf maschinellem Lernen (ML) (ISO/IEC 23053:2022)
Cadre méthodologique pour les systèmes d'intelligence artificielle (IA) utilisant l'apprentissage machine (ISO/IEC 23053:2022)
Le présent document établit un cadre en matière d'intelligence artificielle (IA) et d'apprentissage machine (ML) pour la description d'un système d'IA générique utilisant la technologie du ML. Le cadre décrit les composants du système et leurs fonctions dans l'écosystème de l'IA. Le présent document s'applique aux organismes de tous types et de toutes tailles, y compris les entreprises publiques et privées, les entités gouvernementales et les organisations à but non lucratif, qui mettent en œuvre ou utilisent des systèmes d'IA.
Okvir za sisteme umetne inteligence (UI), ki temeljijo na strojnem učenju (ISO/IEC 23053:2022)
Navezuje se na mere nastavkov in tolerance za pokrovčke elektrod, adapterje elektrod, držala elektrod in podobne dele, pri katerih sila elektrode Fmax, navedena za premer d1 v preglednicah 1, 2 in 3, ni presežena. Določa mere, opisovanje in označevanje. Razveljavlja in nadomešča priporočilo ISO R 1089-1969 ter predstavlja tehnično popravljeno izdajo.
General Information
Relations
Standards Content (Sample)
SLOVENSKI STANDARD
01-november-2023
Okvir za sisteme umetne inteligence (UI), ki temeljijo na strojnem učenju (ISO/IEC
23053:2022)
Framework for Artificial Intelligence (AI) Systems Using Machine Learning (ML) (ISO/IEC
23053:2022)
Framework für Systeme der Künstlichen Intelligenz (KI) basierend auf maschinellem
Lernen (ML) (ISO/IEC 23053:2022)
Cadre méthodologique pour les systèmes d'intelligence artificielle (IA) utilisant
l'apprentissage machine (ISO/IEC 23053:2022)
Ta slovenski standard je istoveten z: EN ISO/IEC 23053:2023
ICS:
35.020 Informacijska tehnika in Information technology (IT) in
tehnologija na splošno general
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
EUROPEAN STANDARD EN ISO/IEC 23053
NORME EUROPÉENNE
EUROPÄISCHE NORM
June 2023
ICS 35.020
English version
Framework for Artificial Intelligence (AI) Systems Using
Machine Learning (ML) (ISO/IEC 23053:2022)
Cadre méthodologique pour les systèmes d'intelligence Framework für Systeme der Künstlichen Intelligenz
artificielle (IA) utilisant l'apprentissage machine (KI) basierend auf maschinellem Lernen (ML) (ISO/IEC
(ISO/IEC 23053:2022) 23053:2022)
This European Standard was approved by CEN on 26 June 2023.
CEN and CENELEC members are bound to comply with the CEN/CENELEC Internal Regulations which stipulate the conditions for
giving this European Standard the status of a national standard without any alteration. Up-to-date lists and bibliographical
references concerning such national standards may be obtained on application to the CEN-CENELEC Management Centre or to
any CEN and CENELEC member.
This European Standard exists in three official versions (English, French, German). A version in any other language made by
translation under the responsibility of a CEN and CENELEC member into its own language and notified to the CEN-CENELEC
Management Centre has the same status as the official versions.
CEN and CENELEC members are the national standards bodies and national electrotechnical committees of Austria, Belgium,
Bulgaria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy,
Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Republic of North Macedonia, Romania, Serbia,
Slovakia, Slovenia, Spain, Sweden, Switzerland, Türkiye and United Kingdom.
CEN-CENELEC Management Centre:
Rue de la Science 23, B-1040 Brussels
© 2023 CEN/CENELEC All rights of exploitation in any form and by any means
Ref. No. EN ISO/IEC 23053:2023 E
reserved worldwide for CEN national Members and for
CENELEC Members.
Contents Page
European foreword . 3
European foreword
The text of ISO/IEC 23053:2022 has been prepared by Technical Committee ISO/IEC JTC 1 "Information
technology” of the International Organization for Standardization (ISO) and has been taken over as
secretariat of which is held by DS.
This European Standard shall be given the status of a national standard, either by publication of an
identical text or by endorsement, at the latest by December 2023, and conflicting national standards
shall be withdrawn at the latest by December 2023.
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. CEN-CENELEC shall not be held responsible for identifying any or all such patent rights.
Any feedback and questions on this document should be directed to the users’ national standards body.
A complete listing of these bodies can be found on the CEN and CENELEC websites.
According to the CEN-CENELEC Internal Regulations, the national standards organizations of the
following countries are bound to implement this European Standard: Austria, Belgium, Bulgaria,
Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland,
Ireland, Italy, Latvia, Lithuania, Luxembourg, Malta, Netherlands, Norway, Poland, Portugal, Republic of
North Macedonia, Romania, Serbia, Slovakia, Slovenia, Spain, Sweden, Switzerland, Türkiye and the
United Kingdom.
Endorsement notice
The text of ISO/IEC 23053:2022 has been approved by CEN-CENELEC as EN ISO/IEC 23053:2023
without any modification.
INTERNATIONAL ISO/IEC
STANDARD 23053
First edition
2022-06
Framework for Artificial Intelligence
(AI) Systems Using Machine Learning
(ML)
Cadre méthodologique pour les systèmes d’intelligence artificielle (IA)
utilisant l’apprentissage machine
Reference number
ISO/IEC 23053:2022(E)
© ISO/IEC 2022
ISO/IEC 23053:2022(E)
© ISO/IEC 2022
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
ii
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
Contents Page
Foreword .iv
Introduction .v
1 Scope . 1
2 Normative references . 1
3 Terms and definitions . 1
3.1 Model development and use . 1
3.2 Tools . 2
3.3 Data . 2
4 Abbreviated terms . 3
5 Overview . 4
6 Machine learning system .4
6.1 Overview . 4
6.2 Task . 5
6.2.1 General . 5
6.2.2 Regression . 6
6.2.3 Classification . . 6
6.2.4 Clustering . . 6
6.2.5 Anomaly detection . . 6
6.2.6 Dimensionality reduction . 7
6.2.7 Other tasks . 7
6.3 Model . 7
6.4 Data . 8
6.5 Tools . 9
6.5.1 General . 9
6.5.2 Data preparation . 9
6.5.3 Categories of ML algorithms . 10
6.5.4 ML optimisation methods . 14
6.5.5 ML evaluation metrics . 16
7 Machine learning approaches .19
7.1 General . 19
7.2 Supervised machine learning . 20
7.3 Unsupervised machine learning . 22
7.4 Semi-supervised machine learning. 23
7.5 Self-supervised machine learning . 23
7.6 Reinforcement machine learning . 23
7.7 Transfer learning . 24
8 Machine learning pipeline .25
8.1 General . 25
8.2 Data acquisition .26
8.3 Data preparation . 27
8.4 Modelling . 28
8.5 Verification and validation .30
8.6 Model deployment .30
8.7 Operation . 30
8.8 Example machine learning process based on ML pipeline . 31
Annex A (informative) Example data flow and data use statements for supervised learning
process .34
Bibliography .36
iii
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of International Standards through technical
committees established by the respective organization to deal with particular fields of technical
activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international
organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the
work.
The procedures used to develop this document and those intended for its further maintenance
are described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria
needed for the different types of document should be noted. This document was drafted in
accordance with the editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives or
www.iec.ch/members_experts/refdocs).
Attention is drawn to the possibility that some of the elements of this document may be the subject
of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent
rights. Details of any patent rights identified during the development of the document will be in the
Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents) or the IEC
list of patent declarations received (see patents.iec.ch).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to
the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT) see
www.iso.org/iso/foreword.html. In the IEC, see www.iec.ch/understanding-standards.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 42, Artificial Intelligence.
Any feedback or questions on this document should be directed to the user’s national standards
body. A complete listing of these bodies can be found at www.iso.org/members.html and
www.iec.ch/national-committees.
iv
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
Introduction
Artificial intelligence (AI) systems, in general, are engineered systems that generate outputs such as
content, forecasts, recommendations or decisions for a given set of human-defined objectives. AI covers
a wide range of technologies that reflect different approaches to dealing with these complex problems.
ML is a branch of AI that employs computational techniques to enable systems to learn from data or
experiences. In other words, ML systems are developed through the optimisation of algorithms to fit to
training data, or improve their performance based through maximizing a reward. ML methods include
deep learning, which is also addressed in this document.
Terms such as knowledge, learning and decisions are used throughout the document. However, it is not
the intent to anthropomorphize machine learning (ML).
This document aims to provide a framework for the description of AI systems that use ML. By
establishing a common terminology and a common set of concepts for such systems, this document
provides a basis for the clear explanation of the systems and various considerations that apply to their
engineering and to their use. This document is intended for a wide audience including experts and non-
practitioners. However, some of the clauses (identified in the overview in Clause 5), include more in-
depth technical descriptions.
This document also provides the basis for other standards directed at specific aspects of ML systems
and their components.
v
© ISO/IEC 2022 – All rights reserved
INTERNATIONAL STANDARD ISO/IEC 23053:2022(E)
Framework for Artificial Intelligence (AI) Systems Using
Machine Learning (ML)
1 Scope
This document establishes an Artificial Intelligence (AI) and Machine Learning (ML) framework
for describing a generic AI system using ML technology. The framework describes the system
components and their functions in the AI ecosystem. This document is applicable to all types and
sizes of organizations, including public and private companies, government entities, and not-for-profit
organizations, that are implementing or using AI systems.
2 Normative references
The following documents are referred to in the text in such a way that some or all of their content
constitutes requirements of this document. For dated references, only the edition cited applies. For
undated references, the latest edition of the referenced document (including any amendments) applies.
ISO/IEC 22989, Information technology—Artificial intelligence — Artificial intelligence concepts and
terminology
3 Terms and definitions
For the purposes of this document, the terms and definitions given in ISO/IEC 22989 and the following
apply.
ISO and IEC maintain terminology databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https:// www .iso .org/ obp
— IEC Electropedia: available at https:// www .electropedia .org/
3.1 Model development and use
3.1.1
classification model
machine learning model whose expected output for a given input is one or more
classes
3.1.2
regression model
machine learning model whose expected output for a given input is a continuous
variable
3.1.3
generalization
ability of a trained model to make correct predictions on previously unseen input
data
Note 1 to entry: A machine learning model that generalizes well is one that has acceptable prediction accuracies
using previously unseen input data.
Note 2 to entry: Generalization is closely related to overfitting. An overfit machine learning model will not
generalize well as the model fits the training data too precisely.
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
3.1.4
overfitting
creating a model which fits the training data too precisely and fails to generalize
on new data
Note 1 to entry: Overfitting can occur because the trained model has learned from non-essential features in the
training data (i.e. features that do not generalize to useful outputs), excessive noise in the training data (e.g.
excessive number of outliers) or because the model is too complex for the training data.
Note 2 to entry: Overfitting can be identified when there is a significant difference between errors measured
on training data and on separate test and validation data. The performance of overfitted models is especially
impacted when there is a significant mismatch between training data and production data.
3.1.5
underfitting
creating a model that does not fit the training data closely enough and produces
incorrect predictions on new data
Note 1 to entry: Underfitting can occur when features are poorly selected, insufficient training time or when the
model is too simple to learn from large training data due to limited model capacity (i.e. expressive power).
3.2 Tools
3.2.1
backpropagation
neural network training method that uses the error at the output layer to adjust and optimise the
weights for the connections from the successive previous layers
3.2.2
learning rate
step size for a gradient method
Note 1 to entry: Learning rate determines whether and how fast a model converges to an optimal solution,
making it an important hyperparameter to set for neural networks.
3.3 Data
3.3.1
class
human-defined category of elements that are part of the dataset and that share common attributes
EXAMPLE "telephone", "table", "chair", "ball bearing" and "tennis ball" are classes. The "table" class includes:
a work table, a dining table, a study desk, a coffee table, a workbench.
Note 1 to entry: Classes are typically target variables and designated by a name.
3.3.2
cluster
automatically induced category of elements that are part of the dataset and that share common
attributes
Note 1 to entry: Clusters do not necessarily have a name.
3.3.3
feature
measurable property of an object or event with respect to a set of characteristics
Note 1 to entry: Features play a role in training and prediction.
Note 2 to entry: Features provide a machine-readable way to describe the relevant objects. As the algorithm
will not go back to the objects or events themselves, feature representations are designed to contain all useful
information.
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
3.3.4
distance
measured proximity of two points in space
Note 1 to entry: Euclidean, or straight-line, distance is ordinarily used in machine learning.
3.3.5
unlabelled
property of a sample that does not include a target variable
4 Abbreviated terms
AI artificial intelligence
API application programming interface
AUC area under the curve
BM Boltzmann machines
CapsNet capsule neural network
CG conjugate gradient
CNN convolutional neural network
DBN deep belief networks
DCNN deep convolutional neural network
FFNN feed forward neural network
FNR false negative rate
FPR false positive rate
GRU gated recurrent unit
LSTM long short-term memory
MAE mean absolute error
MDP Markov decision process
ML machine learning
NN neural network
NNEF neural network exchange format
NPV negative predictive value
ONNX open neural network exchange
PCA principal component analysis
PHI personal or protected health information
PII personally identifiable information
PPV positive predictive value
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
REST representational state transfer
RNN recurrent neural network
ROC receiver operating characteristics
SGD stochastic gradient descent
SVM support vector machine
TNR true negative rate
TPR true positive rate
5 Overview
ISO/IEC 22989 defines ML as the process of optimising model parameters through computational
techniques, such that the model's behaviour reflects the data or experience. Since the early 1940s,
modelling of neurons (i.e. neural networks) and the development of computer programs that can learn
from data have been explored. ML is an expanding field with the emergence of new applications in a
wide array of industry sectors. This progression is enabled by the availability of large amounts of data
and computation resources. ML methods include neural networks and deep learning.
In ISO/IEC 22989, an AI ecosystem is presented in terms of its functional layers and ML is a significant
component of this AI ecosystem. Figure 1 illustrates the ML system which breaks down into the
components of model, software tools and techniques and data.
Clause 6 in this document describes in further detail the different components of the ML system.
Clause 7 in this document describes different ML approaches and describes their dependency on
training data.
Clause 8 in this document describes an ML pipeline: the processes involved in developing, deploying
and operating an ML model.
Clauses 6.5 and 7 are more technical than the rest of the document. A stronger technical background
can help the reader to better understand this content.
6 Machine learning system
6.1 Overview
Figure 1 depicts the elements of an ML system. They delineate the roles and their ML-specific functions
that can be implemented by different entities (e.g. different vendors). The examples provided in Figure 1
are not meant to be an exhaustive list. Further explanation on each section of Figure 1 continues
through Clause 6.
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
Figure 1 — Elements of an ML system
In Figure 1, the sub elements of model development and use can be considered as a layered approach,
i.e. applications are built from models which are used to solve tasks. Model development and use in turn
have a dependency on software tools and techniques and data.
A single ML system can be composed of several ML models used in combination. The system components
can be described in terms of their input, output and their intent or function. The components can be
tested independently.
ML models, when deployed, produce outputs such as predictions or decisions. A pre-trained model is
an ML model already trained when it was obtained. In some cases, the developed model can be applied
to a similar task, in a different domain. Transfer learning is a technique for modifying a pre-trained ML
model to perform a different related task.
In this document, application refers both to the intended use of one or more ML models and to the
concrete piece of software that implements that use. ML models are usually integrated with other
software components to create applications. Applications using ML differ in the types of the input
data they process and in the types of tasks they perform. In some applications, ML makes high-level
predictions or decisions, while in other applications, ML provides answers to narrowly defined
problems.
Differences in input data and tasks, as well as factors such as deployment options, accuracy and
reliability, result in different application designs. AI applications can use proprietary custom designs or
follow domain-specific design patterns.
Application logic is informed by the format of the input data, the output data, and potentially the
transformation and the flow of data between the ML models in use. In all cases, the choice of ML
algorithms and data preparation techniques is tailored to the application’s tasks.
6.2 Task
6.2.1 General
The term "task" refers to actions required to achieve a specific goal. In ML, this implies identifying a
problem to be solved using the ML model. One or more ML tasks can be defined for an ML application.
Instead of solving a problem using a specific function represented as a set of steps and implemented
in a software code, the defined problem is solved by applying a trained ML model to production data.
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
Effectively, the trained ML model implements a target function, which is an approximation of the
hypothetical function that would have been written by a programmer to solve the problem.
An ML task setup involves defining the problem, the data format and the features.
The tasks described in the following subclauses are examples and are not exhaustive.
6.2.2 Regression
Regression tasks comprise predicting a continuous variable by learning a function that best fits a set of
training data. In a regression task, the trained regression model represents a custom space. When the
trained model is applied to a new production data instance, the instance is projected into the custom
space defined by the trained regression model.
Regression is mainly used to predict numerical values of a real-world process based on previous
measurements or observations from the same process. Use cases for regression include:
— predicting stock market price;
— predicting the age of a viewer of streaming videos;
— predicting the amount of prostate-specific antigen in the body based on different clinical
measurements.
6.2.3 Classification
Classification tasks comprise predicting the assignment of an instance of input data to a defined
category or class. Classification can be binary (i.e. true or false), multi-class (i.e. one of several
possibilities) or multilabel (i.e. any number out of several possibilities). For example, classification can
be used to predict whether an object in an image is a cat or a dog, or even from a completely different
species. The classes are typically from a discrete and unordered set, such that the problem cannot be
formalised as a regression task. For example, a medical diagnosis of a set of symptoms can be {stroke,
drug overdose, seizure}, there is no order to the class values, and there is no continuous change from
one class to another.
Use cases for classification include:
— Document classification and email spam filtering, where documents are grouped into several
classes. A spam filter for instance uses two classes, namely “spam” and “not spam”;
— Classifying the species of a specimen. For example, an ML classification model can predict the
species of a flower when provided with data that specifies the sepal length and width, and the petal
length and width;
— Image classification. Given a set of images (e.g. of furniture), an ML system can be used to recognize
and name the objects shown in those images.
6.2.4 Clustering
Clustering tasks comprise grouping input data instances. Unlike classification tasks, the classes are not
predefined in clustering tasks but are determined as part of the clustering process. Clustering can be
used as a data preparation step to identify homogenous data which can then be used as training data for
supervised machine learning. Clustering can also be used to detect outliers or anomalies by identifying
input data instances that are not like other samples. Example applications of clustering tasks include
the sorting or organizing of files.
6.2.5 Anomaly detection
Anomaly detection comprise identifying input data instances that do not conform to an expected
pattern. Anomaly detection can be useful for applications such as detecting fraud or unusual activities.
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
For anomaly detection, the ML model predicts whether an input data instance is typical for a given
distribution.
6.2.6 Dimensionality reduction
Dimensionality reduction consists of reducing the number of attributes or dimensions per sample while
retaining most of the useful information.
Dimensionality reduction can promote a dataset’s most useful features and thereby mitigate
computation costs.
Dimensionality reduction alleviates the various less-than-ideal effects of keeping too many features,
collectively known as “the curse of dimensionality”. Dimensionality reduction is also useful for data
exploration and model analysis.
[1]
Methods for dimensionality reduction are unsupervised, supervised or semi-supervised .
6.2.7 Other tasks
There exist many other tasks which have different purposes and expected outputs. These tasks can be
specific to a given application. Examples of other tasks include semantic segmentation of text or images,
machine translation, speech recognition or synthesis, object localisation and image generation.
In planning, the task is to optimise a sequence of actions from an agent or agents through observing the
environmental state.
Despite their diversity, a number of concepts have been formulated to draw connections between some
of these other tasks. Structured prediction, corresponding to tasks in which the expected output of the
model is a structured object as opposed to a single value, is one such concept.
Structured prediction requires computational methods that can account for regularities in the output,
either by explicitly modelling them or by jointly predicting the whole structure with a model that
internally models the regularities.
Use cases for structured prediction include:
— constructing a parse tree for a natural language sentence;
— translating a sentence in one language into a sentence in another language;
— predicting protein structure;
— semantic segmentation of an image.
6.3 Model
ISO/IEC 22989:2022, 3.2.11, defines an ML model as a mathematical construct that generates an
inference or prediction, based on input data or information. The ML model comprises a data structure
and software to process the structure, both determined by a chosen ML algorithm. The model is
configured with inputs and outputs essential to solving the given problem.
The model is populated (also known as “trained”) to represent the relevant statistical properties of the
training data. Effectively, through the training process the model “learns” how to solve the problem for
the training data with the goal to apply this acquired knowledge to a real-world application.
ML models produce results that are approximations of optimal solutions. ML algorithms utilise
statistical optimisation methods to perform this approximation. The resultant mapping from the
inputs to the outputs of the model reflects the patterns learned from the training data. Patterns can
relate to correlations, causal relationships or categories of data objects. ML models are the result of
the training data used. Thus, if the data used is incomplete, or reflects inherent societal bias, then the
model performance will reflect this as well. Therefore, care should be taken with the datasets used for
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
training models. The logic created in the process of machine learning and represented by the trained
model is not specified by a programmer but evolves during the training activities.
To see how well the model performs, it is evaluated using evaluation metrics.
Retraining consists of updating a trained model by training with different training data. It can be
necessary due to many factors, including the lack of large training data, data drift and concept drift.
In data drift, the accuracy of the model’s predictions decays over time due to changes in the statistical
characteristics of the production data. In this case, the model needs to be retrained with new training
data that better represents the production data.
In concept drift, the decision boundary appears to move, which also degrades the accuracy of
predictions, even though the data has not changed. In the case of concept drift, the target variables in
the training data need to be relabelled and the model retrained.
Retraining can also occur for purposes such as transfer learning and optimisation or modification of
the ML model.
Some models can be readily available which can then be retrained and optimised for a specific use
case or used as is. An example can be a commercially available machine translation model that can be
retrained to be used for translating legal documents.
Continuous learning is a special case of retraining in that the model’s performance is continuously
evolving, resulting from on-going training of the model with production data. In such cases, it can
be necessary to continuously monitor the model’s performance or to implement “guard-rails” on the
acceptable behaviour for the model.
6.4 Data
Figure 2 is a rake diagram showing that the data concept is partitioned into four mutually exclusive
categories:
a) Training dataset, used to estimate the parameters of candidate models;
b) Validation dataset, also known as development dataset depending on the AI field (e.g. in natural
language processing), used to select the best model according to a performance criterium;
c) Test dataset, used to check the generalisation capacity of a model and determines its performance
on future data;
d) Production data, comprised of operational data to be used by the model for prediction. The
distribution of production data can differ from that of training, validation and test dataset.
Training, validation and test datasets can be supplemented with simulated and perturbed data.
Figure 2 — Concept diagram: data and datasets
These various types of data can be comprised of either input data alone, or input data associated with
labels (the expected output data). Validation and test data are often labelled, and production data is
typically unlabelled. For training data, it depends on the ML approach: it can be unlabelled, partially
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
labelled, or fully labelled. Labelled training data allows the ML algorithm to identify statistical
relationships between input variables and the target variable. Unlabelled training data allows the ML
algorithm to identify statistical correlations and structure within the input data.
Validation and test data are both used with statistical performance measures (which are discussed
in 6.5.5), but their uses differ: validation data is used to tune the hyperparameters, whereas test data
is about evaluating the model. The aim of test data is to verify that the trained model will perform, or
generalize, well on production data. Trained models that do not generalize well are called “overfitted”
(to the training data).
Note that this use of the term test data is limited to ML-specific processes, and distinct from usage of
the term in the context of verifying and validating an integrated system that uses ML components, in
which case it would refer to any data used for verification and validation purposes, without a particular
relation to ML. Note also that the ML-specific use of the term validation data is unrelated to verification
and validation processes, as it pertains to model development.
For reliable application of the ML processes, training, validation and test data need to be disjoint.
Training, validation and test sets can be obtained from the same dataset, using data splits, or they can
be acquired separately. In an ideal configuration, they all have the same statistical distribution, but
depending on the use case and ML approach it can be necessary to proceed differently.
For faithful evaluation of the trained model, the test data needs to have a distribution as similar as
possible to that of production data.
Production data is seen by the model only after its deployment. For the model to make accurate
predictions, in general the production data needs to have a distribution similar to that of the training
and validation data, although specific techniques exist that can alleviate the degradation in case of
discrepancy.
Over time, the production data distribution can drift which can require the model to be retrained on
new data. In cases where the model needs to adapt dynamically to new patterns in the production data,
the model can be continuously retrained by leveraging information gained from production data. Such
cases are discussed in 6.3.
6.5 Tools
6.5.1 General
ML model creation uses tools categorized as data preparation, ML algorithms, optimisation methods
and evaluation metrics. ML model performance is assessed through tools that generate evaluation
metrics.
ML model creation often requires high-performance compute workloads due to computational demands
and the use of large training datasets. Compute and storage performance can also affect how quickly
ML models can be developed and trained.
Fundamental challenges with ML include statistical analysis, algorithm design and optimisation.
Statistical analysis involves the principles of mathematical models derived from training data.
Algorithm design is the manner of implementation of algorithmic techniques used to build the ML
model. Optimisation of a given ML model is also an important issue in implementing ML. A further
challenge is in understanding the potential and possibilities of ML. It relies on data and so will replicate,
amplify and expedite existing faults and inequities in many cases.
6.5.2 Data preparation
Data preparation is discussed in 8.3.
© ISO/IEC 2022 – All rights reserved
ISO/IEC 23053:2022(E)
6.5.3 Categories of ML algorithms
6.5.3.1 General
The choice of the ML algorithm defines the computational structure of the ML model and its training
approach.
Algorithms can be used for different ML purposes, including:
— an information representation algorithm that can take part of the data preparation stage. This is
related to feature engineering;
— an algorithm used in the creation of an ML model.
The relationship between ML algorithms and ML
...








Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.
Loading comments...