Information technology — Internet of media things — Part 1: Architecture

This document describes the architecture of systems for the internet of media things.

Technologies de l'information — Internet des objets media — Partie 1: Architecture

General Information

Status
Published
Publication Date
29-Mar-2022
Current Stage
9092 - International Standard to be revised
Completion Date
18-Jan-2023

Standard
ISO/IEC 23093-1:2022 - Information technology — Internet of media things — Part 1: Architecture Released:3/30/2022
English language
23 pages
Draft
ISO/IEC FDIS 23093-1 - Information technology -- Internet of media things
English language
23 pages

Standards Content (Sample)

INTERNATIONAL STANDARD
ISO/IEC 23093-1
Second edition
2022-03
Information technology — Internet of media things —
Part 1:
Architecture
Technologies de l'information — Internet des objets media —
Partie 1: Architecture
Reference number
ISO/IEC 23093-1:2022(E)
© ISO/IEC 2022

COPYRIGHT PROTECTED DOCUMENT
© ISO/IEC 2022
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
Contents
Foreword
Introduction
1 Scope
2 Normative references
3 Terms and definitions
3.1 Internet of media things terms
3.2 Internet of things terms
4 Architecture
5 Use cases
5.1 General
5.2 Smart spaces: Monitoring and control with network of audio-video cameras
5.2.1 General
5.2.2 Human tracking with multiple network cameras
5.2.3 Dangerous region surveillance system
5.2.4 Intelligent firefighting with IP surveillance cameras
5.2.5 Automatic security alert and title generation system using time, GPS and visual information
5.2.6 Networked digital signs for customized advertisement
5.2.7 Digital signage and second screen use
5.2.8 Self-adaptive quality of experience for multimedia applications
5.2.9 Ultra-wide viewing video composition
5.2.10 Face recognition to evoke sensorial actuations
5.2.11 Automatic video clip generation by detecting event information
5.2.12 Temporal synchronization of multiple videos for creating 360° or multiple view video
5.2.13 Intelligent similar content recommendations using information from IoMT devices
5.2.14 Safety equipment detection on construction sites
5.3 Smart spaces: Multi-modal guided navigation
5.3.1 General
5.3.2 Blind person assistant system
5.3.3 Elderly people assistance with consecutive vibration haptic devices
5.3.4 Personalized navigation by visual communication
5.3.5 Personalized tourist navigation with natural language functionalities
5.3.6 Smart identifier: Face recognition on smart glasses
5.3.7 Smart advertisement: QR code recognition on smart glasses
5.4 Smart audio/video environments in smart cities
5.4.1 General
5.4.2 Smart factory: Car maintenance assistance A/V system using smart glasses
5.4.3 Smart museum: Augmented visit using smart glasses
5.4.4 Smart house: Light control, vibrating subtitle, olfaction media content consumption, odour image recognizer
5.4.5 Smart car: Head-light adjustment and speed monitoring to provide automatic volume control
5.5 Smart multi-modal collaborative health
5.5.1 General
5.5.2 Increasing patient autonomy by remote control of left-ventricular assisted devices
5.5.3 Diabetic coma prevention by monitoring networks of in-body/near body sensors
5.5.4 Enhanced physical activity with smart fabrics networks
5.5.5 Medical assistance with smart glasses
5.5.6 Managing healthcare information for smart glasses
5.5.7 Indoor air quality prediction
5.6 Blockchain usage for IoMT transactions authentication and monetizing
5.6.1 General
5.6.2 Reward function in IoMT people counting by using blockchains
5.6.3 Content authentication with blockchains
Annex A (informative) Mapping of the components between IoMT and IoT reference architectures
Bibliography
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of International Standards through technical
committees established by the respective organization to deal with particular fields of technical
activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international
organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the
work.
The procedures used to develop this document and those intended for its further maintenance
are described in the ISO/IEC Directives, Part 1. In particular, the different approval criteria
needed for the different types of document should be noted. This document was drafted in
accordance with the editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives or
www.iec.ch/members_experts/refdocs).
Attention is drawn to the possibility that some of the elements of this document may be the subject
of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent
rights. Details of any patent rights identified during the development of the document will be in the
Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents) or the IEC
list of patent declarations received (see patents.iec.ch).
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and
expressions related to conformity assessment, as well as information about ISO's adherence to
the World Trade Organization (WTO) principles in the Technical Barriers to Trade (TBT) see
www.iso.org/iso/foreword.html. In the IEC, see www.iec.ch/understanding-standards.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information.
This second edition cancels and replaces the first edition (ISO/IEC 23093-1:2020), which has been
technically revised.
The main changes are as follows:
— use case description and the underlying technology.
A list of all parts in the ISO/IEC 23093 series can be found on the ISO and IEC websites.
Any feedback or questions on this document should be directed to the user’s national standards
body. A complete listing of these bodies can be found at www.iso.org/members.html and
www.iec.ch/national-committees.
Introduction
The ISO/IEC 23093 series provides an architecture and specifies application programming interfaces
(APIs) and compressed representation of data flowing between media things.
The APIs for the media things facilitate discovering other media things in the network, connecting
and efficiently exchanging data between media things. The APIs also provide means for supporting
transaction tokens in order to access valuable functionalities, resources, and data from media things.
Media things related information consists of characteristics and discovery data, setup information
from a system designer, raw and processed sensed data, and actuation information. The ISO/IEC 23093
series specifies data formats of input and output for media sensors, media actuators, media storages,
media analysers, etc. Sensed data from media sensors can be processed by media analysers to produce
analysed data, and the media analysers can be cascaded in order to extract semantic information.
This document does not specify how the process of sensing and analysing is carried out but specifies
the interfaces between the media things. This document describes the architecture of systems for the
internet of media things.
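The cascading of media analysers mentioned above can be illustrated with a minimal Python sketch. All names in it (`MediaData`, `motion_analyser`, `event_analyser`, `cascade`) are hypothetical and are not defined by the ISO/IEC 23093 series. The analysis logic is a deliberately trivial placeholder, since this document does not specify how analysing is carried out.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class MediaData:
    """Hypothetical container for sensed or analysed data passed between media things."""
    payload: Dict[str, object]
    produced_by: List[str] = field(default_factory=list)


# An analyser takes data in and returns enriched data (assumed shape, not normative).
Analyser = Callable[[MediaData], MediaData]


def motion_analyser(data: MediaData) -> MediaData:
    """Placeholder first stage: list the frame indices that differ from the previous frame."""
    frames = data.payload["frames"]
    changed = [i for i in range(1, len(frames)) if frames[i] != frames[i - 1]]
    return MediaData({**data.payload, "changed_frames": changed},
                     data.produced_by + ["motion_analyser"])


def event_analyser(data: MediaData) -> MediaData:
    """Placeholder second stage: derive a coarse semantic label from the first stage's output."""
    label = "activity" if data.payload["changed_frames"] else "idle"
    return MediaData({**data.payload, "event": label},
                     data.produced_by + ["event_analyser"])


def cascade(sensed: MediaData, analysers: List[Analyser]) -> MediaData:
    """Feed the output of each analyser into the next one, in order."""
    result = sensed
    for analyser in analysers:
        result = analyser(result)
    return result
```

In this sketch the second analyser depends only on the output format of the first one, which mirrors the idea that the interfaces between media things are specified while the internal processing is left open.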
The International Organization for Standardization (ISO) and International Electrotechnical
Commission (IEC) draw attention to the fact that it is claimed that compliance with this document may
involve the use of a patent.
ISO and IEC take no position concerning the evidence, validity and scope of this patent right.
The holder of this patent right has assured ISO and IEC that they are willing to negotiate licences under
reasonable and non-discriminatory terms and conditions with applicants throughout the world. In this
respect, the statement of the holder of this patent right is registered with ISO and IEC. Information may
be obtained from the patent database available at www.iso.org/patents.
Attention is drawn to the possibility that some of the elements of this document may be the subject
of patent rights other than those in the patent database. ISO and IEC shall not be held responsible for
identifying any or all such patent rights.
Information technology — Internet of media things —
Part 1:
Architecture
1 Scope
This document describes the architecture of systems for the internet of media things.
2 Normative references
There are no normative references in this document.
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
ISO and IEC maintain terminology databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https://www.iso.org/obp
— IEC Electropedia: available at https://www.electropedia.org/
3.1 Internet of media things terms
3.1.1
audio
anything related to sound in terms of receiving, transmitting or reproducing it or of its specific
frequency
3.1.2
camera
special form of an image capture device (3.1.6) that senses and captures photo-optical signals
3.1.3
display
visual representation of the output of an electronic device or the portion of an electronic device that
shows this representation, as a screen, lens or reticle
3.1.4
gesture
movement or position of the hand, arm, body, head or face that is expressive of an idea, opinion, emotion,
etc.
3.1.5
haptics
input or output device that senses or actuates the body's movements by means of physical contact with
the user
3.1.6
image capture device
device which is capable of sensing and capturing acoustic, electrical or photo-optical signals of a
physical entity that can be converted into an image
3.1.7
internet of media things
IoMT
special subset of IoT (3.2.9) whose main functionalities are related to media processing
3.1.8
IoMT device
IoT (3.2.9) device that contains more than one MThing (3.1.12)
3.1.9
IoMT system
MSystem
IoT (3.2.9) system whose main functionality is related to media processing
3.1.10
loudspeaker
electroacoustic device, connected as a component in an audio system, generating audible acoustic waves
3.1.11
media
data that can be rendered, including audio, video, text, graphics, images, haptic and tactile information
Note 1 to entry: These data can be timed or non-timed.
3.1.12
media thing
MThing
thing (3.2.20) capable of sensing, acquiring, actuating, or processing of media or metadata
3.1.13
media token
virtual token for accessing functionalities, resources and data of media things
3.1.14
microphone
entity capable of capturing acoustic waves and transforming them into changes in electric currents or voltage,
used in recording or transmitting sound
3.1.15
media wearable
MWearable
MThing (3.1.12) intended to be located near, on or in an organism
3.1.16
motion
action or process of changing place or position
3.1.17
natural user interface
NUI
system for human-computer interaction that the user operates through intuitive actions related to
natural, everyday human behaviour
3.1.18
presentation
act of producing human recognizable output of rendered media
3.2 Internet of things terms
3.2.1
actuator
component which conveys digital information to effect a change of some property of a physical entity
3.2.2
capability
characteristic or property of an entity that can be used to describe its state, appearance or other aspects
EXAMPLE An entity type, address information, telephone number, a privilege, a MAC address, a domain
name are possible attributes, see ISO/IEC 24760-1.
3.2.3
component
modular, deployable and replaceable part of a system that encapsulates implementations
Note 1 to entry: A component may expose or use interfaces (local or on a network) to interact with other entities,
see ISO 19104. A component which exposes or uses network interfaces is called an endpoint.
3.2.4
digital entity
any computational or data element of an IT-based system
Note 1 to entry: It may exist as a service based in a data centre or cloud, or a network element or a gateway.
3.2.5
discovery
service to find unknown resources/entities/services based on a rough specification of the desired
result
Note 1 to entry: It may be utilized by a human or another service; credentials for authorization are considered
when executing the discovery, see ISO/IEC 30141.
3.2.6
entity
anything (physical or non-physical) having a distinct existence
3.2.7
identifier
information that unambiguously distinguishes one entity (3.2.6) from another one in a given identity
context
3.2.8
identity
characteristics determining who or what a person or thing is
3.2.9
internet of things
IoT
infrastructure of interconnected objects, people, systems and information resources together with
intelligent services to allow them to process information of the physical and the virtual world and to
react
3.2.10
interface
shared boundary between two functional components, defined by various characteristics pertaining
to the functions, physical interconnections, signal exchanges, and other characteristics, as appropriate
Note 1 to entry: See ISO/IEC 13066-1.
3.2.11
IoT system
system that is comprised of functions that provide the system the capabilities for identification, sensing,
actuation, communication and management, and applications and services to a user
Note 1 to entry: See Bahga and Madisetti [1].
3.2.12
network
entity that connects endpoints, sources to destinations, and may itself act as a value-added element in
the IoT system or services
3.2.13
process
procedure to carry out operations on data
3.2.14
physical entity
thing (3.2.20) that is discrete, identifiable and observable, and that has material existence in the real world
3.2.15
reference architecture
description of common features, common vocabulary, guidelines, interrelations and interactions among
the entities, and a template for an IoT architecture
3.2.16
resource
any element of a data processing system needed to perform required operations
Note 1 to entry: See ISO/IEC 2382.
3.2.17
sensor
device that observes and measures a physical property of a natural phenomenon or a human induced
process and converts that measurement into a signal
Note 1 to entry: A signal can be electrical, chemical, etc., see ISO/IEC 29182-2.
3.2.18
service
distinct part of the functionality that is provided by an entity through interfaces
3.2.19
storage
capacity of a digital entity to store information subject to recall or the components of a digital entity in
which such information is stored
3.2.20
thing
any entity that can communicate with other entities
3.2.21
user
human or any digital entity that is interested in interacting with a particular physical object
3.2.23
visual
any object perceptible by the sense of sight
4 Architecture
The global IoMT architecture is presented in Figure 1, which identifies a set of interfaces, protocols and
associated media-related information representations related to:
— user commands (setup information) between a system manager and an MThing, with reference to
interface 1.
— user commands (setup information) forwarded by an MThing to another MThing, possibly in a
modified form (e.g., subset of 1), with reference to interface 1’.
— sensed data (raw or processed data) (compressed or semantic extraction) and actuation information,
with reference to interface 2.
— wrapped interface 2 (e.g., for transmission), with reference to interface 2’.
— MThing characteristics, discovery, with reference to interface 3.
Figure 1 — IoMT architecture
This IoMT architecture can be mapped to the IoT reference architecture, see ISO/IEC 30141, as shown
in Annex A.
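The five interfaces listed above can be pictured with a short Python sketch. It is illustrative only: the type names (`Interface`, `Message`), the helper functions (`forward_setup`, `wrap_data`) and the dictionary-based message body are assumptions made for this example and are not the APIs or data formats specified in the ISO/IEC 23093 series.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Set


class Interface(Enum):
    """Interface labels taken from the list above; the enum itself is hypothetical."""
    SETUP = "1"              # user commands from a system manager to an MThing
    SETUP_FORWARDED = "1'"   # user commands forwarded from one MThing to another
    DATA = "2"               # sensed data and actuation information
    DATA_WRAPPED = "2'"      # interface 2 wrapped, e.g. for transmission
    DISCOVERY = "3"          # MThing characteristics and discovery


@dataclass(frozen=True)
class Message:
    interface: Interface
    sender: str
    receiver: str
    body: Dict[str, object]


def forward_setup(msg: Message, next_receiver: str, allowed_keys: Set[str]) -> Message:
    """Forward setup information on interface 1', keeping only a subset of the commands."""
    if msg.interface not in (Interface.SETUP, Interface.SETUP_FORWARDED):
        raise ValueError("only setup information can be forwarded on interface 1'")
    subset = {k: v for k, v in msg.body.items() if k in allowed_keys}
    return Message(Interface.SETUP_FORWARDED, msg.receiver, next_receiver, subset)


def wrap_data(msg: Message, transport: str) -> Message:
    """Wrap an interface 2 message for transmission, giving an interface 2' message."""
    if msg.interface is not Interface.DATA:
        raise ValueError("only interface 2 messages can be wrapped")
    wrapped = {"transport": transport, "payload": msg.body}
    return Message(Interface.DATA_WRAPPED, msg.sender, msg.receiver, wrapped)
```

For example, a camera that receives `{"fps": 30, "zone": "A", "admin_pin": "0000"}` on interface 1 could forward only `fps` and `zone` to a second camera, which corresponds to the "subset of 1" case mentioned for interface 1'.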
5 Use cases
5.1 General
MPEG identified 31 use-cases for IoMT; they are structured in the following five main categories:
a) Smart spaces: Monitoring and control with network of audio-video cameras (see 5.2)
— human tracking with multiple network cameras
— dangerous region surveillance system
— intelligent firefighting with IP surveillance cameras
— automatic security alert and title generation system using time, GPS and visual information
— networked digital signs for customized advertisement
— digital signage and second screen use
— self-adaptive quality of experience for multimedia applications
— ultra-wide viewing video composition
— face recognition to evoke sensorial actuations
— automatic video clip generation by detecting event information
— temporal synchronization of multiple videos for creating 360° or multiple view video
— intelligent similar content recommendations using information from IoMT devices
— safety equipment detection on construction sites
b) Smart spaces: Multi-modal guided navigation (see 5.3)
— blind person assistant system
— elderly people assistance with consecutive vibration haptic devices
— personalized navigation by visual communication
— personalized tourist navigation with natural language functionalities
— smart identifier: face recognition on smart glasses
— smart advertisement: QR code recognition on smart glasses
c) Smart audio/video environments in smart cities (see 5.4)
— smart factory: car maintenance assistance A/V system using smart glasses
— smart museum: augmented visit using smart glasses
— smart house: light control, vibrating subtitle, olfaction media content consumption
— smart car: head-light adjustment and speed monitoring to provide automatic volume control
d) Smart multi-modal collaborative health (see 5.5)
— increasing patient autonomy by remote control of left-ventricular assisted devices
— diabetic coma prevention by monitoring networks of in-body/near body sensors
— enhanced physical activity with smart fabrics networks
— medical assistance with smart glasses
— managing healthcare information for smart glasses
— indoor air quality prediction
e) Blockchain usage for IoMT transactions authentication and monetizing (see 5.6)
— reward function in IoMT people counting by using blockchains
— content authentication with blockchains
5.2 Smart spaces: Monitoring and control with network of audio-video cameras
5.2.1 General
The large variety of sensors, actuators, displays and computational elements acting in our day-to-day
professional and private space in order to provide us with better and more easily accessible services leads to
13 use cases of interest for IoMT, mainly related to the processing of video information.
5.2.2 Human tracking with multiple network cameras
As urban growth is today accompanied by an increase in crime rates (e.g., theft, vandalism), many
local authorities consider surveillance systems as a possible tool to fight this phenomenon. A city video
surveillance system is an IoMT system that includes a set of IP surveillance cameras, a storage unit and
a human tracker unit.
A particular IP surveillance camera captures audio-video data and sends them to both the storage and
the human tracker unit. When the human tracker detects a person, it traces the person and extracts the
moving trajectory.
If the person gets out of the visual scope of the first IP camera but stays in the area protected by the
city video surveillance system, another IP camera from this system can take over the control and keep
capturing A/V data of the corresponding person.
If the person gets out of the protected area, for example by entering a commercial centre, then
the city system checks whether this commercial centre is also equipped with a video surveillance
system. Should this be the case, the city video surveillance system sets up a communication with the
commercial centre video surveillance system in order to allow another IP camera from the commercial
centre video surveillance system to keep capturing A/V data of the corresponding person.
In both cases, the specific descriptors (e.g., moving trajectory information, appearance information,
media locations of detected moments) can be extracted and sent to the storage.
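The camera handover described in this use case can be sketched in Python as follows. This is a minimal illustration under stated assumptions: each camera's visual scope is modelled as an axis-aligned rectangle, the person's position is already known from the human tracker, and the names `Camera`, `select_camera` and `track` are hypothetical rather than taken from the standard.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Camera:
    """Hypothetical IP camera whose visual scope is an axis-aligned rectangle."""
    camera_id: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def covers(self, point: Point) -> bool:
        x, y = point
        return self.x_min <= x <= self.x_max and self.y_min <= y <= self.y_max


def select_camera(point: Point,
                  city_cameras: Sequence[Camera],
                  partner_cameras: Sequence[Camera],
                  current: Optional[Camera] = None) -> Optional[Camera]:
    """Keep the current camera while it still sees the person; otherwise hand over,
    trying the city system first and the partner (e.g. commercial centre) system second."""
    if current is not None and current.covers(point):
        return current
    for camera in list(city_cameras) + list(partner_cameras):
        if camera.covers(point):
            return camera
    return None  # the person is outside every known visual scope


def track(trajectory: Sequence[Point],
          city_cameras: Sequence[Camera],
          partner_cameras: Sequence[Camera]) -> List[Optional[str]]:
    """Return, for each trajectory point, the identifier of the capturing camera."""
    current: Optional[Camera] = None
    assignments: List[Optional[str]] = []
    for point in trajectory:
        current = select_camera(point, city_cameras, partner_cameras, current)
        assignments.append(current.camera_id if current else None)
    return assignments
```

Searching the city cameras before the partner cameras reflects the order of the narrative above, where the city system only contacts the commercial centre system once the person has left the protected area.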
5.2.3 Dangerous region surveillance system
IoMT can serve as a basis for developing intelligent alerting services providing information and/or
alerts when a person approaches danger zones, for accident prevention. For instance, Figure 2 illustrates
the case of a home (private) environment where a child plays (cf. Figure 2.(1)). Heterogeneous IoMT
data (e.g. video, depth, audio, temperature) are analyzed to automatically generate an alert if the child
approaches the dangerous area around a hot oven (cf. Figure 2.(2)).
(1) illustrates the case of a private environment
(2) illustrates the usage of IoMT for preventing dangerous situations
Figure 2 — Example use-case of dangerous area surveillance system operating in a private
(home) environment
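A minimal Python sketch of the alerting rule in this use case is given below. It assumes that the person's floor position has already been estimated from the video and depth data and that a temperature reading is available for the zone. The names `DangerZone` and `should_alert`, the circular zone model, the 0.5 m margin and the temperature threshold are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class DangerZone:
    """Hypothetical circular danger zone, e.g. the area around an oven."""
    name: str
    x_m: float          # zone centre, metres
    y_m: float
    radius_m: float     # radius of the zone itself
    hot_above_c: float  # temperature from which the zone is considered dangerous


def should_alert(person_xy: Tuple[float, float],
                 zone: DangerZone,
                 zone_temperature_c: float,
                 margin_m: float = 0.5) -> bool:
    """Alert when the person is within the zone plus a safety margin AND the zone is hot."""
    distance_m = math.hypot(person_xy[0] - zone.x_m, person_xy[1] - zone.y_m)
    is_close = distance_m <= zone.radius_m + margin_m
    is_hot = zone_temperature_c >= zone.hot_above_c
    return is_close and is_hot
```

Combining the position and temperature conditions is one simple way to use heterogeneous data: a child standing next to a cold oven does not trigger an alert in this sketch, while the same position next to a hot oven does.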
5.2.4 Intelligent firefighting with IP surveillance cameras
Figure 3 illustrates an example use-case of intelligent firefighting with IP surveillance cameras. In
this case, the fire station and the security manager can rapidly receive the fire/smoke detection alert,
thereby averting a potential fire hazard. Unlike conventional security systems, the outdoor scene
captured by intelligent IP surveillance cameras is immediately analysed, and the fire station is
automatically alerted to the fire/smoke incident based on the analysed results of the captured scene.
Figure 3 — Example use-case of intelligent firefighting
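The alerting step of this use case can be sketched as follows. The sketch assumes that a media analyser has already produced fire and smoke confidence scores for a captured scene. The names `SceneAnalysis` and `dispatch_alerts`, the score range and the 0.8 threshold are hypothetical choices for this example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class SceneAnalysis:
    """Hypothetical analysed data for one captured outdoor scene."""
    camera_id: str
    fire_score: float   # assumed confidence in [0, 1]
    smoke_score: float  # assumed confidence in [0, 1]


def dispatch_alerts(analysis: SceneAnalysis,
                    recipients: Dict[str, Callable[[str], None]],
                    threshold: float = 0.8) -> List[str]:
    """Notify every recipient when fire or smoke is detected; return who was notified."""
    if max(analysis.fire_score, analysis.smoke_score) < threshold:
        return []
    text = f"fire/smoke detected by {analysis.camera_id}"
    notified: List[str] = []
    for name, notify in recipients.items():
        notify(text)  # e.g. the fire station or the security manager
        notified.append(name)
    return notified
```

Passing the recipients as callables keeps the sketch independent of any particular transport, which this document does not prescribe.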
5.2.5 Automatic security alert and title generation system using time, GPS and visual
information
In the sustainable smart city of Seoul, IoMT cameras (smart CCTV) are deployed around the city. These
cameras are continuously capturing video (24 hours/7 d
...

FINAL DRAFT INTERNATIONAL STANDARD
ISO/IEC FDIS 23093-1
ISO/IEC JTC 1/SC 29
Secretariat: JISC
Voting begins on: 2021-11-16
Voting terminates on: 2022-01-11
Information technology — Internet of media things —
Part 1:
Architecture
Technologies de l'information — Internet des objets media —
Partie 1: L’architecture IoMT
RECIPIENTS OF THIS DRAFT ARE INVITED TO SUBMIT, WITH THEIR COMMENTS, NOTIFICATION OF ANY RELEVANT PATENT RIGHTS OF WHICH THEY ARE AWARE AND TO PROVIDE SUPPORTING DOCUMENTATION.
IN ADDITION TO THEIR EVALUATION AS BEING ACCEPTABLE FOR INDUSTRIAL, TECHNOLOGICAL, COMMERCIAL AND USER PURPOSES, DRAFT INTERNATIONAL STANDARDS MAY ON OCCASION HAVE TO BE CONSIDERED IN THE LIGHT OF THEIR POTENTIAL TO BECOME STANDARDS TO WHICH REFERENCE MAY BE MADE IN NATIONAL REGULATIONS.
Reference number
ISO/IEC FDIS 23093-1:2021(E)
© ISO/IEC 2021

Introduction
The ISO/IEC 23093 series provides an architecture and specifies application programming interfaces
(APIs) and compressed representation of data flowing between media things.
The APIs for the media things facilitate discovering other media things in the network, connecting to
them, and efficiently exchanging data between media things. The APIs also provide means for supporting
transaction tokens in order to access valuable functionalities, resources, and data from media things.
Media-thing-related information consists of characteristics and discovery data, setup information
from a system designer, raw and processed sensed data, and actuation information. The ISO/IEC 23093
series specifies data formats of input and output for media sensors, media actuators, media storages,
media analysers, etc. Sensed data from media sensors can be processed by media analysers to produce
analysed data, and the media analysers can be cascaded in order to extract semantic information.
This document does not specify how the process of sensing and analysing is carried out but specifies
the interfaces between the media things. This document describes the architecture of systems for the
internet of media things.
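As an illustration of the cascading described above, the following minimal Python sketch chains two toy analysers so that the second consumes the output of the first. The names `MediaData`, `cascade`, `detect_motion` and `label_event`, the payload keys and the 0.5 threshold are all hypothetical; the ISO/IEC 23093 series defines its own data formats, which are not reproduced here.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class MediaData:
    """Hypothetical container for data flowing between media things."""
    kind: str  # e.g. "sensed" or "analysed"
    payload: Dict[str, object] = field(default_factory=dict)


Analyser = Callable[[MediaData], MediaData]


def cascade(analysers: List[Analyser]) -> Analyser:
    """Chain analysers so that each one consumes the previous one's output."""
    def run(data: MediaData) -> MediaData:
        for analyse in analysers:
            data = analyse(data)
        return data
    return run


def detect_motion(data: MediaData) -> MediaData:
    # Toy rule (assumption): a frame difference above 0.5 counts as motion.
    diffs = data.payload["frame_diffs"]
    return MediaData("analysed", {"motion": any(d > 0.5 for d in diffs)})


def label_event(data: MediaData) -> MediaData:
    # Second stage: turn the low-level result into a semantic label.
    label = "activity" if data.payload["motion"] else "idle"
    return MediaData("analysed", {"event": label})


pipeline = cascade([detect_motion, label_event])
result = pipeline(MediaData("sensed", {"frame_diffs": [0.1, 0.7, 0.2]}))
```

Only the data passed between the stages is modelled here, in keeping with the focus on interfaces rather than on how sensing and analysis are carried out.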
This second edition cancels and replaces the first edition (ISO/IEC 23093-1:2020), which has been
technically revised. The main changes compared to the previous edition are as follows:
— Complementary use cases
— Sequence diagrams and mission state diagrams for the use-case description in order to enhance the
readability of the document
FINAL DRAFT INTERNATIONAL STANDARD ISO/IEC FDIS 23093-1:2021(E)
Information technology — Internet of media things —
Part 1:
Architecture
1 Scope
This document describes the architecture of systems for the internet of media things.
2 Normative references
There are no normative references in this document.
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
ISO and IEC maintain terminological databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https://www.iso.org/obp
— IEC Electropedia: available at https://www.electropedia.org/
3.1 Internet of media things terms
3.1.1
audio
anything related to sound in terms of receiving, transmitting or reproducing it or of its specific
frequency
3.1.2
camera
special form of an image capture device (3.1.6) that senses and captures photo-optical signals
3.1.3
display
visual representation of the output of an electronic device or the portion of an electronic device that
shows this representation, as a screen, lens or reticle
3.1.4
gesture
movement or position of the hand, arm, body, head or face that is expressive of an idea, opinion, emotion,
etc.
3.1.5
haptics
input or output device that senses or actuates the body's movements by means of physical contact with
the user
3.1.6
image capture device
device which is capable of sensing and capturing acoustic, electrical or photo-optical signals of a
physical entity that can be converted into an image
3.1.7
internet of media things
IoMT
special subset of IoT (3.2.9) whose main functionalities are related to media processing
3.1.8
IoMT device
IoT (3.2.9) device that contains more than one MThing (3.1.12)
3.1.9
IoMT system
MSystem
IoT (3.2.9) system whose main functionality is related to media processing
3.1.10
loudspeaker
electroacoustic device, connected as a component in an audio system, generating audible acoustic waves
3.1.11
media
data that can be rendered, including audio, video, text, graphics, images, haptic and tactile information
Note 1 to entry: These data can be timed or non-timed.
3.1.12
media thing
MThing
thing (3.2.20) capable of sensing, acquiring, actuating, or processing of media or metadata
3.1.13
media token
virtual token for accessing functionalities, resources and data of media things
3.1.14
microphone
entity capable of capturing acoustic waves and transforming them into changes in electric current or voltage,
used in recording or transmitting sound
3.1.15
media wearable
MWearable
MThing (3.1.12) intended to be located near, on or in an organism
3.1.16
motion
action or process of changing place or position
3.1.17
natural user interface
NUI
system for human-computer interaction that the user operates through intuitive actions related to
natural, everyday human behaviour
3.1.18
presentation
act of producing human recognizable output of rendered media
3.2 Internet of things terms
3.2.1
actuator
component which conveys digital information to effect a change of some property of a physical entity
3.2.2
capability
characteristic or property of an entity that can be used to describe its state, appearance or other aspects
EXAMPLE An entity type, address information, telephone number, a privilege, a MAC address, a domain
name are possible attributes, see ISO/IEC 24760-1.
3.2.3
component
modular, deployable and replaceable part of a system that encapsulates implementations
Note 1 to entry: A component may expose or use interfaces (local or on a network) to interact with other
entities, see ISO 19104. A component which exposes or uses network interfaces is called an endpoint.
3.2.4
digital entity
any computational or data element of an IT-based system
Note 1 to entry: It may exist as a service based in a data centre or cloud, or a network element or a gateway.
3.2.5
discovery
service to find unknown resources/entities/services based on a rough specification of the desired
result
Note 1 to entry: It may be utilized by a human or another service; credentials for authorization are considered
when executing the discovery, see ISO/IEC 30141.
3.2.6
entity
anything (physical or non-physical) having a distinct existence
3.2.7
identifier
information that unambiguously distinguishes one entity (3.2.6) from another one in a given identity
context
3.2.8
identity
characteristics determining who or what a person or thing is
3.2.9
internet of things
IoT
infrastructure of interconnected objects, people, systems and information resources together with
intelligent services to allow them to process information of the physical and the virtual world and to
react
3.2.10
interface
shared boundary between two functional components, defined by various characteristics pertaining
to the functions, physical interconnections, signal exchanges, and other characteristics, as appropriate
Note 1 to entry: See ISO/IEC 13066-1.
3.2.11
IoT system
system that is comprised of functions that provide the system the capabilities for identification, sensing,
actuation, communication and management, and applications and services to a user
Note 1 to entry: See Bahga and Madisetti [1].
3.2.12
network
entity that connects endpoints, sources to destinations, and may itself act as a value-added element in
the IoT system or services
3.2.13
process
procedure to carry out operations on data
3.2.14
physical entity
thing (3.2.20) that is discrete, identifiable and observable, and that has material existence in real world
3.2.15
reference architecture
description of common features, common vocabulary, guidelines, interrelations and interactions among
the entities, and a template for an IoT architecture
3.2.16
resource
any element of a data processing system needed to perform required operations
Note 1 to entry: See ISO/IEC 2382.
3.2.17
sensor
device that observes and measures a physical property of a natural phenomenon or a human induced
process and converts that measurement into a signal
Note 1 to entry: A signal can be electrical, chemical, etc., see ISO/IEC 29182-2.
3.2.18
service
distinct part of the functionality that is provided by an entity through interfaces
3.2.19
storage
capacity of a digital entity to store information subject to recall or the components of a digital entity in
which such information is stored
3.2.20
thing
any entity that can communicate with other entities
3.2.21
user
human or any digital entity that is interested in interacting with a particular physical object
3.2.22
visual
any object perceptible by the sense of sight
4 Architecture
The global IoMT architecture is presented in Figure 1, which identifies a set of interfaces, protocols and
associated media-related information representations related to:
— user commands (setup information) between a system manager and an MThing, with reference to
interface 1;
— user commands (setup information) forwarded by an MThing to another MThing, possibly in a
modified form (e.g., subset of 1), with reference to interface 1’;
— sensed data (raw, or processed by compression or semantic extraction) and actuation information,
with reference to interface 2;
— wrapped interface 2 (e.g., for transmission), with reference to interface 2’;
— MThing characteristics and discovery, with reference to interface 3.
Figure 1 — IoMT architecture
This IoMT architecture can be mapped to the IoT reference architecture, see ISO/IEC 30141, as shown
in Annex A.
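The interface list above can be pictured as a small set of message kinds. The sketch below is an illustration only and not the API of the ISO/IEC 23093 series: the `Interface` enum, the `Message` envelope and the helpers `forward_setup` and `wrap` are hypothetical names, and the field names in the example bodies are assumptions.

```python
from dataclasses import dataclass
from enum import Enum


class Interface(Enum):
    SETUP = "1"             # user commands from a system manager to an MThing
    SETUP_FORWARDED = "1'"  # user commands forwarded from one MThing to another
    DATA = "2"              # sensed data and actuation information
    DATA_WRAPPED = "2'"     # wrapped interface 2, e.g. for transmission
    DISCOVERY = "3"         # MThing characteristics, discovery


@dataclass(frozen=True)
class Message:
    """Hypothetical envelope tagging a payload with the interface it uses."""
    interface: Interface
    sender: str
    receiver: str
    body: dict


def forward_setup(msg: Message, next_mthing: str, keep: set) -> Message:
    """Forward a setup command over interface 1', keeping a subset of its fields."""
    if msg.interface is not Interface.SETUP:
        raise ValueError("only interface 1 messages can be forwarded as 1'")
    subset = {k: v for k, v in msg.body.items() if k in keep}
    return Message(Interface.SETUP_FORWARDED, msg.receiver, next_mthing, subset)


def wrap(msg: Message) -> Message:
    """Wrap an interface 2 message for transmission (interface 2')."""
    if msg.interface is not Interface.DATA:
        raise ValueError("only interface 2 messages can be wrapped")
    return Message(Interface.DATA_WRAPPED, msg.sender, msg.receiver,
                   {"wrapped": msg.body})


setup = Message(Interface.SETUP, "manager", "camera-1",
                {"fps": 30, "resolution": "1080p", "owner": "ops"})
forwarded = forward_setup(setup, "analyser-1", keep={"fps"})
wrapped = wrap(Message(Interface.DATA, "camera-1", "storage-1", {"frame": 7}))
```

Here `forwarded` carries only the `fps` field, mirroring the "subset of 1" case, and `wrapped` nests the original interface 2 body unchanged.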
5 Use cases
5.1 General
MPEG identified 31 use-cases for IoMT; they are structured in the following five main categories:
a) Smart spaces: Monitoring and control with network of audio-video cameras (see 5.2)
— human tracking with multiple network cameras
— dangerous region surveillance system
— intelligent firefighting with IP surveillance cameras
— automatic security alert generation system using time, GPS and visual information
— networked digital signs for customized advertisement
— digital signage and second screen use
— self-adaptive quality of experience for multimedia applications
— ultra-wide viewing video composition
— face recognition to evoke sensorial actuations
— automatic video clip generation by detecting event information
— temporal synchronization of multiple videos for creating 360° or multiple view video
— intelligent similar content recommendations using information from IoMT devices
— safety equipment detection in construction sites
b) Smart spaces: Multi-modal guided navigation (see 5.3)
— blind person assistant system
— elderly people assistance with consecutive vibration haptic devices
— personalized navigation by visual communication
— personalized tourist navigation with natural language functionalities
— smart identifier: face recognition on smart glasses
— smart advertisement: QR code recognition on smart glasses
c) Smart audio/video environments in smart cities (see 5.4)
— smart factory: car maintenance assistance A/V system using smart glasses
— smart museum: augmented visit museum using smart glasses
— smart house: light control, vibrating subtitle, olfaction media content consumption
— smart car: head-light adjustment and speed monitoring to provide automatic volume control
d) Smart multi-modal collaborative health (see 5.5)
— increasing patient autonomy by remote control of left-ventricular assisted devices
— diabetic coma prevention by monitoring networks of in-body/near body sensors
— enhanced physical activity with smart fabrics networks
— medical assistance with smart glasses
— managing healthcare information for smart glass
— indoor air quality prediction
e) Blockchain usage for IoMT transactions authentication and monetizing (see 5.6)
— reward function in IoMT by using blockchains
— content authentication with blockchains
5.2 Smart spaces: Monitoring and control with network of audio-video cameras
5.2.1 General
The large variety of sensors, actuators, displays and computational elements acting in our day-to-day
professional and private spaces in order to provide us with better and more easily accessible services leads to
13 use cases of interest for IoMT, mainly related to the processing of video information.
5.2.2 Human tracking with multiple network cameras
As urban growth is today accompanied by an increase in crime rates (e.g., theft, vandalism), many
local authorities consider surveillance systems as a possible tool to fight this phenomenon. A city video
surveillance system is an IoMT system that includes a set of IP surveillance cameras, a storage unit and
a human tracker unit.
A particular IP surveillance camera captures audio-video data and sends them to both the storage and
the human tracker unit. When the human tracker detects a person, it traces the person and extracts the
moving trajectory.
If the person gets out of the visual scope of the first IP camera but stays in the area protected by the
city video surveillance system, another IP camera from this system can take over the control and keep
capturing A/V data of the corresponding person.
If the person gets out of the protected area, for example by entering a commercial centre, then
the city system checks whether this commercial centre is also equipped with a video surveillance
system. Should this be the case, the city video surveillance system sets up a communication with the
commercial centre video surveillance system in order to allow another IP camera from the commercial
centre video surveillance system to keep capturing A/V data of the corresponding person.
In both cases, the specific descriptors (e.g., moving trajectory information, appearance information,
media locations of detected moments) can be extracted and sent to the storage.
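The camera handover described in this use case can be sketched as follows. This is a simplified illustration under stated assumptions: each camera's visual scope is modelled as an axis-aligned rectangle, positions are 2D points, and `Camera`, `select_camera` and `track` are hypothetical names rather than anything specified by this document.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Point = Tuple[float, float]


@dataclass
class Camera:
    """Hypothetical IP camera whose visual scope is a rectangle (assumption)."""
    name: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float

    def sees(self, p: Point) -> bool:
        return self.x_min <= p[0] <= self.x_max and self.y_min <= p[1] <= self.y_max


def select_camera(current: Optional[Camera], cameras: List[Camera],
                  p: Point) -> Optional[Camera]:
    """Keep the current camera while it sees the person; otherwise hand over."""
    if current is not None and current.sees(p):
        return current
    for cam in cameras:
        if cam.sees(p):
            return cam
    return None  # the person has left the area covered by these cameras


def track(cameras: List[Camera], positions: List[Point]):
    """Return the moving trajectory annotated with the capturing camera."""
    trajectory = []
    current: Optional[Camera] = None
    for p in positions:
        current = select_camera(current, cameras, p)
        trajectory.append((p, current.name if current else None))
    return trajectory


city = [Camera("cam_a", 0, 0, 10, 10), Camera("cam_b", 8, 0, 20, 10)]
trajectory = track(city, [(2, 1), (9, 1), (12, 1), (25, 1)])
```

In this example the person is followed by `cam_a`, handed over to `cam_b`, and finally leaves the covered area. A handover to a second surveillance system could be modelled by extending the camera list once communication with that system has been set up.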
5.2.3 Dangerous region surveillance system
IoMT can serve as a basis for developing intelligent alerting services providing information and/or
alerts when a person approaches danger zones, for accident prevention. For instance, Figure 2 illustrates
the case of a home (private) environment where a child plays (see Figure 2 (1)). Heterogeneous IoMT
data (e.g. video, depth, audio, temperature) are analysed to automatically generate an alert if the child
approaches the dangerous area around a hot oven (see Figure 2 (2)).
(1) illustrates the case of a private environment
(2) illustrates the usage of IoMT for preventing dangerous situations
Figure 2 — Example use-case of dangerous area surveillance system operating in a private
(home) environment
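A minimal sketch of such an alerting rule is given below. It assumes that upstream analysers already provide the person's position and the oven's surface temperature; the `DangerZone` type, the `check_alert` function, the 1.5 m radius and the 60 °C threshold are hypothetical values chosen for illustration only.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class DangerZone:
    """Hypothetical circular danger zone centred on a hazardous object."""
    label: str
    x: float
    y: float
    radius_m: float  # assumed alert radius in metres


def check_alert(zone: DangerZone, person_xy: Tuple[float, float],
                surface_temp_c: float,
                hot_threshold_c: float = 60.0) -> Optional[str]:
    """Return an alert text if the person is inside the zone while it is hot."""
    distance = math.hypot(person_xy[0] - zone.x, person_xy[1] - zone.y)
    if surface_temp_c >= hot_threshold_c and distance <= zone.radius_m:
        return f"ALERT: person within {distance:.1f} m of {zone.label}"
    return None


oven = DangerZone("hot oven", x=0.0, y=0.0, radius_m=1.5)
near_hot = check_alert(oven, (1.0, 0.0), surface_temp_c=180.0)
far_hot = check_alert(oven, (3.0, 4.0), surface_temp_c=180.0)
near_cold = check_alert(oven, (1.0, 0.0), surface_temp_c=25.0)
```

Combining the position (from video or depth data) with the temperature reading reflects the heterogeneous inputs mentioned above; no alert is raised when the oven is cold or the person is far away.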
5.2.4 Intelligent firefighting with IP surveillance cameras
Figure 3 illustrates an example use-case of intelligent firefighting with IP surveillance cameras. In
this case, the fire station and the security manager can rapidly receive the fire/smoke detection alert,
thereby averting a potential fire hazard. Unlike conventional security systems, the outdoor scene
captured by intelligent IP surveillance cameras is immediately analysed, and the fire station is
automatically alerted to the fire/smoke incident based on the analysed results of the captured scene.
Figure 3 — Example use-case of intelligent firefighting
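The alerting flow of this use case can be sketched as a filter over analysed results followed by a notification to each recipient. The `Detection` type, the `dispatch_alerts` function, the label names and the 0.8 confidence threshold are assumptions made for this illustration; the actual analysis and message formats are outside the scope of this sketch.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Detection:
    """Hypothetical analysed result produced from a captured scene."""
    camera_id: str
    label: str         # e.g. "fire", "smoke", "person"
    confidence: float  # 0.0 to 1.0


def dispatch_alerts(detections: List[Detection],
                    recipients: Dict[str, Callable[[str], None]],
                    min_confidence: float = 0.8) -> List[str]:
    """Notify every recipient of each confident fire/smoke detection.

    Returns the names of the recipients notified, in order.
    """
    notified = []
    for d in detections:
        if d.label in {"fire", "smoke"} and d.confidence >= min_confidence:
            text = f"{d.label} detected by {d.camera_id}"
            for name, notify in recipients.items():
                notify(text)
                notified.append(name)
    return notified


fire_station_inbox: List[str] = []
security_manager_inbox: List[str] = []
notified = dispatch_alerts(
    [Detection("cam-3", "smoke", 0.93),
     Detection("cam-3", "person", 0.99),
     Detection("cam-4", "fire", 0.40)],
    {"fire_station": fire_station_inbox.append,
     "security_manager": security_manager_inbox.append},
)
```

Only the confident smoke detection triggers an alert here, and both the fire station and the security manager receive it.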
5.2.5 Automatic security alert and title generation system using time, GPS and visual
information
In the sustainable smart city of Seoul, IoMT cameras (smart CCTV) are deployed around the city. These
c
...
