ISO/IEC 23773-3:2024
Information technology — User interfaces for automatic simultaneous interpretation systems — Part 3: System architecture
This document provides a description of a system architecture for real-time automatic simultaneous interpretation systems for spontaneous speech designed to interoperate among different natural languages. While traditional speech-to-speech translation addresses the functional equivalent of consecutive interpretation, this document focuses on the functional equivalent of simultaneous interpretation. This document does not cover sign language interpretation.
Technologies de l'information — Interfaces utilisateur pour les systèmes d'interprétation simultanée automatique — Partie 3: Architecture du système
Standards Content (Sample)
International Standard
ISO/IEC 23773-3
First edition
2024-07
Information technology — User interfaces for automatic simultaneous interpretation systems —
Part 3: System architecture
Technologies de l'information — Interfaces utilisateur pour les systèmes d'interprétation simultanée automatique —
Partie 3: Architecture du système
Reference number
© ISO/IEC 2024
All rights reserved. Unless otherwise specified, or required in the context of its implementation, no part of this publication may
be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or posting on
the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below
or ISO’s member body in the country of the requester.
ISO copyright office
CP 401 • Ch. de Blandonnet 8
CH-1214 Vernier, Geneva
Phone: +41 22 749 01 11
Email: copyright@iso.org
Website: www.iso.org
Published in Switzerland
© ISO/IEC 2024 – All rights reserved
Contents
Foreword
Introduction
1 Scope
2 Normative references
3 Terms and definitions
4 Abbreviated terms
5 Architecture of simultaneous interpretation system
5.1 General
5.2 Continuous speech recognition module
5.2.1 General
5.2.2 Feature extraction/noise processing
5.2.3 Utterance adaptation
5.2.4 Multiple decoder
5.2.5 Interfaces of submodules
5.3 Interpretation unit extraction module
5.3.1 General
5.3.2 Input buffer manager
5.3.3 Unit separating morpheme analyser
5.3.4 Voice unit analyser
5.3.5 Interpretation unit former
5.3.6 Interfaces of submodules
5.4 Real-time simultaneous interpretation module
5.4.1 General
5.4.2 Rule-based translation
5.4.3 Machine learning (ML)-based translation
5.4.4 Context management
5.4.5 Hybrid translation (rule+DNN)
5.4.6 Interfaces of submodules
5.5 Incremental knowledge learning module
5.5.1 General
5.5.2 Error-based enhancement
5.5.3 Knowledge extraction/learning
5.5.4 Knowledge support and tools
5.5.5 Knowledge adaptation
5.5.6 Interfaces of submodules
5.6 Presentation of translation results
5.6.1 General
5.6.2 Personalized speech synthesis
5.6.3 Text presentation on the screen
5.6.4 Error correction for translation results
5.6.5 Interfaces of submodules
Bibliography
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are
members of ISO or IEC participate in the development of International Standards through technical
committees established by the respective organization to deal with particular fields of technical activity.
ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations,
governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.
The procedures used to develop this document and those intended for its further maintenance are described
in the ISO/IEC Directives, Part 1. In particular, the different approval criteria needed for the different types
of document should be noted. This document was drafted in accordance with the editorial rules of the ISO/
IEC Directives, Part 2 (see www.iso.org/directives or www.iec.ch/members_experts/refdocs).
ISO and IEC draw attention to the possibility that the implementation of this document may involve the
use of (a) patent(s). ISO and IEC take no position concerning the evidence, validity or applicability of any
claimed patent rights in respect thereof. As of the date of publication of this document, ISO and IEC had not
received notice of (a) patent(s) which may be required to implement this document. However, implementers
are cautioned that this may not represent the latest information, which may be obtained from the patent
database available at www.iso.org/patents and https://patents.iec.ch. ISO and IEC shall not be held
responsible for identifying any or all such patent rights.
Any trade name used in this document is information given for the convenience of users and does not
constitute an endorsement.
For an explanation of the voluntary nature of standards, the meaning of ISO specific terms and expressions
related to conformity assessment, as well as information about ISO's adherence to the World Trade
Organization (WTO) principles in the Technical Barriers to Trade (TBT) see www.iso.org/iso/foreword.html.
In the IEC, see www.iec.ch/understanding-standards.
This document was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology,
Subcommittee SC 35, User interfaces.
A list of all parts in the ISO/IEC 23773 series can be found on the ISO and IEC websites.
Any feedback or questions on this document should be directed to the user’s national standards
body. A complete listing of these bodies can be found at www.iso.org/members.html and
www.iec.ch/national-committees.
Introduction
Communication between users of different languages is a global trend that is increasing. Real-time, automatic
simultaneous interpretation is needed for different applications such as video calls, live lecture translation
and wearable translation devices. Market demands for real-time automatic simultaneous interpretation of
free-style continuous utterances in the travel sector, global event management, as well as for phone calls,
lectures or meetings, are also increasing. A standardized user interface (UI) for automatic simultaneous
interpretation systems fulfils these different needs for communication.
ISO/IEC 23773-1 provides a general description of an automatic simultaneous interpretation system
designed to interoperate among different natural languages for spontaneous speech and text.
While traditional speech-to-speech translation described in ISO/IEC 20382-1 and ISO/IEC 20382-2 addresses
the functional equivalent of consecutive interpretation, this document focuses on the functional equivalent
of simultaneous interpretation.
ISO/IEC 23773-2 provides the requirements and functional components for the UI of automatic simultaneous
interpretation systems.
This document provides a reference architecture for automatic simultaneous interpretation systems
including functional modules and communication interfaces in a high-level approach.
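As a rough illustration only, the module names listed under Clause 5 in the contents can be read as stages of a streaming pipeline. The Python sketch below is not taken from the standard: the function names, the comma-based unit boundary, the toy "translation" function and the ordering of the stages (assumed from the order of the subclauses) are all hypothetical. The incremental knowledge learning module (5.5) is left out. The sketch shows only the general idea behind simultaneous, as opposed to consecutive, interpretation: output is produced per interpretation unit rather than after the whole utterance has ended.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator, List


@dataclass
class InterpretationUnit:
    """Hypothetical container for a chunk of source text translated on its own."""
    text: str


def recognize(audio_chunks: Iterable[str]) -> Iterator[str]:
    # Stand-in for the continuous speech recognition module (5.2).
    # Assumption: each "audio chunk" is already a recognized word.
    for chunk in audio_chunks:
        yield chunk


def extract_units(words: Iterable[str], boundary: str = ",") -> Iterator[InterpretationUnit]:
    # Stand-in for the interpretation unit extraction module (5.3).
    # Assumption: a unit ends at a word carrying the boundary marker.
    buffer: List[str] = []
    for word in words:
        buffer.append(word.rstrip(boundary))
        if word.endswith(boundary):
            yield InterpretationUnit(" ".join(buffer))
            buffer = []
    if buffer:
        # Flush whatever remains when the input stream ends.
        yield InterpretationUnit(" ".join(buffer))


def interpret(units: Iterable[InterpretationUnit],
              translate: Callable[[str], str]) -> Iterator[str]:
    # Stand-in for the real-time simultaneous interpretation module (5.4):
    # each unit is translated as soon as it is available.
    for unit in units:
        yield translate(unit.text)


def present(translations: Iterable[str]) -> List[str]:
    # Stand-in for the presentation of translation results (5.6).
    return list(translations)


words = ["good", "morning,", "welcome", "to", "the", "meeting"]
# str.upper is a placeholder for a real translation function.
result = present(interpret(extract_units(recognize(words)), str.upper))
```

Because every stage is a generator, the first unit can be translated and presented before the speaker has finished the sentence; a consecutive design would instead collect all words before calling the translation function once.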
International Standard ISO/IEC 23773-3:2024(en)
Information technology — User interfaces for automatic
simultaneous interpretation systems —
Part 3:
System architecture
1 Scope
This document provides a description of a system architecture for real-time automatic simultaneous
interpretation systems for spontaneous speech designed to interoperate among different natural languages.
While traditional speech-to-speech translation addresses the functional equivalent of consecutive
interpretation, this document focuses on the functional equivalent of simultaneous interpretation.
This document does not cover sign language interpretation.
2 Normative references
There are no normative references in this document.
3 Terms and definitions
For the purposes of this document, the following terms and definitions apply.
ISO and IEC maintain terminology databases for use in standardization at the following addresses:
— ISO Online browsing platform: available at https://www.iso.org/obp
— IEC Electropedia: available at https://www.electropedia.org/
3.1
incremental knowledge learning
knowledge accumulated through learning from p
...
ISO/IEC PRF 23773-3:2024(E)
ISO/IEC JTC 1/SC 35/WG 5
Secretariat: AFNOR
Date: 2024-04-30