EN 62571:2011
(Main)Digital audiobook file format and player requirements
Digital audiobook file format and player requirements
IEC 62571:2011(E) defines requirements and provides recommendations to publishers, software developers, content providers, and hardware manufacturers for the data structure, usability requirements, playback systems and delivery systems for audiobooks in digital file format. It should be noted that throughout this International Standard, the term audiobook is defined as any audio file or collection of audio files of primarily spoken word content that are played in a linear or specified order. Therefore, spoken word audio with occasional music, a narration of newspaper articles, or other similar spoken word audio is assimilated to audiobooks in this standard.
Digitales Dateiformat und Anforderungen an Abspielgeräte von Hörbüchern
Exigences relatives au format de fichier de livre audio et aux lecteurs
La CEI 62571:2011 définit les exigences et formule des recommandations à l'intention des éditeurs, développeurs de logiciels, fournisseurs de contenu et fabricants de matériel sur la structure de données, les exigences en facilité d'utilisation, les systèmes de lecture et les systèmes de remise des livres audio au format de fichier numérique. Il convient de noter que dans l'ensemble de cette Norme internationale, le terme livre audio désigne un fichier audio ou un assemblage de fichiers audio contenant surtout du langage parlé, fichiers lus ou linéairement ou dans un ordre spécifié. Donc, de l'audio fait de mots parlés avec de la musique de temps en temps, de narration d'articles de journaux, et d'autres discours de ce genre, est dans cette norme considéré comme livre audio. La présente version bilingue, publiée en 2011-07, correspond à la version anglaise.
Zahteve za format datoteke za govoreče knjige in predvajalnike (IEC 62571:2011)
Ta mednarodni standard določa zahteve in podaja priporočila za založnike, razvijalce programske opreme, ponudnike vsebine in proizvajalce strojne opreme za podatkovno strukturo, zahteve glede uporabnosti, sisteme predvajanja in sisteme prenosa za govoreče knjige v digitalnem formatu datotek. Upoštevati je treba, da je v tem mednarodnem standardu izraz govoreča knjiga opredeljen kot vsaka avdiodatoteka ali zbirka avdiodatotek primarne govorne vsebine, ki so predvajane v linearnem ali določenem vrstnem redu. Potemtakem so avdiogovor z občasno glasbo, naracija časopisnega članka ali drugi podobni avdiogovori izenačeni z govorečimi knjigami v tem standardu.
General Information
- Status
- Published
- Publication Date
- 05-May-2011
- Withdrawal Date
- 02-May-2014
- Technical Committee
- CLC/TC 100X - Audio, video and multimedia systems and equipment and related sub-systems
- Drafting Committee
- IEC/TC 100 - IEC_TC_100
- Parallel Committee
- IEC/TC 100 - IEC_TC_100
- Current Stage
- 6060 - Document made available - Publishing
- Start Date
- 06-May-2011
- Completion Date
- 06-May-2011
Overview
EN 62571:2011 (IEC 62571:2011) specifies a standardized digital audiobook file format and player requirements. It provides requirements and recommendations for the data structure, usability, playback systems and delivery systems for audiobooks in digital file format. The standard defines “audiobook” as any primarily spoken‑word audio file or collection played in a linear or specified order, including narrated articles or spoken‑word content with occasional music. EN 62571:2011 is the European adoption of IEC 62571:2011 and is intended to improve interoperability, metadata consistency and playback behavior across platforms.
Key topics and technical requirements
- File structure and container model: detailed components such as MultiAudio extensions, audiobook extendable structures (AB_ExtStruct), fixed and variable chunks (AB_FixedChunk, AB_VariableChunk), and extra data sections.
- Header and indexing: AB_Header, playlist indexes and TOC (table of contents) definitions to enable deterministic navigation and chapter/track mapping.
- Book and level semantics: AB_Book and AB_Level definitions including offsets to title, author, narrator, and hierarchical level navigation (parent/previous/next references).
- Timing and playback metadata: precise time offsets, start/duration fields and TOC_TrackEntry indexes to support accurate seeking and synchronized playback.
- Identifiers and metadata: support for UUID, ISBN/ISSN/DOI, shelf/index numbers and playlist content location identifiers; structured use of Dublin Core metadata and MPV audiobook profiles.
- Profiles and compatibility: MPV (multimedia profile) audiobook profiles, XML schema namespace identifiers and optional use of MPV‑Music playlists for cross‑profile reuse.
- Text and flags: text format indicators and flags governing language, presentation and player behavior.
- Normative references: integration with standards like ISO 639‑1 (language codes), ISO 3166‑1 (country codes), ISO 9660 (CD-ROM file structure) and ISO/IEC 11578 (RPC) where applicable.
Applications and practical value
- Enables publishers to package audiobooks with consistent metadata and navigation structures for multiple distribution channels.
- Helps software developers and streaming platforms implement interoperable playback, seek, and bookshelf features.
- Assists hardware manufacturers (dedicated audiobook players, infotainment systems) to ensure consistent user experience and accessibility.
- Supports libraries, distributors and accessibility services by standardizing identifiers, metadata (Dublin Core) and TOC for cataloging, search and assistive playback functions.
Related standards
- IEC 62571:2011 (international text)
- EN 62571:2011 (CENELEC endorsed European version)
- ISO 639‑1 (language codes), ISO 3166‑1 (country codes), ISO 9660, ISO/IEC 11578
- Dublin Core metadata guidelines and MPV multimedia profile specifications (referenced for profile and metadata usage)
By adopting EN/IEC 62571, organizations working with digital audiobooks gain clearer guidance for packaging, metadata, navigation and playback interoperability-critical for a consistent user experience across devices and platforms.
Frequently Asked Questions
EN 62571:2011 is a standard published by CLC. Its full title is "Digital audiobook file format and player requirements". This standard covers: IEC 62571:2011(E) defines requirements and provides recommendations to publishers, software developers, content providers, and hardware manufacturers for the data structure, usability requirements, playback systems and delivery systems for audiobooks in digital file format. It should be noted that throughout this International Standard, the term audiobook is defined as any audio file or collection of audio files of primarily spoken word content that are played in a linear or specified order. Therefore, spoken word audio with occasional music, a narration of newspaper articles, or other similar spoken word audio is assimilated to audiobooks in this standard.
IEC 62571:2011(E) defines requirements and provides recommendations to publishers, software developers, content providers, and hardware manufacturers for the data structure, usability requirements, playback systems and delivery systems for audiobooks in digital file format. It should be noted that throughout this International Standard, the term audiobook is defined as any audio file or collection of audio files of primarily spoken word content that are played in a linear or specified order. Therefore, spoken word audio with occasional music, a narration of newspaper articles, or other similar spoken word audio is assimilated to audiobooks in this standard.
EN 62571:2011 is classified under the following ICS (International Classification for Standards) categories: 33.160.60 - Multimedia systems and teleconferencing equipment; 35.240.30 - IT applications in information, documentation and publishing. The ICS classification helps identify the subject area and facilitates finding related standards.
EN 62571:2011 is available in PDF format for immediate download after purchase. The document can be added to your cart and obtained through the secure checkout process. Digital delivery ensures instant access to the complete standard document.
Standards Content (Sample)
SLOVENSKI STANDARD
01-julij-2011
=DKWHYH]DIRUPDWGDWRWHNH]DJRYRUHþHNQMLJHLQSUHGYDMDOQLNH,(&
Digital audiobook file format and player requirements (IEC 62571:2011)
Digitales Dateiformat und Anforderungen an Abspielgeräte von Hörbüchern (IEC
62571:2011)
Exigences relatives au format de fichier de livre audio et aux lecteurs (CEI 62571:2011)
Ta slovenski standard je istoveten z: EN 62571:2011
ICS:
33.160.30 Avdio sistemi Audio systems
35.240.30 Uporabniške rešitve IT v IT applications in information,
informatiki, dokumentiranju in documentation and
založništvu publishing
2003-01.Slovenski inštitut za standardizacijo. Razmnoževanje celote ali delov tega standarda ni dovoljeno.
EUROPEAN STANDARD
EN 62571
NORME EUROPÉENNE
May 2011
EUROPÄISCHE NORM
ICS 33.160.60; 35.240.30
English version
Digital audiobook file format and player requirements
(IEC 62571:2011)
Exigences relatives au format de fichier de Digitales Dateiformat und Anforderungen
livre audio et aux lecteurs an Abspielgeräte von Hörbüchern
(CEI 62571:2011) (IEC 62571:2011)
This European Standard was approved by CENELEC on 2011-05-03. CENELEC members are bound to comply
with the CEN/CENELEC Internal Regulations which stipulate the conditions for giving this European Standard
the status of a national standard without any alteration.
Up-to-date lists and bibliographical references concerning such national standards may be obtained on
application to the Central Secretariat or to any CENELEC member.
This European Standard exists in three official versions (English, French, German). A version in any other
language made by translation under the responsibility of a CENELEC member into its own language and notified
to the Central Secretariat has the same status as the official versions.
CENELEC members are the national electrotechnical committees of Austria, Belgium, Bulgaria, Croatia, Cyprus,
the Czech Republic, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Italy,
Latvia, Lithuania, Luxembourg, Malta, the Netherlands, Norway, Poland, Portugal, Romania, Slovakia, Slovenia,
Spain, Sweden, Switzerland and the United Kingdom.
CENELEC
European Committee for Electrotechnical Standardization
Comité Européen de Normalisation Electrotechnique
Europäisches Komitee für Elektrotechnische Normung
Management Centre: Avenue Marnix 17, B - 1000 Brussels
© 2011 CENELEC - All rights of exploitation in any form and by any means reserved worldwide for CENELEC members.
Ref. No. EN 62571:2011 E
Foreword
The text of document 100/1543/CDV, future edition 1 of IEC 62571, prepared by technical area 10,
Multimedia e-publishing and e-book, of IEC TC 100, Audio, video and multimedia systems and
equipment, was submitted to the IEC-CENELEC parallel vote and was approved by CENELEC as
EN 62571 on 2011-05-03.
Attention is drawn to the possibility that some of the elements of this document may be the subject of
patent rights. CEN and CENELEC shall not be held responsible for identifying any or all such patent
rights.
The following dates were fixed:
– latest date by which the EN has to be implemented
at national level by publication of an identical
national standard or by endorsement (dop) 2012-02-03
– latest date by which the national standards conflicting
with the EN have to be withdrawn (dow) 2014-05-03
Annex ZA has been added by CENELEC.
__________
Endorsement notice
The text of the International Standard IEC 62571:2011 was approved by CENELEC as a European
Standard without any modification.
In the official version, for Bibliography, the following note has to be added for the standard indicated:
ISO/IEC 11172-3:1993 NOTE Harmonized as EN ISO 11172-3:1995 (not modified).
__________
- 3 - EN 62571:2011
Annex ZA
(normative)
Normative references to international publications
with their corresponding European publications
The following referenced documents are indispensable for the application of this document. For dated
references, only the edition cited applies. For undated references, the latest edition of the referenced
document (including any amendments) applies.
NOTE When an international publication has been modified by common modifications, indicated by (mod), the relevant EN/HD
applies.
Publication Year Title EN/HD Year
ISO 639-1 - Codes for the representation of names of - -
languages -
Part 1: Alpha-2 code
ISO 3166-1 - Codes for the representation of names of EN ISO 3166-1 -
countries and their subdivisions -
Part 1: Country codes
ISO 9660 1988 Information processing - Volume and file - -
structure of CD-ROM for information
interchange
ISO/IEC 11578 1996 Information technology - Open Systems - -
Interconnection - Remote Procedure Call
(RPC)
IEC 62571 ®
Edition 1.0 2011-03
INTERNATIONAL
STANDARD
Digital audiobook file format and player requirements
INTERNATIONAL
ELECTROTECHNICAL
COMMISSION
PRICE CODE
XD
ICS 33.160.60; 35.240.30 ISBN 978-2-88912-426-8
– 2 – 62571 IEC:2011(E)
CONTENTS
FOREWORD . 7
1 Scope . 9
2 Normative references . 9
3 General . 9
4 MultiAudio audiobook extensions . 11
4.1 Table of contents . 11
4.2 Description . 14
4.2.1 General . 14
4.2.2 Tag.Identifier . 14
4.2.3 Tag.OrdinalNumber . 14
4.2.4 Tag.Reserved . 14
4.2.5 Tag.Length . 14
4.2.6 Data . 14
5 Audiobook extendable structure (AB_ExtStruct) . 15
5.1 General . 15
5.2 Tag . 15
5.3 Offset to extra data. 15
5.4 Fixed length chunk (AB_FixedChunk) . 15
5.4.1 General . 15
5.4.2 Example . 16
5.5 Variable length chunk (AB_VariableChunk) . 16
5.5.1 General . 16
5.5.2 Example . 17
5.6 Extra data . 18
6 Audiobook header (AB_Header) . 18
6.1 General . 18
6.2 Tag . 18
6.3 AB_Header fixed chunk . 18
6.3.1 General . 18
6.3.2 Tag . 19
6.3.3 Audiobook specification major version . 19
6.3.4 Audiobook specification minor version . 19
6.3.5 Number of audiobooks (N_Bk) . 19
6.3.6 Offset to playlist indexes . 19
6.4 AB_Header variable chunk . 19
6.4.1 General . 19
6.4.2 Tag . 19
6.4.3 Playlist indexes . 20
6.5 AB_Header extra data . 20
7 Audiobook book (AB_Book) . 20
7.1 General . 20
7.2 Tag . 21
7.3 AB_Book fixed chunk . 21
7.3.1 General . 21
7.3.2 Tag . 22
62571 IEC:2011(E) – 3 –
7.3.3 Text format . 22
7.3.4 Flags . 22
7.3.5 TOC_Playlist index . 22
7.3.6 Offset to book title . 22
7.3.7 Offset to author . 22
7.3.8 Offset to narrator . 22
7.3.9 Offset to copyright statement . 22
7.3.10 Offset to audiobook UUID . 22
7.3.11 Offset to book ISBN identifier . 23
7.3.12 Offset to book ISSN identifier . 23
7.3.13 Offset to book DOI identifier . 23
7.3.14 Offset to book index number . 23
7.3.15 Offset to book shelf number . 23
7.3.16 Offset to book part number . 23
7.3.17 Offset to other identifier . 23
7.3.18 Offset to first audiobook level . 23
7.3.19 Offset to playlist content location identifier . 23
7.4 AB_Book variable chunk . 24
7.4.1 General . 24
7.4.2 Tag . 24
7.4.3 Book title . 24
7.4.4 Author . 25
7.4.5 Narrator . 25
7.4.6 Copyright Statement . 25
7.4.7 Book UUID . 25
7.4.8 Book ISBN Identifier . 25
7.4.9 Book ISSN Identifier . 25
7.4.10 Book DOI Identifier . 26
7.4.11 Book index number . 26
7.4.12 Book shelf number . 26
7.4.13 Book part number . 26
7.4.14 Other identifier . 26
7.4.15 First audiobook level . 26
7.4.16 Playlist content location identifier . 27
7.5 AB_Book extra data. 27
8 Audiobook level (AB_Level) . 27
8.1 General . 27
8.2 Tag . 28
8.3 AB_Level fixed chunk . 28
8.3.1 General . 28
8.3.2 Tag . 28
8.3.3 Text format . 29
8.3.4 Flags . 29
8.3.5 Number of sub-levels . 29
8.3.6 Level number . 29
8.3.7 TOC_TrackEntry index . 29
8.3.8 Offset to sub-level chunk . 29
8.3.9 Offset to level name . 29
8.3.10 Offset to level description . 29
– 4 – 62571 IEC:2011(E)
8.3.11 Book time offset . 29
8.3.12 Time start offset . 29
8.3.13 Time duration . 30
8.3.14 Offset to parent AB_Level . 30
8.3.15 Offset to previous AB_Level . 30
8.3.16 Offset to next AB_Level . 30
8.4 AB_Level variable chunk . 30
8.4.1 General . 30
8.4.2 Tag . 30
8.4.3 Level name . 31
8.4.4 Level description . 31
8.4.5 Sub-level chunk . 31
8.5 AB_Level extra data . 31
9 MPV audiobook profiles . 31
9.1 General . 31
9.2 XML schema namespace identifier . 32
9.3 Use of MPV-Music profile identifier . 32
9.4 Audiobook file . 33
9.5 Audiobook profile metadata . 33
9.6 MPV audiobook profile metadata . 35
9.7 Dublin core metadata usage . 39
9.8 Audiobook profile using MPV-Music playlists . 41
9.8.1 General . 41
9.8.2 Number of playlists per MPV file . 42
9.8.3 Metadata usage . 42
9.8.4 Background usage . 43
9.8.5 Foreground usage . 44
9.8.6 Related and rendition assets . 44
9.9 Groups of assets in playlists . 44
9.10 MPV-Audiobook schema details . 47
9.10.1 XML schema definition . 47
9.10.2 Audiobook metadata usage definitions . 52
9.11 Audiobook level class identifiers . 52
9.12 Audiobook BOOKMARK.AUB . 54
9.13 MPV-Audiobook-Level metadata using . 61
9.14 Music and Stills Asset . 61
9.15 MPV-Audiobook extensions to MPV-Core . 64
9.15.1 Audiobook manifest file types and extensions . 64
9.15.2 Audiobook manifest MIME media type . 64
9.16 Media types and file formats . 65
9.17 MPV-Audiobook and CEA-2003-C . 65
9.18 CEA-2003-C binary to XML formats – Tag comparison . 65
10 Conforming player and digital audiobook file format structure requirements . 66
10.1 General . 66
10.2 Player Requirements . 66
10.3 Digital audiobook file format structure requirements . 67
11 Player and digital audiobook file format structure functionality recommendations . 67
11.1 General . 67
62571 IEC:2011(E) – 5 –
11.2 Player functionality recommendations . 67
11.3 Digital audiobook file format structure functionality recommendations . 68
12 Digital audiobook file format structures and data constraints . 68
13 Additional production options . 68
13.1 ID3 tagging within MP3 files . 68
13.2 M3U playlists . 69
13.3 DAISY production options . 69
14 Certification . 69
Annex A (informative) Audiobook structure examples . 70
Annex B (normative) Encoding types and identifiers . 82
Annex C (informative) Use case and implementation examples: Spoken word audio . 83
Bibliography . 104
Figure A.1 – AB_Level structure organization in AB_Book variable length chunk area . 74
Figure A.2 – View of an audiobook AB_Level hierarchy . 76
Figure C.1 – Example 1 INDEX.AUB . 88
Figure C.2 – Example 1 TOC.MUM . 89
Figure C.3 – Example 2 INDEX.AUB . 94
Figure C.4 – Example 3 INDEX.AUB . 99
Figure C.5 – Example 4 INDEX.AUB . 103
Table 1 – Table of contents data structure . 12
Table 2 – Audiobook MultiAudio chunk format . 14
Table 3 – AB_ExtStruct format . 15
Table 4 – Example—AB_FixedChunk structure . 16
Table 5 – Example—AB_FixedChunk structure revised . 16
Table 6 – Example—AB_FixedChunk fields . 17
Table 7 – Example—AB_VariableChunk data . 17
Table 8 – AB_Header format . 18
Table 9 – AB_Header fixed chunk format . 18
Table 10 – AB_Header variable chunk format . 19
Table 11 – AB_Book format . 21
Table 12 – AB_Book fixed chunk format . 21
Table 13 – AB_Book variable chunk format . 24
Table 14 – AB_Level structure required field settings . 26
Table 15 – Playlist content location identifier field settings . 27
Table 16 – AB_Section format . 28
Table 17 – AB_Level fixed chunk format . 28
Table 18 – AB_Level variable chunk format . 30
Table 19 – Schema namespace identifier . 32
Table 20 – MPV-Music profile identifier . 32
Table 21 – MPV audiobook profile properties . 34
Table 22 – MPV audiobook profile metadata . 35
Table 23 – Dublin Core metadata (DC-NMF) . 40
– 6 – 62571 IEC:2011(E)
Table 24 – Metadata properties usage . 42
Table 25 – CEA-2003-C binary and XML formats – Tag comparison . 65
Table 26 – Examples of ID3 tags . 69
Table A.1 – Sample table of contents . 70
Table A.2 – Table of contents file structure . 71
Table A.3 – File 1 TOC_TrackEntry structure . 71
Table A.4 – File 2 TOC_TrackEntry structure . 71
Table A.5 – File 3 TOC_TrackEntry structure . 71
Table A.6 – File 4 TOC_TrackEntry structure . 72
Table A.7 – Table of contents with file time offsets . 72
Table A.8 – Example TOC_Playlist structure . 72
Table A.9 – Example AB_Book structure . 73
Table A.10 – AB_Level structure at Level 0 (audiobook) . 77
Table A.11 – AB_Level at Level 1 (chapter) . 77
Table A.12 – AB_Level at Level 1 (chapter) . 78
Table A.13 – AB_Level at Level 1 (chapter) . 78
Table A.14 – AB_Level at Level 2 (section) . 79
Table A.15 – AB_Level at Level 2 (section) . 79
Table A.16 – AB_Level at Level 2 (section) . 80
Table A.17 – AB_Level at Level 2 (section) . 80
Table A.18 – AB_Level at Level 3 (subsection) . 81
Table A.19 – AB_Level at Level 3 (subsection) . 81
Table B.1 – Hex representation and interpretation . 82
62571 IEC:2011(E) – 7 –
INTERNATIONAL ELECTROTECHNICAL COMMISSION
____________
DIGITAL AUDIOBOOK FILE FORMAT
AND PLAYER REQUIREMENTS
FOREWORD
1) The International Electrotechnical Commission (IEC) is a worldwide organization for standardization comprising
all national electrotechnical committees (IEC National Committees). The object of IEC is to promote
international co-operation on all questions concerning standardization in the electrical and electronic fields. To
this end and in addition to other activities, IEC publishes International Standards, Technical Specifications,
Technical Reports, Publicly Available Specifications (PAS) and Guides (hereafter referred to as “IEC
Publication(s)”). Their preparation is entrusted to technical committees; any IEC National Committee interested
in the subject dealt with may participate in this preparatory work. International, governmental and non-
governmental organizations liaising with the IEC also participate in this preparation. IEC collaborates closely
with the International Organization for Standardization (ISO) in accordance with conditions determined by
agreement between the two organizations.
2) The formal decisions or agreements of IEC on technical matters express, as nearly as possible, an international
consensus of opinion on the relevant subjects since each technical committee has representation from all
interested IEC National Committees.
3) IEC Publications have the form of recommendations for international use and are accepted by IEC National
Committees in that sense. While all reasonable efforts are made to ensure that the technical content of IEC
Publications is accurate, IEC cannot be held responsible for the way in which they are used or for any
misinterpretation by any end user.
4) In order to promote international uniformity, IEC National Committees undertake to apply IEC Publications
transparently to the maximum extent possible in their national and regional publications. Any divergence
between any IEC Publication and the corresponding national or regional publication shall be clearly indicated in
the latter.
5) IEC itself does not provide any attestation of conformity. Independent certification bodies provide conformity
assessment services and, in some areas, access to IEC marks of conformity. IEC is not responsible for any
services carried out by independent certification bodies.
6) All users should ensure that they have the latest edition of this publication.
7) No liability shall attach to IEC or its directors, employees, servants or agents including individual experts and
members of its technical committees and IEC National Committees for any personal injury, property damage or
other damage of any nature whatsoever, whether direct or indirect, or for costs (including legal fees) and
expenses arising out of the publication, use of, or reliance upon, this IEC Publication or any other IEC
Publications.
8) Attention is drawn to the Normative references cited in this publication. Use of the referenced publications is
indispensable for the correct application of this publication.
9) Attention is drawn to the possibility that some of the elements of this IEC Publication may be the subject of
patent rights. IEC shall not be held responsible for identifying any or all such patent rights.
International Standard IEC 62571 has been prepared by technical area 10: Multimedia e-
publishing and e-book, of IEC technical committee 100: Audio, video and multimedia systems
and equipment.
The text of this standard is based on the following documents:
CDV Report on voting
100/1543/CDV 100/1629/RVC
Full information on the voting for the approval of this standard can be found in the report on
voting indicated in the above table.
This publication has been drafted in accordance with the ISO/IEC Directives, Part 2.
– 8 – 62571 IEC:2011(E)
The committee has decided that the contents of this publication will remain unchanged until
the stability date indicated on the IEC web site under "http://webstore.iec.ch" in the data
related to the specific publication. At this date, the publication will be
• reconfirmed,
• withdrawn,
• replaced by a revised edition, or
• amended.
A bilingual version of this publication may be issued at a later date.
62571 IEC:2011(E) – 9 –
DIGITAL AUDIOBOOK FILE FORMAT
AND PLAYER REQUIREMENTS
1 Scope
This International Standard defines requirements and provides recommendations to
publishers, software developers, content providers, and hardware manufacturers for the data
structure, usability requirements, playback systems and delivery systems for audiobooks in
digital file format. It should be noted that throughout this International Standard, the term
audiobook is defined as any audio file or collection of audio files of primarily spoken word
content that are played in a linear or specified order. Therefore, spoken word audio with
occasional music, a narration of newspaper articles, or other similar spoken word audio is
assimilated to audiobooks in this standard.
2 Normative references
The following referenced documents are indispensable for the application of this document.
For dated references, only the edition cited applies. For undated references, the latest edition
of the referenced document (including any amendments) applies.
ISO/IEC 11578:1996, Information technology – Open Systems Interconnection – Remote
Procedure Call (RPC)
ISO 639-1, Codes for the representation of names of languages – Part 1: Alpha-2 code
ISO 3166-1, Codes for the representation of names of countries and their subdivisions –
Part 1: Country codes
ISO 9660:1988, Information processing – Volume and file structure of CD-ROM for
information interchange
3 General
This International Standard defines the audiobook file format structure for digital audiobook
media. Goals in creating this International Standard include compatibility with music industry
and multimedia standards as well as effectively presenting and navigating an audiobook. This
International Standard is a compilation standard that straddles early binary architectures
represented by earlier versions of CEA-2003 , and newer XML architectures represented by
the Optical Storage Technology Association’s (OSTA) MusicPhotoVideo™ ((MPV). A
conforming playback system and conforming audiobook content provides a wonderful and
highly functional reading and listening experience. The goal of this International Standard is
to create a broad, extensible standard for audiobook publishers, audiobook device
manufacturers, and audiobook software developers in order to create the best listening
experience for the audiobook consumer. An overview follows.
OSTA MultiAudio defines a binary file be placed in the root of the file format structure on a
CD/MP3. The TOC.MAU serves as the table of contents for the associated MP3 files on a
—————————
See Bibliography.
MusicPhotoVideo is the trade name of a program supplied by the Optical Storage Technology Association. This
information is given for the convenience of users of this document and does not constitute an endorsement by
IEC of the program named.
– 10 – 62571 IEC:2011(E)
compliant CD/MP3. Conforming MultiAudio players read this file to obtain the information the
player needs to present the content to the listener.
OSTA MPV defines an eXtensible Markup Language (XML) schema to be placed in the root of
the file format structure. The INDEX.AUB serves as the table of contents for the associated
audiobook files, regardless of media, codec, or delivery type. Conforming MPV players read
this file to obtain the information the player needs to present the content to the listener.
Audiobooks conforming to this standard MUST utilize the audiobook-specific file
extension, .AUB. In addition, audiobooks conforming to this standard may utilize the same
MPV file extension specified by the MPV Music Profile, .MUM, in order to create additional
compatibility for MPV-compliant players that do not recognize the audiobook extension.
Similarly, while OSTA MPV was primarily designed for the music industry and CD-ROM media,
it provides "extensions" for enhancing the format. This International Standard "extends" OSTA
MPV to support audiobooks and does not require a specific media or content delivery system.
The audiobook extensions are defined as the MultiAudio binary “TOC.MAU” for CD/MP3
audiobooks, and the MPV XML “INDEX.AUB” for all digital audiobooks.
Therefore, throughout this International Standard, “digital audiobook file format structure” is
used to describe the entire audiobook content, regardless of media or method of delivery.
“Delivery system” is used to represent any system used for playback of the content, whether it
is actual physical media or another type of delivery system, such as downloaded or streaming
content. This International Standard defines required MPV data and audiobook-specific Extra
Data requirements. The conforming player, by reading the INDEX.AUB or TOC.MAU, is
presented with the data needed to effectively navigate and play the content in the manner the
publisher intended. Conforming players and conforming audio content shall meet the
requirements contained in Clause 11. In addition, conforming players and conforming audio
content are encouraged, but not required, to conform to the recommendations of Clause 11.
The player identifies and reads the INDEX.AUB or TOC.MAU file in the root of the digital
audiobook file format structure. A conforming player recognizes the audiobook identifier in the
OSTA MPV Extra Data field, and processes the audiobook data as defined in Clauses 4
through 7 accordingly. OSTA MPV provides specific identifiers to tell the player the type of
encoded audio file that is in use. OSTA MPV also provides an “unknown” identifier, which
signals the device to determine the type of encoding used. Once an audiobook and its
encoding method have been detected, the playback system can reference navigational points
and informational fields and present them to the listener. Excessive delays in playback or
moving to the next portion of the audiobook should be minimized; excessive delays are not
acceptable according to this standard.
One or more audiobooks may reside on the system, but if there is more than one audiobook,
they shall each reside in separate directories within the digital audiobook file format structure.
The MPV file contains one or more "playlists" for each audiobook. The playlist identifies the
correct linear reading order of the files to be played for each book, and the location of the
digital audiobook files. A simple digital audiobook file format structure residing on physical
media may need only one playlist; a more complex digital audiobook file format structure that
includes choices of delivery, such as local and streaming, may have multiple playlists to
coincide with each method of delivery.
Navigational levels are defined by the content publisher within the INDEX.AUB or TOC.MAU
file, and may be as simple or as complex as desired. Extra Data fields identify the levels and
“time offsets” to navigational points within files or file locations. Pointers to the “parent” and
“child” locations are also defined within Extra Data fields. In this manner, an unlimited amount
of levels and navigation points may be identified. Within the binary TOC.MAU file, Level 0
navigation is the top level; for example, the audiobook itself, or a collection of several
audiobooks contained in separate folders. Level 1 navigation is the first level of nested
pointers defined, and is required for compliance. In a very simple example, a digital
audiobook file format structure could contain one audiobook comprised of one digital file.
Each of the chapter locations are defined as Level 1 navigation points within the TOC.MAU.
62571 IEC:2011(E) – 11 –
The conforming player is able to present the user with Level 1 navigation by moving to each
of the defined offsets.
More complex digital audiobook file format structures can be achieved by nesting levels within
the INDEX.AUB. A digital audiobook file format structure can contain several layers or levels,
with offsets to the next navigation point, parent level, and child level. As a more complex
example, a digital audiobook file format structure could contain two audiobooks, each in a
separate folder and comprised of ten chapters each, with each chapter encoded as a separate
digital file. Navigation would include pointers to each separate audiobook, and enable the
player to choose between one or the other. Navigation markers would point to the beginning
of each chapter, allowing navigation from chapter to chapter within the audiobook. Nested
levels would point to pages within the chapters.
NOTE 1 Books, chapters and pages are used as level name examples; however, there is no naming requirement
for levels in order to be compliant with this International Standard.
The only constraint on the audio publisher is to encode MP3 files with a "fixed bit rate"
because variable bit rate encoding makes it impossible for low-powered players to calculate
the data offset from the time offset.
NOTE 2 Audiobook content providers are strongly encouraged to name the audio files using an ASCII collated
sequence. This means that the unaware player would still play the audio files in order and the navigation from file
to file would be whatever design the audio publisher chose for structuring the files, that is one chapter per file
would yield chapter navigation; one section per file would yield section navigation. The correct linear reading order
would then
...




Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.
Loading comments...