ISO/IEC 15444-12:2012
(Main)Information technology - JPEG 2000 image coding system - Part 12: ISO base media file format
Information technology - JPEG 2000 image coding system - Part 12: ISO base media file format
ISO/IEC 15444-12:2012 specifies the structure and uses of the ISO base media file format. The identical text is published as ISO/IEC 14496-12:2012. This file format is used to contain time-based media such as video and audio. The storage of particular coding schemes is defined in specifications that derive from and reference ISO/IEC 15444-12:2012 and ISO/IEC 14496-12:2012, such as the MPEG-4 file format specified in ISO/IEC 14496-14, or the Motion JPEG file format specified in ISO/IEC 15444-3. This file format is designed to contain timed media information for a presentation in a flexible, extensible format that facilitates interchange, management, editing and presentation of the media. This presentation may be "local" to the system containing the presentation, or may be via a network or other stream delivery mechanism. The file format is designed to be independent of any particular network protocol while enabling efficient support for them in general. The file structure is object-oriented; a file can be decomposed into constituent objects very simply, and the structure of the objects inferred directly from their type. This technically identical text is published as ISO/IEC 14496-12:2012 for MPEG-4, and as ISO/IEC 15444-12:2012 for JPEG 2000, and reference to this specification should be made accordingly. The recommendation is to reference one, for example ISO/IEC 15444-12:2012, and append to the reference a parenthetical comment identifying the other, for example "(technically identical to ISO/IEC 14496-12:2012)".
Technologies de l'information — Système de codage d'images JPEG 2000 — Partie 12: Format ISO de base pour les fichiers médias
General Information
Relations
Frequently Asked Questions
ISO/IEC 15444-12:2012 is a standard published by the International Organization for Standardization (ISO). Its full title is "Information technology - JPEG 2000 image coding system - Part 12: ISO base media file format". This standard covers: ISO/IEC 15444-12:2012 specifies the structure and uses of the ISO base media file format. The identical text is published as ISO/IEC 14496-12:2012. This file format is used to contain time-based media such as video and audio. The storage of particular coding schemes is defined in specifications that derive from and reference ISO/IEC 15444-12:2012 and ISO/IEC 14496-12:2012, such as the MPEG-4 file format specified in ISO/IEC 14496-14, or the Motion JPEG file format specified in ISO/IEC 15444-3. This file format is designed to contain timed media information for a presentation in a flexible, extensible format that facilitates interchange, management, editing and presentation of the media. This presentation may be "local" to the system containing the presentation, or may be via a network or other stream delivery mechanism. The file format is designed to be independent of any particular network protocol while enabling efficient support for them in general. The file structure is object-oriented; a file can be decomposed into constituent objects very simply, and the structure of the objects inferred directly from their type. This technically identical text is published as ISO/IEC 14496-12:2012 for MPEG-4, and as ISO/IEC 15444-12:2012 for JPEG 2000, and reference to this specification should be made accordingly. The recommendation is to reference one, for example ISO/IEC 15444-12:2012, and append to the reference a parenthetical comment identifying the other, for example "(technically identical to ISO/IEC 14496-12:2012)".
ISO/IEC 15444-12:2012 specifies the structure and uses of the ISO base media file format. The identical text is published as ISO/IEC 14496-12:2012. This file format is used to contain time-based media such as video and audio. The storage of particular coding schemes is defined in specifications that derive from and reference ISO/IEC 15444-12:2012 and ISO/IEC 14496-12:2012, such as the MPEG-4 file format specified in ISO/IEC 14496-14, or the Motion JPEG file format specified in ISO/IEC 15444-3. This file format is designed to contain timed media information for a presentation in a flexible, extensible format that facilitates interchange, management, editing and presentation of the media. This presentation may be "local" to the system containing the presentation, or may be via a network or other stream delivery mechanism. The file format is designed to be independent of any particular network protocol while enabling efficient support for them in general. The file structure is object-oriented; a file can be decomposed into constituent objects very simply, and the structure of the objects inferred directly from their type. This technically identical text is published as ISO/IEC 14496-12:2012 for MPEG-4, and as ISO/IEC 15444-12:2012 for JPEG 2000, and reference to this specification should be made accordingly. The recommendation is to reference one, for example ISO/IEC 15444-12:2012, and append to the reference a parenthetical comment identifying the other, for example "(technically identical to ISO/IEC 14496-12:2012)".
ISO/IEC 15444-12:2012 is classified under the following ICS (International Classification for Standards) categories: 35.040 - Information coding; 35.040.30 - Coding of graphical and photographical information. The ICS classification helps identify the subject area and facilitates finding related standards.
ISO/IEC 15444-12:2012 has the following relationships with other standards: It is inter standard links to ISO/IEC 15444-12:2012/Amd 3:2015, ISO/IEC 15444-12:2012/Amd 1:2013, ISO/IEC 15444-12:2012/Amd 2:2014, ISO/IEC 15444-12:2015, ISO/IEC 15444-12:2008/FDAmd 3, ISO/IEC 15444-12:2008/FDAmd 2, ISO/IEC 15444-12:2008, ISO/IEC 15444-12:2008/Amd 1:2009. Understanding these relationships helps ensure you are using the most current and applicable version of the standard.
You can purchase ISO/IEC 15444-12:2012 directly from iTeh Standards. The document is available in PDF format and is delivered instantly after payment. Add the standard to your cart and complete the secure checkout process. iTeh Standards is an authorized distributor of ISO standards.
Standards Content (Sample)
INTERNATIONAL ISO/IEC
STANDARD 15444-12
Fourth edition
2012-07-15
Corrected version
2012-09-15
Information technology — JPEG 2000
image coding system —
Part 12:
ISO base media file format
Technologies de l'information — Système de codage d'images JPEG
2000 —
Partie 12: Format ISO de base pour les fichiers médias
Reference number
©
ISO/IEC 2012
© ISO/IEC 2012
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means,
electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or
ISO's member body in the country of the requester.
ISO copyright office
Case postale 56 • CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland
ii © ISO/IEC 2012 – All rights reserved
Contents Page
Foreword .ix
Introduction.xi
1 Scope.1
2 Normative references .1
3 Definitions.2
3.1 Terms and definitions.2
3.2 Abbreviated terms.4
4 Object-structured File Organization.4
4.1 File Structure .4
4.2 Object Structure .4
4.3 File Type Box.5
5 Design Considerations .6
5.1 Usage.6
5.1.1 Introduction .6
5.1.2 Interchange.6
5.1.3 Content Creation .7
5.1.4 Preparation for streaming .8
5.1.5 Local presentation .8
5.1.6 Streamed presentation .8
5.2 Design principles .8
6 ISO Base Media File organization .9
6.1 Presentation structure.9
6.1.1 File Structure .9
6.1.2 Object Structure .9
6.1.3 Meta Data and Media Data.9
6.1.4 Track Identifiers .10
6.2 Metadata Structure (Objects).10
6.2.1 Box.10
6.2.2 Data Types and fields .10
6.2.3 Box Order.11
6.2.4 URIs as type indicators .13
6.3 Brand Identification .14
7 Streaming Support.14
7.1 Handling of Streaming Protocols .14
7.2 Protocol ‘hint’ tracks .14
7.3 Hint Track Format .15
8 Box Structures.16
8.1 File Structure and general boxes .16
8.1.1 Media Data Box .16
8.1.2 Free Space Box .17
8.1.3 Progressive Download Information Box.17
8.2 Movie Structure .18
8.2.1 Movie Box .18
8.2.2 Movie Header Box .18
8.3 Track Structure.19
8.3.1 Track Box.19
8.3.2 Track Header Box.20
8.3.3 Track Reference Box .21
© ISO/IEC 2012 – All rights reserved iii
8.3.4 Track Group Box .22
8.4 Track Media Structure.23
8.4.1 Media Box.23
8.4.2 Media Header Box .23
8.4.3 Handler Reference Box.24
8.4.4 Media Information Box.25
8.4.5 Media Information Header Boxes .25
8.5 Sample Tables .27
8.5.1 Sample Table Box.27
8.5.2 Sample Description Box .28
8.5.3 Degradation Priority Box .33
8.5.4 Sample Scale Box.34
8.6 Track Time Structures.34
8.6.1 Time to Sample Boxes .34
8.6.2 Sync Sample Box .38
8.6.3 Shadow Sync Sample Box .39
8.6.4 Independent and Disposable Samples Box.40
8.6.5 Edit Box .41
8.6.6 Edit List Box.41
8.7 Track Data Layout Structures .43
8.7.1 Data Information Box .43
8.7.2 Data Reference Box.43
8.7.3 Sample Size Boxes.44
8.7.4 Sample To Chunk Box .45
8.7.5 Chunk Offset Box .46
8.7.6 Padding Bits Box.47
8.7.7 Sub-Sample Information Box .48
8.7.8 Sample Auxiliary Information Sizes Box.49
8.7.9 Sample Auxiliary Information Offsets Box .50
8.8 Movie Fragments.51
8.8.1 Movie Extends Box.51
8.8.2 Movie Extends Header Box .52
8.8.3 Track Extends Box .52
8.8.4 Movie Fragment Box .53
8.8.5 Movie Fragment Header Box.53
8.8.6 Track Fragment Box.54
8.8.7 Track Fragment Header Box .54
8.8.8 Track Fragment Run Box.55
8.8.9 Movie Fragment Random Access Box.56
8.8.10 Track Fragment Random Access Box.57
8.8.11 Movie Fragment Random Access Offset Box.58
8.8.12 Track fragment decode time.58
8.8.13 Level Assignment Box .59
8.8.14 Sample Auxiliary Information in Movie Fragments.61
8.9 Sample Group Structures.61
8.9.1 Introduction.61
8.9.2 Sample to Group Box.62
8.9.3 Sample Group Description Box .63
8.9.4 Representation of group structures in Movie Fragments.64
8.10 User Data.65
8.10.1 User Data Box .65
8.10.2 Copyright Box .65
8.10.3 Track Selection Box .66
8.11 Metadata Support .68
8.11.1 The Meta box.68
8.11.2 XML Boxes.69
8.11.3 The Item Location Box .69
8.11.4 Primary Item Box .71
8.11.5 Item Protection Box.72
8.11.6 Item Information Box.72
iv © ISO/IEC 2012 – All rights reserved
8.11.7 Additional Metadata Container Box.74
8.11.8 Metabox Relation Box.75
8.11.9 URL Forms for meta boxes .75
8.11.10 Static Metadata .76
8.11.11 Item Data Box.77
8.11.12 Item Reference Box.77
8.11.13 Auxiliary video metadata .78
8.12 Support for Protected Streams.78
8.12.1 Protection Scheme Information Box .79
8.12.2 Original Format Box.80
8.12.3 IPMPInfoBox .80
8.12.4 IPMP Control Box .80
8.12.5 Scheme Type Box.80
8.12.6 Scheme Information Box .81
8.13 File Delivery Format Support.81
8.13.1 Introduction.81
8.13.2 FD Item Information Box.82
8.13.3 File Partition Box .82
8.13.4 FEC Reservoir Box.84
8.13.5 FD Session Group Box .84
8.13.6 Group ID to Name Box .85
8.13.7 File Reservoir Box.86
8.14 Sub tracks.86
8.14.1 Introduction.86
8.14.2 Backward compatibility .87
8.14.3 Sub Track box.87
8.14.4 Sub Track Information box.87
8.14.5 Sub Track Definition box .88
8.14.6 Sub Track Sample Group box.89
8.15 Post-decoder requirements on media .89
8.15.1 General .89
8.15.2 Transformation .89
8.15.3 Restricted Scheme Information box.90
8.15.4 Scheme for stereoscopic video arrangements .90
8.16 Segments .92
8.16.1 Introduction.92
8.16.2 Segment Type Box .92
8.16.3 Segment Index Box .93
8.16.4 Subsegment Index Box.96
8.16.5 Producer Reference Time Box .98
9 Hint Track Formats .99
9.1 RTP and SRTP Hint Track Format .99
9.1.1 Introduction .99
9.1.2 Sample Description Format .99
9.1.3 Sample Format .101
9.1.4 SDP Information .103
9.1.5 Statistical Information .104
9.2 ALC/LCT and FLUTE Hint Track Format.105
9.2.1 Introduction .105
9.2.2 Design principles .105
9.2.3 Sample Description Format .107
9.2.4 Sample Format .107
9.3 MPEG-2 Transport Hint Track Format.110
9.3.1 Introduction .110
9.3.2 Design Principles .111
9.3.3 Sample Description Format .112
9.3.4 Sample Format .114
9.3.5 Protected MPEG 2 Transport Stream Hint Track .116
9.4 RTP, RTCP, SRTP and SRTCP Reception Hint Tracks.117
© ISO/IEC 2012 – All rights reserved v
9.4.1 RTP Reception Hint Track .117
9.4.2 RTCP Reception Hint Track.120
9.4.3 SRTP Reception Hint Track.121
9.4.4 SRTCP Reception Hint Tracks .123
9.4.5 Protected RTP Reception Hint Track.124
9.4.6 Recording Procedure.124
9.4.7 Parsing Procedure.124
10 Sample Groups .124
10.1 Random Access Recovery Points .124
10.2 Rate Share Groups.125
10.2.1 Introduction.125
10.2.2 Rate Share Sample Group Entry .126
10.2.3 Relationship between tracks .127
10.2.4 Bitrate allocation.128
10.3 Alternative Startup Sequences .128
10.3.1 Definition .128
10.3.2 Syntax .129
10.3.3 Semantics .129
10.3.4 Examples .129
10.4 Random Access Point (RAP) Sample Grouping .131
10.4.1 Definition .131
10.4.2 Syntax .131
10.4.3 Semantics .131
10.5 Temporal level sample grouping .131
10.5.1 Definition .131
10.5.2 Syntax .132
10.5.3 Semantics .132
11 Extensibility .132
11.1 Objects .132
11.2 Storage formats .133
11.3 Derived File formats.133
Annex A (informative) Overview and Introduction.134
A.1 Section Overview.134
A.2 Core Concepts .134
A.3 Physical structure of the media .134
A.4 Temporal structure of the media.135
A.5 Interleave .135
A.6 Composition .135
A.7 Random access.136
A.8 Fragmented movie files.136
Annex B (informative) Patent Statements.138
Annex C (informative) Guidelines on deriving from this specification.139
C.1 Introduction .139
C.2 General Principles .139
C.2.1 General.139
C.2.2 Base layer operations.139
C.3 Boxes .140
C.4 Brand Identifiers .140
C.4.1 Introduction .140
C.4.2 Usage of the Brand.141
C.4.3 Introduction of a new brand .141
C.4.4 Player Guideline.141
C.4.5 Authoring Guideline .142
C.4.6 Example .142
C.5 Storage of new media types .142
C.6 Use of Template fields.143
C.7 Tracks .143
vi © ISO/IEC 2012 – All rights reserved
C.7.1 Data Location.143
C.7.2 Time .143
C.7.3 Media Types .144
C.7.4 Coding Types.144
C.7.5 Sub-sample information .144
C.7.6 Sample Dependency .144
C.7.7 Sample Groups .144
C.7.8 Track-level.144
C.7.9 Protection.145
C.8 Construction of fragmented movies.145
C.9 Meta-data.146
C.10 Registration.146
C.11 Guidelines on the use of sample groups, timed metadata tracks, and sample auxiliary
information.146
Annex D (informative) Registration Authority.148
D.1 Code points to be registered.148
D.2 Procedure for the request of an MPEG-4 registered identifier value.148
D.3 Responsibilities of the Registration Authority.149
D.4 Contact information for the Registration Authority.149
D.5 Responsibilities of Parties Requesting a RID.149
D.6 Appeal Procedure for Denied Applications .150
D.7 Registration Application Form.150
D.7.1 Contact Information of organization requesting a RID.150
D.7.2 Request for a specific RID.150
D.7.3 Short description of RID that is in use and date system was implemented .151
D.7.4 Statement of an intention to apply the assigned RID .151
D.7.5 Date of intended implementation of the RID.151
D.7.6 Authorized representative .151
D.7.7 For official use of the Registration Authority.151
Annex E (normative) File format brands .152
E.1 Introduction.152
E.2 The ‘isom’ brand .153
E.3 The ‘avc1’ brand .154
E.4 The ‘iso2’ brand .154
E.5 The ‘mp71’ brand .155
E.6 The ‘iso3’ brand .155
E.7 The ‘iso4’ brand .155
E.8 The ‘iso5’ brand .156
E.9 The ‘iso6’ brand .156
Annex F (informative) Document Cross-Reference.157
Annex G (informative) URI-labelled metadata forms.159
G.1 UUID-labelled metadata .159
G.2 ISO OID-labelled metadata .159
G.3 SMPTE-labelled metadata.159
Annex H (informative) Processing of RTP streams and reception hint tracks.161
H.1 Introduction.161
H.1.1 Overview.161
H.1.2 Structure.161
H.1.3 Terms and definitions .161
H.2 Synchronization of RTP streams .161
H.3 Recording of RTP streams .162
H.3.1 Introduction.162
H.3.2 Compensation for unequal starting for position of received RTP streams .164
H.3.3 Recording of SDP .165
H.3.4 Creation of a sample within an RTP reception hint track.165
H.3.5 Representation of RTP timestamps.166
H.3.6 Recording operations to facilitate inter-stream synchronization in playback.169
© ISO/IEC 2012 – All rights reserved vii
H.3.7 Representation of reception times.170
H.3.8 Creation of media samples .171
H.3.9 Creation of hint samples referring to media samples.171
H.4 Playing of recorded RTP streams .171
H.4.1 Introdu
...








Questions, Comments and Discussion
Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.
Loading comments...