Information technology — Coding of audio-visual objects — Part 3: Audio — Amendment 4: New levels for AAC profiles

Technologies de l'information — Codage des objets audiovisuels — Partie 3: Codage audio — Amendement 4: Nouveaux niveaux pour profils AAC

General Information

Status
Withdrawn
Publication Date
09-Dec-2013
Withdrawal Date
09-Dec-2013
Current Stage
9599 - Withdrawal of International Standard
Completion Date
12-Dec-2019
Ref Project

Relations

Buy Standard

Standard
ISO/IEC 14496-3:2009/Amd 4:2013 - New levels for AAC profiles
English language
26 pages
sale 15% off
Preview
sale 15% off
Preview
Standard
ISO/IEC 14496-3:2009/Amd 4:2013 - New levels for AAC profiles
English language
26 pages
sale 15% off
Preview
sale 15% off
Preview

Standards Content (Sample)

INTERNATIONAL ISO/IEC
STANDARD 14496-3
Fourth edition
2009-09-01
AMENDMENT 4
2013-12-15


Information technology — Coding of
audio-visual objects —
Part 3:
Audio
AMENDMENT 4: New levels for AAC profiles
Technologies de l'information — Codage des objets audiovisuels —
Partie 3: Codage audio
AMENDEMENT 4: Nouveaux niveaux pour profils AAC




Reference number
ISO/IEC 14496-3:2009/Amd.4:2013(E)
©
ISO/IEC 2013

---------------------- Page: 1 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)

COPYRIGHT PROTECTED DOCUMENT


©  ISO/IEC 2013
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form or by any
means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior written permission.
Permission can be requested from either ISO at the address below or ISO’s member body in the country of the requester.
ISO copyright office
Case postale 56 • CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland

ii © ISO/IEC 2013 – All rights reserved

---------------------- Page: 2 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
Foreword
ISO (the International Organization for Standardization) and IEC (the International Electrotechnical
Commission) form the specialized system for worldwide standardization. National bodies that are members of
ISO or IEC participate in the development of International Standards through technical committees
established by the respective organization to deal with particular fields of technical activity. ISO and IEC
technical committees collaborate in fields of mutual interest. Other international organizations, governmental
and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information
technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.
International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.
The main task of the joint technical committee is to prepare International Standards. Draft International
Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as
an International Standard requires approval by at least 75 % of the national bodies casting a vote.
Attention is drawn to the possibility that some of the elements of this document may be the subject of patent
rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.
Amendment 4 to ISO/IEC 14496-3:2009 was prepared by Joint Technical Committee ISO/IEC JTC 1,
Information technology, Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia
information.
© ISO/IEC 2013 – All rights reserved iii

---------------------- Page: 3 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
Information technology — Coding of audio-visual objects —
Part 3:
Audio
AMENDMENT 4: New levels for AAC profiles
1 Notes from the editor
Some modifications with respect to ISO/IEC 14496-3:2009 are highlighted by colored background as follows.
These highlights shall be removed upon integration of the amendment into a future edition of ISO/IEC 14496-3.
YELLOW Cross-references and subclause numbers that need to be checked and aligned with ISO/IEC
14496-3:2009.
GRAY Changes relative to ISO/IEC 14496-3:2009 that are highlighted for better visibility.

2 Changes to the text of ISO/IEC 14496-3:2009
In 1.2 Normative references, add:
ISO/IEC 23001-8, Information technology — MPEG systems technologies — Part 8: Coding-independent
code points

In 1.3 Terms and Definitions, add:
1.3.z SAOC-DE: Spatial Audio Object Coding Dialogue Enhancement
and increase the index-number of subsequent entries.
© ISO/IEC 2013 – All rights reserved 1

---------------------- Page: 4 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
In 1.5.1.1 Audio object type definition, amend Table 1.1 with the updates in the table below:
0 Null
[.] […]
43 SAOC
44 LD MPEG Surround
45 SAOC-DE
46 - 95 (reserved)


After 1.5.1.2.39 add the following new subclauses:
1.5.1.2.40 SAOC-DE object type
The SAOC-DE object type conveys Spatial Audio Object Coding Dialogue Enhancement side information (see
ISO/IEC 23003-2:2010/Amd.3) in the MPEG-4 Audio framework.

In 1.5.2.1 (Profiles), Table 1.3 (Audio Profiles definition), add:
Object Type ID Audio Object Type …
… … …
43 SAOC
44 LD MPEG Surround
45 SAOC-DE

2 © ISO/IEC 2013 – All rights reserved

Object Type ID
Audio Object Type
gain control
[…]
Remark

---------------------- Page: 5 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
In 1.5.2.3 Levels within the profiles replace:
Table 1.10 – Levels for the AAC Profile
Level Max. Max. Max. PCU Max. RCU
channels/ sampling
object rate [kHz]
1 2 24 3 5
2 2 48 6 5
3 NA NA NA NA
4 5 48 19 15
5 5 96 38 15

With:
Table 1.10 – Levels for the AAC Profile
Level Max. Max. Max. PCU Max. RCU
channels/ sampling
object rate [kHz]
1 2 24 3 5
2 2 48 6 5
3 NA NA NA NA
4 5 48 19 15
5 5 96 38 15
6 7 48 25 19
7 7 96 50 19

Replace:
Table 1.11 – Levels for the High Efficiency AAC Profile
Level Max. Max. AAC Max. AAC Max. SBR Max. PCU Max. RCU Max. PCU Max. RCU
channels/ sampling sampling sampling Low power Low power
object rate, SBR rate, SBR rate [kHz] SBR SBR
not present present [kHz] (in/out)

[kHz]
1 NA NA NA NA NA NA NA NA
2 2 48 24 24/48 9 10 7 8
© ISO/IEC 2013 – All rights reserved 3

---------------------- Page: 6 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
3 2 48 48 48/48 (Note 1)15 10 12 8
4 5 48 24/48 (Note 2) 48/48 (Note 1) 25 28 20 23
5 5 96 48 48/96 49 28 39 23
Note 1: For level 3 and level 4 decoders, it is mandatory to operate the SBR tool in downsampled mode if
the sampling rate of the AAC core is higher than 24kHz. Hence, if the SBR tool operates on a 48kHz AAC
signal, the internal sampling rate of the SBR tool will be 96kHz, however, the output signal will be
downsampled by the SBR tool to 48kHz.
Note 2: For one or two channels the maximum AAC sampling rate, with SBR present, is 48kHz. For more
than two channels the maximum AAC sampling rate, with SBR present, is 24kHz.

With:
Table 1.11 – Levels for the High Efficiency AAC Profile
Level Max. Max. AAC Max. AAC Max. SBR Max. PCU Max. RCU Max. PCU Max. RCU
channels/ sampling sampling sampling Low power Low power
object rate, SBR rate, SBR rate [kHz] SBR SBR
not present present [kHz] (in/out)

[kHz]
1 NA NA NA NA NA NA NA NA
2 2 48 24 24/48 9 10 7 8
3 2 48 48 48/48 (Note 1)15 10 12 8
4 5 48 24/48 (Note 2) 48/48 (Note 1) 25 28 20 23
5 5 96 48 48/96 49 28 39 23
6 7 48 24/48 (Note 2) 48/48 (Note 1) 34 37 27 30
7 7 96 48 48/96 67 37 53 30
Note 1: For level 3, level 4 and level 6 decoders, it is mandatory to operate the SBR tool in downsampled
mode if the sampling rate of the AAC core is higher than 24kHz. Hence, if the SBR tool operates on a 48kHz
AAC signal, the internal sampling rate of the SBR tool will be 96kHz, however, the output signal will be
downsampled by the SBR tool to 48kHz.
Note 2: For one or two channels the maximum AAC sampling rate, with SBR present, is 48kHz. For more
than two channels the maximum AAC sampling rate, with SBR present, is 24kHz.

Replace:
Table 1.12 – Levels for the High Efficiency AAC v2 Profile
Level Max. Max. AAC Max. AAC Max. SBR Max. PCU Max. RCU Max. PCU Max. RCU
(Note 1) channels/o sampling rate, sampling rate, sampling rate HQ / LP SBR HQ / LP SBR
bject SBR not SBR present [kHz] (in/out) (Note 5) (Note 5)
present [kHz] [kHz]
1 NA NA NA NA NA NA NA NA
2 2 48 24 24/48 9 10 9 10
3 2 48 24/48 (Note 3) 48/48 (Note 2) 15 10 15 10
4 5 48 24/48 (Note 4) 48/48 (Note 2) 25 28 20 23
5 5 96 48 48/96 49 28 39 23
4 © ISO/IEC 2013 – All rights reserved

---------------------- Page: 7 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
Note 1: Level 2,3, and 4 HE AAC v2 Profile decoders implements the baseline version of the parametric
stereo tool. A level 5 decoder shall not be limited to the baseline version of the parametric stereo tool.
Note 2: For level 3 and level 4 decoders, it is mandatory to operate the SBR tool in downsampled mode if
the sampling rate of the AAC core is higher than 24kHz. Hence, if the SBR tool operates on a 48kHz AAC
signal, the internal sampling rate of the SBR tool will be 96kHz, however, the output signal will be
downsampled by the SBR tool to 48kHz.
Note 3: If Parametric Stereo data is present the maximum AAC sampling rate is 24kHz, if Parametric Stereo
data is not present the maximum AAC sampling rate is 48kHz.
Note 4: For one or two channels the maximum AAC sampling rate, with SBR present, is 48kHz. For more
than two channels the maximum AAC sampling rate, with SBR present, is 24kHz.
Note 5: The PCU/RCU number are given for a decoder operating the LP SBR tool whenever applicable.

With:
Table 1.12 – Levels for the High Efficiency AAC v2 Profile
Level Max. Max. AAC Max. AAC Max. SBR Max. PCU Max. RCU Max. PCU Max. RCU
(Note 1) channels/o sampling rate, sampling rate, sampling rate HQ / LP SBR HQ / LP SBR
bject SBR not SBR present [kHz] (in/out) (Note 5) (Note 5)
present [kHz] [kHz]
1 NA NA NA NA NA NA NA NA
2 2 48 24 24/48 9 10 9 10
3 2 48 24/48 (Note 3) 48/48 (Note 2) 15 10 15 10
4 5 48 24/48 (Note 4) 48/48 (Note 2) 25 28 20 23
5 5 96 48 48/96 49 28 39 23
6 7 48 24/48 (Note 4) 48/48 (Note 2) 34 37 27 30
7 7 96 48 48/96 67 37 53 30
Note 1: Level 2, 3, 4, 6 and 7 HE AAC v2 Profile decoders implement the baseline version of the parametric
stereo tool. A level 5 decoder shall not be limited to the baseline version of the parametric stereo tool.
Note 2: For level 3, level 4 and level 6 decoders, it is mandatory to operate the SBR tool in downsampled
mode if the sampling rate of the AAC core is higher than 24kHz. Hence, if the SBR tool operates on a 48kHz
AAC signal, the internal sampling rate of the SBR tool will be 96kHz, however, the output signal will be
downsampled by the SBR tool to 48kHz.
Note 3: If Parametric Stereo data is present the maximum AAC sampling rate is 24kHz, if Parametric Stereo
data is not present the maximum AAC sampling rate is 48kHz.
Note 4: For one or two channels the maximum AAC sampling rate, with SBR present, is 48kHz. For more
than two channels the maximum AAC sampling rate, with SBR present, is 24kHz.
Note 5: The PCU/RCU number are given for a decoder operating the LP SBR tool whenever applicable.

© ISO/IEC 2013 – All rights reserved 5

---------------------- Page: 8 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
In 1.5.2.4 audioProfileLevelIndication
Insert the following new entries into Table 1.14 “audioProfileLevelIndication values” and adapt the “reserved
for ISO use” range accordingly:
0x50 AAC Profile L6
0x51 AAC Profile L7
0x52 HE-AAC Profile L6
0x53 HE-AAC Profile L7
0x54 HE-AACv2 Profile L6
0x55 HE-AACv2 Profile L7
0x56 Extended HE-AAC Profile L6
0x57 Extended HE-AAC Profile L7
0x58 - 0x7F reserved for ISO use -

In 1.5.2.3 after:
"The NA (Not Applicable) levels are introduced to emphasize the hierarchical structure of the AAC Profile and
the High Efficiency AAC Profile. Hence, a decoder supporting the High Efficiency AAC Profile at a given level
can decode an AAC Profile stream of the same or a lower level. The NA levels are not indicated in the
audioProfileLevelIndication table (Table 1.14).
Add:
NOTE: A Level 6 or 7 decoder is not required to decode a Level 5 stream."

In 1.5.2.3 "1.5.2.3 Levels within the profiles",in the AAC, HE-AAC and HE-AACv2 Profile definition after each
occurrence of the sentence:
"For the audio object type 2 (AAC LC), mono or stereo mixdown elements are not permitted."

Add:
"For Levels 1 to 5 the height extension element is not permitted."

6 © ISO/IEC 2013 – All rights reserved

---------------------- Page: 9 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
In 1.6.2.1 extend Table 1.15 “AudioSpecificConfig()”as follows:
Table 1.15 – Syntax of AudioSpecificConfig()
Syntax No. of bits Mnemonic
AudioSpecificConfig ()
{



saocPresentFlag = -1;
ldmpsPresentFlag = -1;
saocDePresentFlag = -1;
if ( audioObjectType == 5 ||
 audioObjectType == 29 ) {




case 43:
 saocPresentFlag = 1;
 saocPayloadEmbedding; 1 uimsbf
 SaocSpecificConfig();
 break;
case 44:
 ldmpsPresentFlag = 1;
 ldsacPayloadEmbedding; 1 uimsbf
 LDSpatialSpecificConfig();
 break;
case 45:
 saocDePresentFlag = 1;
 saocDePayloadEmbedding; 1 uimsbf
 SaocDeSpecificConfig();
 break;
default:
 /* reserved */
}



if (extensionIdentifier == -1 && bits_to_decode() >= 11 ) {
 extensionIdentifier; 11 bslbf
}
if ( extensionIdentifier == 0x7cb ) {
 extensionIdentifier = -1;
 if ( audioObjectType != 43 && bits_to_decode() >= 1 ) {

  saocPresentFlag; 1 uimsbf
  if ( saocPresentFlag == 1 ) {
  saocPayloadEmbedding = 1;
  saocscLen; 8 uimsbf
  if ( saocscLen == 0xff ) {
   saocscLenExt; 16 uimsbf
   saocscLen += saocscLenExt;
  }
  SaocSpecificConfig();
  }
 }
© ISO/IEC 2013 – All rights reserved 7

---------------------- Page: 10 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
}
if (extensionIdentifier == -1 && bits_to_decode() >= 11 ) {
 extensionIdentifier; 11 bslbf
}
if ( extensionIdentifier == 0x7cc ) {
 extensionIdentifier = -1;
 if ( audioObjectType != 44 && bits_to_decode() >= 1 ) {
  ldmpsPresentFlag; 1 uimsbf
  if ( ldmpsPresentFlag == 1 ) {
  ldsacPayloadEmbedding = 1;
  ldsscLen; 8 uimsbf
  if ( ldsscLen == 0xff ) {
   ldsscLenExt; 16 uimsbf
   ldsscLen += ldsscLenExt;
  }
  LDSpatialSpecificConfig();
  }
 }
}
if (extensionIdentifier == -1 && bits_to_decode() >= 11 ) {
 extensionIdentifier; 11 bslbf
}
if ( extensionIdentifier == 0x7cd ) {
 extensionIdentifier = -1;
 if ( audioObjectType != 45 && bits_to_decode() >= 1 ) {
  saocDePresentFlag; 1 uimsbf
  if ( saocDePresentFlag == 1 ) {
  saocDePayloadEmbedding = 1;
  saocDescLen; 8 uimsbf
  if ( saocDescLen == 0xff ) {
   saocDescLenExt; 16 uimsbf
   saocDescLen += saocDescLenExt;
  }
  SaocDeSpecificConfig();
  }
 }
}
}

After 1.6.2.1.19 add the new subclause as follows:
1.6.2.1.20 SaocDeSpecificConfig
Defined in ISO/IEC 23003-2:2010/Amd.3.

In 1.6.2.2.1 extend Table 1.17 “Audio Object Types” as follows:
8 © ISO/IEC 2013 – All rights reserved

---------------------- Page: 11 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
Table 1.17 — Audio Object Types
Object Audio Object Type definition of elementary stream Mapping of audio payloads to
Type payloads and detailed syntax access units and elementary
ID streams
0 NULL

43 SAOC ISO/IEC 23003-2
44 LD MPEG Surround ISO/IEC 23003-2
45 SAOC-DE ISO/IEC 23003-2:2010/Amd.3


In 1.6.3.5 channelConfiguration replace:

A four bit field indicating the audio output channel configuration:

Table 1.19 – Channel Configuration
value number of audio syntactic elements, channel to speaker mapping
channels listed in order received
0 - - defined in AOT related SpecificConfig
1 1 single_channel_element() center front speaker
2 2 channel_pair_element() left, right front speakers
3 3 single_channel_element(), center front speaker,
channel_pair_element() left, right front speakers
4 4 single_channel_element(), center front speaker,
channel_pair_element(), left, right center front speakers,
single_channel_element() rear surround speakers
5 5 single_channel_element(), center front speaker,
channel_pair_element(), left, right front speakers,
channel_pair_element() left surround, right surround rear speakers
6 5+1 single_channel_element(), center front speaker,
channel_pair_element(), left, right front speakers,
channel_pair_element(), left surround, right surround rear speakers,
lfe _element() front low frequency effects speaker
7 7+1 single_channel_element(), center front speaker
channel_pair_element(), left, right center front speakers,
channel_pair_element(), left, right outside front speakers,
channel_pair_element(), left surround, right surround rear speakers,
lfe_element() front low frequency effects speaker
8-15 - - reserved

With:
A four bit field that carries a ChannelConfiguration value as defined in ISO/IEC 23001-8, “Coding Independent
Code Points”. Table 1.19 lists the valid channel configurations for use in MPEG-4 Audio and documents the
association of channel elements to speaker positions: the third column lists the audio syntactic elements in the
order in which they occur in each access unit, and the fourth column lists the channel to speaker mapping
where the first channel element listed maps to the first speaker listed. In the case of channel_pair_elements(),
the first channel element that occurs within the channel pair maps to the next speaker listed. The columns of
Table 1.19 that duplicate information from 23001-8 are informative, as 23001-8 is normative. The mapping of
loudspeaker names to loudspeaker positions is as defined in by OutputChannelPosition in ISO/IEC 23001-8.

© ISO/IEC 2013 – All rights reserved 9

---------------------- Page: 12 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
Table 1.19 – Channel Configuration
value number of audio syntactic elements, channel to speaker mapping
channels listed in order received
0 - - defined in AOT related SpecificConfig
1 1 single_channel_element() center front speaker
2 2 channel_pair_element() left, right front speakers
3 3 single_channel_element(), center front speaker,
channel_pair_element() left, right front speakers
4 4 single_channel_element(), center front speaker,
channel_pair_element(), left, right front speakers,
single_channel_element() rear center speakers
5 5 single_channel_element(), center front speaker,
channel_pair_element(), left, right front speakers,
channel_pair_element() left surround, right surround speakers
6 5.1 single_channel_element(), center front speaker,
channel_pair_element(), left, right front speakers,
channel_pair_element(), left surround, right surround speakers,
lfe _element() low frequency enhancement peaker
7 7.1 Front single_channel_element(), center front speaker
channel_pair_element(), left, right front center speakers,
channel_pair_element(), left, right front speakers,
channel_pair_element(), left surround, right surround speakers,
lfe_element() low frequency enhancement speaker
8-10 - - reserved
11 6.1 single_channel_element(), center front speaker,
channel_pair_element(), left, right front speakers,
channel_pair_element(), left surround, right surround speakers,
single_channel_element(), rear center speaker,
lfe_element() low frequency enhancement speaker
12 7.1 Back single_channel_element(), center front speaker
channel_pair_element(), left, right front speakers,
channel_pair_element(), left surround, right surround speakers,
channel_pair_element(), rear surround left, right speakers,
lfe_element() low frequency enhancement speaker
13 22.2 single_channel_element(), center front speaker,
channel_pair_element(), left, right front center speakers,
channel_pair_element(), left, right front speakers,
channel_pair_element(), left, right side surround speakers,
channel_pair_element(), rear left, right surround speakers,
single_channel_element(), rear center speaker,
lfe_element(), left front low freq. enhancement speaker,
lfe_element(), right front low freq. enhancement speaker,
single_channel_element(), center front vertical height speaker,
channel_pair_element(), left, right front vertical height speakers,
channel_pair_element(), left, right vertical height side surr. speakers,
single_channel_element(),
top center surround speaker,
channel_pair_element(), left, right surr. vertical height rear speakers,
single_channel_element(),
center vertical height rear speaker,
single_channel_element(),
center front vertical bottom speaker,
channel_pair_element()
left, right front vertical bottom speakers
10 © ISO/IEC 2013 – All rights reserved

---------------------- Page: 13 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
14 7.1 Top single_channel_element(), center front speaker,
channel_pair_element(), left, right front speakers,
channel_pair_element(), left surround, right surround speakers,
lfe_element(), low frequency enhancement speaker,
channel_pair_element() left, right front vertical height speakers
15 - - reserved

After 1.6.3.25 add the new text as follows:
1.6.3.26 saocDePayloadEmbedding
The audio Object Type ID 45 SAOC-DE is used to convey spatial audio object coding side information for
SAOC-DE decoding as defined in ISO/IEC 23003-2:2010/Amd.3. Depending on this flag, the SAOC-DE data
payload, i.e., SaocDeFrame(), is available by different means:
Table AMD4.13 – saocDePayloadEmbedding
saocDePayloadEmbedding Meaning
One SaocDeFrame() is mapped into one access unit.
Subsequent access units form one elementary
0 stream. That elementary stream will always depend
on another elementary stream that contains the
underlying (downmixed) audio data.
The top level payload is multiplexed into the
underlying (downmixed) audio data. The actual
multiplexing details depend on the presentation of the
audio data (i.e., usually on the AOT). Note that this
1 leads to an elementary stream with no real payload.
That elementary stream will always depend on
another elementary stream that contains both, the
underlying (downmixed) audio data and the
multiplexed spatial audio data.
1.6.3.27 saocDePresentFlag
A one bit field indicating the presence or absence of SAOC-DE data. The value –1 indicates that the
saocDePresentFlag was not conveyed in the AudioSpecificConfig().
1.6.3.28 saocDescLen
A helper variable indicating the number of bytes of the subsequent SaocSpecificConfig() data function
including possible fill bits.
1.6.3.29 saocDescLenExt
A helper variable indicating the additional number of bytes of the subsequent SaocSpecificConfig() data
function including possible fill bits.

© ISO/IEC 2013 – All rights reserved 11

---------------------- Page: 14 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
After 4.4.1.1 Program config element, after Table 4.2 – Syntax of program_config_element(), add the following
subclause:
4.4.1.2 Height extension element
Table AMD4.1 - Syntax of height extension element
Syntax No. of Bits Mnemonic
height_extension_element(comment_field_bytes) {
PCE_HEIGHT_EXTENSION_SYNC; 8 bslfb
for (i = 0; i < num_front_channel_elements; i++){
front_element_height_info[i]; 2 bslbf
}
for (i = 0; i < num_side_channel_elements; i++){
side_element_height_info[i]; 2 bslbf
}
for (i = 0; i < num_back_channel_elements; i++){
back_element_height_info[i]; 2 bslbf
}
byte_alignment();
height_info_crc_check 8 rpchof
}
Note 1: height_extension_element() may be embedded in comment_field_data[i] of
program_config_element().

The height extension element should not be used if it is possible to signal presence of height channels via
channelConfiguration.
The height extension elements must not change the channel configuration as defined via the
channelConfiguration element.

In 4.4.2.7 extend Table 4.57 “Syntax of extension_payload()” as follows:
Table 4.57 – Syntax of extension_payload()
Syntax No. of bits Mnemonic
extension_payload(cnt)
{
extension_type; 4 uimsbf
align = 4;
switch( extension_type ) {
 case EXT_DYNAMIC_RANGE:
  return dynamic_range_info();
 case EXT_SAC_DATA:
  return sac_extension_data(cnt);
 case EXT_SAOC_DATA:
  return saoc_extension_data(cnt);
 case EXT_LDSAC_DATA:
  return ldsac_extension_data(cnt);
 case EXT_SBR_DATA:
  return sbr_extension_data(id_aac, 0); Note 1
 case EXT_SBR_DATA_CRC:
  return sbr_extension_data(id_aac, 1); Note 1
12 © ISO/IEC 2013 – All rights reserved

---------------------- Page: 15 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
 case EXT_SAOC_DE_DATA:
  return saoc_de_extension_data(cnt);
 case EXT_DATA_LENGTH:
  …

In 4.4.2.7 after Table 4.61 add new Table AMD4.14 “saoc_de_extension_data()” as given below:
Table AMD4.14 – Syntax of saoc_de_extension_data()
Syntax No. of bits Mnemonic
saoc_de_extension_data(cnt)
{
ancType; 2 uimsbf
ancStart; 1 uimsbf
ancStop; 1 uimsbf
for (i=0; i  ancDataSegmentByte[i]; 8 bslbf
}
return (cnt);
}


After 4.5.1.2 Program config element (PCE),add the following subclause:

4.5.1.3 Height extension element
4.5.1.3.1 Definitions
4.5.1.3.1.1 Data elements
PCE_HEIGHT_EXTENSION_SYNC 0xac
front_element_height_info a two-bit field containing height information
side_element_height_info a two-bit field containing height information
back_element_height_info a two-bit field containing height information
Table AMD4.2 - Definition of height information
front_element_height_info
side_element_height_info height information
back_element_height_info
0 Normal height
1 Top speaker
2 Bottom speaker
3 Reserved

If no height extension element is transmitted, height information is set to “Normal height”.
© ISO/IEC 2013 – All rights reserved 13

---------------------- Page: 16 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
In the PCE the order of channel elements within the front, side and back channel elements shall follow the
order as implied by Table AMD4.2 i.e. normal height elements first followed by Top and Bottom height
speakers.

height_info_crc_check 8 bit CRC error detection data generated as described in 7.3.4.6. of
ISO/IEC23003-1, CRC Polynom: G(X) = X8 + X2 + X + 1, with initial shift register
state 0xFF.

4.5.1.3.1.2 Help elements
crc_calc() Function that returns CRC value calculated by data of all
front_element_height_info, side_element_height_info and
back_element_height_info.

The following bits are protected and fed into the CRC algorithm in order of their appearance:
1) 8-bit height info sync word
2) front_element_height_info
3) side_element_height_info
4) back_element_height_info
5) byte_alignment bits

NOTE: Channel configurations containing other than normal height speakers should be signaled through
explicit channel configurations as defined in Table 1.19, if possible.
In 4.4.2.1 Payloads for the audio object types AAC main, AAC SSR, AAC LC and AAC LTP, after Table 4.10 –
Syntax of data_stream_element(), insert the following tables:

Table AMD4.3 - Syntax of MPEG4_ancillary_data
Syntax No. of Bits Mnemonic
MPEG4_ancillary_data() {

ancillary_data_sync; bslbf
8
mpeg_audio_type 2 bslbf
dolby_surround_mode 2 bslbf
drc_presentation_mode 2 bslbf
stereo_downmix_mode; 1 bslbf
bs_info_reserved, set to “0” 1 bslbf
anc_data_status_reserved, set to “000” 3 bslbf
downmixing_levels_MPEG4_status 1 bslbf
ancillary_data_extension_status; 1 bslbf
audio_coding_and_compression_status 1 bslbf
coarse_grain_timecode_status 1 bslbf
fine_grain_timecode_status 1 bslbf
14 © ISO/IEC 2013 – All rights reserved

---------------------- Page: 17 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
if (downmixing_levels_MPEG4_status == 1) {
center_mix_level_on 1 bslbf
center_mix_level_value 3 bslbf
surround_mix_level_on 1 bslbf
surround_mix_level_value 3 bslbf
}
If (audio_coding_mode_and_compression_status == 1) {
audio_coding_mode_reserved 7 bslbf
compression_on 1 bslbf
compression_value 8 bslbf
}
if(coarse_grain_timecode_status == 1)
coarse_grain_timecode 16 bslbf
if(fine_grain_timecode_status == 1)
fine_grain_timecode 16 bslbf
if (ancillary_data_extension_status == 1){
reserved, set to "0” 1 bslbf
ext_downmixing_levels_status; 1 bslbf
ext_downmixing_global_gains_status; 1 bslbf
ext_downmixing_lfe_level_status; 1 bslbf
reserved, set to "0000” 4 bslbf

if (ext_downmixing_levels_status == 1){
 dmix_a_idx; 3 bslbf
 dmix_b_idx; 3 bslbf
 reserved, set to "00” 2 bslbf
}
if (ext_downmixing_global_gains_status == 1) {
 dmx_gain_5_sign; 1 bslbf
 dmx_gain_5_idx; 6 bslbf
 reserved, set to "0” 1 bslbf
 dmx_gain_2_sign; 1 bslbf
 dmx_gain_2_idx; 6 bslbf
 reserved, set to "0” 1 bslbf
}
if (ext_downmixing_lfe_level_status == 1) {
 dmix_lfe_idx; 4 bslbf
 reserved, set to "0000” 4 bslbf
}
}
}


In 4.5.2.1.1 after::
data_stream_element() Abbreviation DSE. Syntactic element that contains data. Again, there are 16
element_instance_tags. There is, however, no restriction on the number of
data_stream_element()’s with any one instance tag, as a single data stream
may continue across multiple data_stream_element()’s with the same
instance tag.
Add:
The DSE may contain a MPEG4_ancillary_data element as described in
Table AMD4.3.

© ISO/IEC 2013 – All rights reserved 15

---------------------- Page: 18 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
In 4.5.2.1.4 Data stream element (DSE) after:
See ISO/IEC13818-7 (13818-7:2005, subclause 8.6 "Data Stream Element (DSE)").
Add:
A DSE containing a MPEG4_ancillary_data element shall have the same element_instance_tag as the first
channel element it is associated to. It should follow after the last channel or SBR element it is associated to.

In 4.5.2.9.3 extend Table 4.121 “Values of the extension_type field” as follows:
Table 4.1
...

INTERNATIONAL ISO/IEC
STANDARD 14496-3
Fourth edition
2009-09-01
AMENDMENT 4
2013-12-15

Information technology — Coding of
audio-visual objects —
Part 3:
Audio
AMENDMENT 4: New levels for AAC profiles
Technologies de l'information — Codage des objets audiovisuels —
Partie 3: Codage audio
AMENDEMENT 4: Nouveaux niveaux pour profils AAC




Reference number
ISO/IEC 14496-3:2009/Amd.4:2013(E)
©
ISO/IEC 2013

---------------------- Page: 1 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)



This CD-ROM contains:
1) the publication ISO/IEC 14496-3:2009/Amd.4:2013 in portable document format (PDF), which can be
viewed using Adobe® Acrobat® Reader;
2) electronic attachments.
Adobe and Acrobat are trademarks of Adobe Systems Incorporated.


COPYRIGHT PROTECTED DOCUMENT


©  ISO/IEC 2013
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form or by any
means, electronic or mechanical, including photocopying, or posting on the internet or an intranet, without prior written permission.
Permission can be requested from either ISO at the address below or ISO’s member body in the country of the requester.
ISO copyright office
Case postale 56  CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland

ii © ISO/IEC 2013 – All rights reserved

---------------------- Page: 2 ----------------------
ISO/IEC 14496-3:2009/Amd.4:2013(E)
Installation
If this publication has been packaged as a zipped file, do NOT open the file from the CD-ROM, but copy it
...

Questions, Comments and Discussion

Ask us and Technical Secretary will try to provide an answer. You can facilitate discussion about the standard in here.