Skip to content

Commit

Permalink
Merge pull request #842 from AOMediaCodec/Center-to-Centre
Browse files Browse the repository at this point in the history
Fix #839 (Center to Centre)
  • Loading branch information
sunghee-hwang authored Jul 10, 2024
2 parents 65824de + 4aeef55 commit f039769
Showing 1 changed file with 9 additions and 9 deletions.
18 changes: 9 additions & 9 deletions index.bs
Original file line number Diff line number Diff line change
Expand Up @@ -285,12 +285,12 @@ Here are some typical IAMF use cases and examples of how to instantiate the mode
- UC4: Four [=Audio Element=]s for multi-language service (e.g., 5.1.2ch and 3 different Stereo dialogues, one for English, the second for Spanish, and the third for Korean) are delivered to an end-user device through a unicast network. The end-user (or the device) selects his preferred language so that 5.1.2ch and the Stereo dialogue associated with the language are rendered to the same loudspeaker layout and are mixed. After applying loudness normalization appropriate to its environment, the [=Rendered Mix Presentation=] is played back on the loudspeakers.

Example 1: UC1 with [=3D audio signal=] = 3.1.2ch.
- Audio Substream: The Left (L) and Right (R) channels are coded as one audio stream, the Left top front (Ltf) and Right top front (Rtf) channels as one audio stream, the Center channel as one audio stream, and the Low-Frequency Effects (LFE) channel as one audio stream.
- Audio Substream: The Left (L) and Right (R) channels are coded as one audio stream, the Left top front (Ltf) and Right top front (Rtf) channels as one audio stream, the Centre channel as one audio stream, and the Low-Frequency Effects (LFE) channel as one audio stream.
- Audio Element (3.1.2ch): Consists of 4 Audio Substreams which are grouped into one [=Channel Group=].
- Mix Presentation: Provides rendering algorithms for rendering the Audio Element to popular loudspeaker layouts and headphones, and the loudness information of the [=3D audio signal=].

Example 2: UC2 with two [=3D audio signal=]s = 5.1.2ch and Stereo.
- Audio Substream: The L and R channels are coded as one audio stream, the Left surround (Ls) and Right surround (Rs) channels as one audio stream, the Ltf and Rtf channels as one audio stream, the Center channel as one audio stream, and the LFE channel as one audio stream.
- Audio Substream: The L and R channels are coded as one audio stream, the Left surround (Ls) and Right surround (Rs) channels as one audio stream, the Ltf and Rtf channels as one audio stream, the Centre channel as one audio stream, and the LFE channel as one audio stream.
- Audio Element 1 (5.1.2ch): Consists of 5 Audio Substreams which are grouped into one [=Channel Group=].
- Audio Element 2 (Stereo): Consists of 1 Audio Substream which is grouped into one [=Channel Group=].
- Parameter Substream 1-1: Contains mixing parameter values that are applied to Audio Element 1 by considering the home environment.
Expand All @@ -306,7 +306,7 @@ Example 3: UC3 with two [=3D audio signal=]s = First Order Ambisonics (FOA) and
- Mix Presentation: Provides rendering algorithms for rendering Audio Elements 1 & 2 to popular loudspeaker layouts and headphones, mixing information based on Parameter Substreams 1-1 & 1-2, and loudness information of the [=Rendered Mix Presentation=].

Example 4: UC4 with four [=3D audio signal=]s = 5.1.2ch and 3 Stereo dialogues for English/Spanish/Korean.
- Audio Substream: The L and R channels are coded as one audio stream, the Left surround (Ls) and Right surround (Rs) channels as one audio stream, the Ltf and Rtf channels as one audio stream, the Center channel as one audio stream, and the LFE channel as one audio stream.
- Audio Substream: The L and R channels are coded as one audio stream, the Left surround (Ls) and Right surround (Rs) channels as one audio stream, the Ltf and Rtf channels as one audio stream, the Centre channel as one audio stream, and the LFE channel as one audio stream.
- Audio Element 1 (5.1.2ch): Consists of 5 Audio Substreams which are grouped into one [=Channel Group=].
- Audio Element 2 (Stereo dialogue for English): Consists of 1 Audio Substream which is grouped into one [=Channel Group=].
- Audio Element 3 (Stereo dialogue for Spanish): Consists of 1 Audio Substream which is grouped into one [=Channel Group=].
Expand Down Expand Up @@ -976,7 +976,7 @@ In this version of the specification, [=loudspeaker_layout=] indicates one of th

</table>

Where C: Center, L: Left, R: Right, Ls: Left Surround, Lss: Left Side Surround, Rs: Right Surround, Rss: Right Side Surround, Lrs: Left Rear Surround, Rrs: Right Rear Surround, Ltf: Left Top Front, Rtf: Right Top Front, Ltr: Left Top Rear, Rtr: Right Top Rear, Ltb: Left Top Back, Rtb: Right Top Back, LFE: Low-Frequency Effects
Where C: Centre, L: Left, R: Right, Ls: Left Surround, Lss: Left Side Surround, Rs: Right Surround, Rss: Right Side Surround, Lrs: Left Rear Surround, Rrs: Right Rear Surround, Ltf: Left Top Front, Rtf: Right Top Front, Ltr: Left Top Rear, Rtr: Right Top Rear, Ltb: Left Top Back, Rtb: Right Top Back, LFE: Low-Frequency Effects

NOTE: The Ltr and Rtr of 5.1.4ch down-mixed from 7.1.4ch is within the range of Ltb and Rtb of 7.1.4ch, in terms of their positions according to [[!ITU-2051-3]].

Expand Down Expand Up @@ -1120,7 +1120,7 @@ The [=Channel Group=] format SHALL conform to the following rules:
- [=Channel Group=] \(\text{#}i\) (as called DCG, \(i = 2, 3, \ldots, n)\): This [=Channel Group=] contains (\(\text{Ci} – \text{Ci}-1)\) number of channels. \((\text{Ci} – \text{Ci}-1)\) channel(s) consists of as follows:
- \((\text{Xi} – \text{Xi-1})\) surround channel(s) if \(\text{Xi} > \text{Xi-1}\) . When \(S_{\text{set}} = \{x \mid \text{Xi-1} < x \le \text{Xi}\} \) and \(x\) is an integer,
- If 2 is an element of \(S_{\text{set}}\), the L2 channel is contained in this \(CG \text{#}i\).
- If 3 is an element of \(S_{\text{set}}\), the Center channel is contained in this \(CG \text{#}i\).
- If 3 is an element of \(S_{\text{set}}\), the Centre channel is contained in this \(CG \text{#}i\).
- If 5 is an element of \(S_{\text{set}}\), the L5 and R5 channels are contained in this \(CG \text{#}i\).
- If 7 is an element of \(S_{\text{set}}\), the Lss7 and Rss7 channels are contained in this \(CG \text{#}i\).
- The LFE channel if \(\text{Yi} > \text{Yi-1}\).
Expand Down Expand Up @@ -1160,7 +1160,7 @@ The order of the [=Audio Substream=]s in each [=Channel Group=] (i.e., the seman
- The [=coupled substream=]s for the surround channels come first and are followed by the [=coupled substream=]s for the top channels.
- The [=coupled substream=]s for the front channels come first and are followed by the [=coupled substream=]s for the side, rear and back channels.
- The [=coupled substream=]s for the side channels come first and are followed by the [=coupled substream=]s for the rear channels.
- The Center (or Front Centre) channel comes first and is followed by the LFE (or LFE1) channel, and then the L channel.
- The Centre (or Front Centre) channel comes first and is followed by the LFE (or LFE1) channel, and then the L channel.

### Ambisonics Config Syntax and Semantics ### {#syntax-ambisonics-config}

Expand Down Expand Up @@ -1792,7 +1792,7 @@ class ReconGainInfoParameterData() {
<pre class = "def">
Byte postion : Bit position : Assigned Channel Name
: b0 (LSB) : Left channel
: b1 : Center channel
: b1 : Centre channel
LSB 7 bits : b2 : Right channel
of : b3 : Left surround channel (or Lss)
the 1st byte : b4 : Right surround channel (or Rss)
Expand Down Expand Up @@ -2539,7 +2539,7 @@ This section defines the renderer to use, given a channel-based [=Audio Element=
- If the input layout is [=22.2ch=], the EAR Direct Speakers renderer ([[ITU-2127-0]]) can be used, for example, to first render the input audio to 7.1.4ch, followed by down-mixing from 7.1.4ch to 3.1.2ch by using the down-mix matrix specified in [[#processing-downmixmatrix-static]].
- Else, the static down-mix matrices specified in [[#processing-downmixmatrix-static]] are used.
- Else if the surround channels (x) of the input layout > 3, the static down-mix matrices specified in [[#processing-downmixmatrix-static]] after inserting empty height channels into the input audio are used.
- Else, empty channels are padded to the input audio relevant to the input layout to make 3.1.2ch. In that case, Mono is regarded as a center channel.
- Else, empty channels are padded to the input audio relevant to the input layout to make 3.1.2ch. In that case, Mono is regarded as a Centre channel.


##### Configuring the EAR Direct Speakers Renderer (Informative) ##### {#processing-mixpresentation-rendering-m2l-configureear}
Expand Down Expand Up @@ -3114,7 +3114,7 @@ For a given channel-based input [=3D audio signal=] (e.g., 7.1.4ch) and a given
- It is composed of an Audio Scene Classification module and a Height Energy Quantification module as depicted in Figure 11-2.
- The Audio Scene Classification module generates 4 parameters (\(\alpha(k)\), \(\beta(k)\), \(\gamma(k)\), \(\delta(k)\)) by classifying audio scenes in the input [=3D audio signal=] into one of three modes.
1. Default scene: Neither Dialog nor Effect
2. Dialog scene: Center-channel oriented and clear dialog/voice sounds
2. Dialog scene: Centre-channel oriented and clear dialog/voice sounds
3. Effect scene: Directional and spatially moving sounds.
- The Height Energy Quantification module generates a surround-to-height mixing parameter ([=w(k)|\(w(k)\)=]) which is decided according to the relative energy difference between the top and surround channels of the input [=3D audio signal=].
- If the energy of the top channels is greater than that of surround channels, then [=w_idx_offset=](k) is set to 1. Otherwise, it is set to -1. Then, [=w(k)|\(w(k)\)=] is calculated based on [=w_idx_offset=](k) and conforms to [[#processing-scalablechannelaudio]].
Expand Down

0 comments on commit f039769

Please sign in to comment.