Principles and applications of 3D reproduction using...

Preview:

Citation preview

Principles and applications of 3D reproduction using Ambisonics

Alois Sontacchi

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi 2

@ the focus of Art and Technology

Institute of Electronic Music and Acoustics

University of Music and Performing Arts Graz

AUSTRIA - 8010 Graz - Inffeldgasse 10/3

3

artistic/scientific technical/scientific

Professors 1 1

Academic non-professorial teaching staff (scientists) 6 12

Academic non-professorial teaching staff (lectors) 2 7

Other staff members5

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

4

Education

• Musicology (Bachelor, Master) in cooperation with KFU Graz

• Composition - Computer Music (Master + NEW: Bachelor)

• Electrical and Audio Engineering (Bachelor, Master) in cooperation with TU Graz

• Doctoral school

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

5

Artistic Research

sound & spaceembodimentalgorithmic composition

Signal Processing and Acoustics

spatial audioaudio signal processingpsychoacoustics

Computer Music

sonificationinteraction design

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Research Areas Sound & Music Computing

6

Motivation

What is the necessity for 3D Audio production ?

Games & Entertainment

Music Performance / Composition

Listening Test: In-Situ Simulation- speech intelligibility in noise

- evaluation of hearing-aids

- improvements of ANC-applications

Immersive Telepresence

LoRA A Loudspeaker-Based Room Auralization SystemS. Favrot, J. M. Buchholz, Acta Acoustica Vol. 96, p.364-375, 2010.

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

7

Outline

Introduction

Ambisonics Theory

Optimization criteria for reproduction quality

Applications

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

8

Psychologists/psychophysicists discover „phantom source“ around 1900

- „phantom“ sound localised, where there‘s nothing physicalmedical term such as „phantom pain“

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

9

X/Y uses first-order transduer characteristics to produce psychoacoustic effect

Introduction

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

10

X/Y uses first-order transduer characteristics to produce psychoacoustic effect

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

11

X/Y uses first-order transduer characteristics to produce psychoacoustic effect

-15dB

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

12

M/S is useful for modifying coincidence stereophony image

- + +

+- ++Encoding

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

13

M/S is useful for modifying coincidence stereophony image

- + +

+- ++Encoding

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

14

M/S is useful for modifying coincidence stereophony image

- + +

+- ++

- + +

+-++

But don‘t increase S: out-of-phase

Encoding

Decoding

Introduction

Advancements of Ambisonics

15

Soundfield microphone

A equilateral triangle is a regular 2D simplex; for 3D: tetrahedron

- +-

+

+

+ + - -

FLD FRU

BLU BRD

Front/Back,Left/Right,Up/Down

+

y

FRU

BRD

FLD

BLUxy

z

+-

x z w

+ - - + + + +-+ - +

Encoding

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

16

Decoding w by sampling

Simplistically

-+

- ++

+

+

+

+

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

17

Decoding x by sampling

Simplistically

-+

- ++

- +

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

18

Simplistically

-+

- ++

-

+

Decoding y by sampling

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

19

Matrix(Decoder)

Lspk. setup

The entire encoding and decoding signal chain

Matrix(Encoder)

Mic. arrayconfig. Andproperties

Ambisonics format

instead of array recording, monosignals can be encoded by vector.

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

20

Matrix(Decoder)

Lspk. setup

The entire encoding and decoding signal chain

Matrix(Encoder)

Mic. arrayconfig. Andproperties

instead of array recording, monosignals can be encoded by vector.

Experimental evidence: what is heard ?

How to arrange loudspeakers optimally?

Ambisonics format

How to best shape the decoder?

Is it sound field synthesis? What‘s its theory?

Higher-order recording unclearbefore 2000… It won‘t workanymore using 2D/3D directionvectors

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Introduction

The theory behind sound field synthesis lies in theKirchhoff-Helmholtz integral (KHI) equation

21

Theory of sound field synthesis, see Schenk (CHIEF point method) or Burton-Miller solution method

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

22

d/dn p|Γ is the surface normal derivative of sound pressure(proportional to normal component of particle velocity)

Theory of sound field synthesis, see Schenk (CHIEF point method) or Burton-Miller solution method

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

The theory behind sound field synthesis lies in the KHI equation and equivalent scattering problem

23

For simplicity (no encoding of source distance):Considering amplitude panning, the driving function onlyselects active sources on the boundary.

g is a virtual panning function: beause it is continuous(loudspeakers will be discrete sources)

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

The theory behind sound field synthesis lies in the KHI equation and equivalent scattering problem

24

Virtual panning function (with height)

red>0

blue<0

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

enc

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

25

Virtual panning function (with height)

red>0

blue<0

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

enc

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

26

Virtual panning function (with height)

red>0

blue<0

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Advancements of Ambisonics

27

Virtual panning function (with height)

red>0

blue<0

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

How to optimally arrange withloudspeakers?

Virtual panning function must bedecoded to discrete loudspeakers

How can they represent it well?

Advancements of Ambisonics

Trying to do sound field synthesis seriously, with spherical functions:

Panning and decoding [Zotter, Pomberger, Frank, „An Alternative Ambisonic Formulation“, 126th AES Conv, May, 2009.]

29

“psychoacoustic” measures were considered for decoding.

Based on theoretical consideration

f<1.5kHz

loudness direction of localisation

f>1.5kHz

loudness direction of localisation

Makita theory.

What is relevant for broad-band sounds? What can we demand?

Trying it more psychoacoustically:Gerzons measures for the decoding problem

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Jérôme Daniel‘s shapes for virtual panning functions: which is best?

Insertion of order weights an

basic

Max-rE

in-phase (a bit more complicated…)

30

Panning and decoding [Jérôme Daniel, PhD thesis, 2000]

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

31

loudness direction of localisation

loudness direction of localisation

Gerzons measures applied on virtual panning function

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

linear measures

quadratic measures

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

32

Virtual panning functions have nice properties:

panning-independent properties of Ambisonic panning functions:

• symmetric around panning direction ϑs

• amplitude integral is constant (V)

• directional amplitude concentration:- constant magnitude of concentration ||rV||

- always points towards panning direction ϑs=rV/||rV||

• energy integral is constant (E)

• directional energy concentration:- constant magnitude of concentration ||rE||

- always points towards panning direction ϑs=rE/||rE||

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

33

Nice properties might get lost after decoding:

• symmetric around panning direction ϑs

• amplitude integral is constant (V)

• directional amplitude concentration:- constant magnitude of concentration ||rV||

- always points towards panning direction ϑs=rV/||rV||

• energy integral is constant (E)

• directional energy concentration:- constant magnitude of concentration ||rE||

- always points towards panning direction ϑs=rE/||rE||

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Which are relevant? Psychoacoustically?

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Vector rE is most useful measuring localisation direction

34

broad-band sounds

VBAPVBIP

horizontal-only level panning

Psychoacoustically relevant factors [Matthias Frank… not yet published.]

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Which sampling preserves all energetic measures?

35

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

36

Virtual t-design Ambisonics using VBAP

Fig. 6: Triangulation, energy measure, and spread measure as a function of the virtual source direction. [Zotter, Frank, Sontacchi, EAA 2010] [Zotter, Frank, JAES 2012],

PhD-Thesis Matthias Frank, 2013

VBAP

Virtualt-dsgn.Ambi.N = 5

00.510

10

20

30

40

50

60

70

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

37

VBAP and Ambisonics compared

Triplet-wise panning (VBAP)+ constant loudness+ arbitrary layout-- varying spread

Ambisonic Panning~+ constant loudness+ arbitrary layout~+ invariant spread

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Principles and applications of 3D reproduction using Ambisonics Alois Sontacchi

Psychoacousticians are sceptical aboutAmbisonics with height.

• Two vertically stacked loudspeakers do not yield a stablephantom source; should not be more than 30° separated

• A loudspeaker triplet does not yield stable phantom sources in all orientations, and results are individual

• However: What if we do not understand Ambisonics as a simple pair ortriplet?How diffuse must it be to obtain stable playback?

• [Results Braun/Frank/Pomberger 2011]

Advancements of Ambisonics 38

39

[Sabastian Braun, Matthias Frank, „Localization of 3D Ambisonic Recordings …“ ICSA, 2011]

Advancements of Ambisonics

40

[Sabastian Braun, Matthias Frank, „Localization of 3D Ambisonic Recordings …“ ICSA, 2011]

Advancements of Ambisonics

Application for Loudspeaker Reproduction

41Advancements of Ambisonics

1 2

Ambisonics Decoder

Encoderposition Encoder position

Ambi-sonicsBus

3 4

Encoderposition Encoder position

AmbisonicsRecording

rotation

Live electronics in modern operas with Ambisonics:

Alois Sontacchi 2002Thomas Musil

Markus Noisternig 2004Thomas Musil

Peter Plessas 2010

Pomberger, Zotter: An Ambisonics Format for flexible playback layouts, 1st Int. Ambisoncis Symposium, 2009.

Application for Headphone Reproduction

42Advancements of Ambisonics

Sources Encoding Room Head Rotation Filter Reproduction

43

I would particularly like to thank

Franz Zotter

Matthias Frank

Hannes Pomberger

Advancements of Ambisonics

44

Available software tools

• Tools:

CUBEmixer (IEM)

spat (ircam)

AmbDec, Amb plugins (Fons Adriaensen)

ICST Max/MSP plugins

plogue bidule

• Soon more to come.

• If you are a mixing desk manufacturer…

Advancements of Ambisonics

45

Format

• Format

• International Conference on Ambisonics and Spherical Acousticshttp://www.aes-uk.org/uk-conference/25th-call-for-papers/

Advancements of Ambisonics

46

How to record higher order? – upscaling methods from first order

- DirAC: Fraunhofer / V. Pulkki

builds intensity vector from p, vx, vy, vz at each frequencyto assign narrow direction

determines diffuseness

- HARPEX: S. Berge

can isolate even two signal directions at each frequency

Panning and decoding [Franz Zotter, Matthias Frank, „All-Round Ambisonic Panning and Decoding“, JAES, October 2012]

Advancements of Ambisonics

47

Meyer, Elko 2000-

Petersen, Hald, Moller-Juhl, B&K, 2003-

Jin, v.Schaik, 2006-

Li, Duraiswami, O‘Donavan, Grassi, visisonics, 2004-

Rafaely&Park, 2004-

Higher order microphone arrays

Takashima, Nakagawa,Williams, 01db, 2008

Advancements of Ambisonics

Pausch, ZotterPomberger, 2013

40 years of coincidence recording yield Ambisonics

• 1930s- Blumlein‘s coincidentdirectional microphone pair (X-Y, M/S)

- Olson‘s ribbon microphones (figure-of-eight)

• 1940s- microphones with acoustically switchable characteristics

- Bauer‘s supercardioid has maximal front/back ratio

• 1950s- Neumann‘s U57 and AKG‘s C12 (double membrane) studio microphones allow electronic switching

- AKG‘s logo

- Vinyl and tape records available in stereo

- (many non-coincident A/B)

• 1960s- BBC broadcasts in stereo

- Psychoacoustics on stereo

• 1970s- Quadraphony

- Matrix stereophony: encoding/decoding in azimuthalharmonics (Cooper, Shiga)

- Ambisonics(Gerzon/Fellgett/Craven): recording: CALREC-triplet, Soundfield microphone, decoding/encoding circuits

- Panning in higher orders

Time-line

40 years later: a hand-full of higher orders, enriched by sound field synthesis theory, psychoacoustic experiments, and a community

• 1970s to 1990s: - theoretical psychoacoustic decoding, electronics (Gerzon) V,E,rV,rE

• 1990s: - binaural Ambisonics (Jot)- Sursound list- Pulkki‘s papers and experiments about 3D stereophony/VBAP

• 2000s: - Daniel (300 pages: theory), - Ward/Abhayapala truncation error- Malham (energetic distribution), - Sontacchi/Höldrich/Noisternig/Zmölnig(IEM-Bin-Ambi, IEM-CUBE

- Poletti („Mode-matching“),

• More in 2000s: - Meyer/Elko(„Eigenmike“, 4th-order recording)

• End of 2000s:- experimental psychoacosutics: Bertet, Solvang, Kratschmer, Frank, Bates, Marentakis

- Advancement in theory(concepts, sampling, truncation): Ahrens/Spors, Fazi, Li, Rafaely

- First Ambisonics Symposium: HRTFs, musical instrumentradiation, embedding of Ambiinto scene description, …

- Zotter/Pomberger/Frank …

Time-line

Recommended