Friday, May 3, 2019 · 10:15 a.m. · 13m 00s
Abstract: A recent trend in audio and speech processing is to train neural networks directly on raw waveforms for various classification tasks. While this approach has been shown to perform well, there is limited understanding of what kind of information the neural networks learn from the waveforms. Such insight is interesting not only for advancing these techniques but also for better understanding the characteristics of audio and speech signals. In this talk, taking inspiration from the vision community, I will present a gradient-based visualization method that can provide insight into which spectral characteristics of a given input have the highest impact on the prediction score. I will demonstrate the potential of the proposed approach on two classification tasks: phoneme recognition and speaker identification.
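The abstract does not spell out the specific method; as a rough illustration only, the sketch below shows a generic gradient-based saliency computation in PyTorch, assuming a hypothetical classifier `model` that maps a raw-waveform tensor to class logits (the speaker's actual approach may differ).

    # Minimal gradient-saliency sketch (illustrative, not the presented method).
    # Assumes `model` maps a waveform tensor of shape (1, num_samples) to logits.
    import torch

    def waveform_saliency(model, waveform, target_class):
        """Return |d score / d input sample| for a single waveform."""
        model.eval()
        x = waveform.clone().detach().requires_grad_(True)   # (1, num_samples)
        score = model(x)[0, target_class]                     # scalar prediction score
        score.backward()                                      # gradient w.r.t. input samples
        return x.grad.abs().squeeze(0)                        # per-sample sensitivity

The per-sample gradient can then be inspected in the frequency domain, for example via a short-time Fourier transform, to see which spectral components most influence the prediction.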