1 |
Tue |
Aug 26 |
Introduction |
Zhiyao Duan |
Lyon: Machine Hearing: An Emerging Field |
|
Course Project;
Paper Reviews;
HW0 |
|
1 |
Thu |
Aug 28 |
Auditory Scene Analysis |
Zhiyao Duan |
Bregman: ASA Book Chapter 1 |
Wang & Brown: CASA Book Chapter 1 |
HW1 |
|
2 |
Tue |
Sep 2 |
Signal Processing Review |
Zhiyao Duan |
Mueller: Fundamentals of Music Processing Book Chapter 2 |
|
|
|
2 |
Thu |
Sep 4 |
Python Programming for Audio\
Slides |
Xingjian Du |
librosa;
Audio Input Representations |
MIR with Python;
Python for Scientific Audio |
|
|
|
3 |
Tue |
Sep 9 |
Single Pitch Detection |
Zhiyao Duan |
Cheveigne: CASA Book Chapter 2.1-2.3 |
Cheveigne & Kawahara: YIN;
Kim et al: CREPE;
Gfeller et al: SPICE;
Riou et al: PESTO |
HW2 |
HW1 |
3 |
Thu |
Sep 11 |
Human Auditory Sensation |
Zhiyao Duan |
Yost: Hearing Book Chapter 11 |
Patterson: Auditory Images;
Lyon et al: Sparse Auditory Representations |
|
Paper Review 1 (due Sun) |
4 |
Tue |
Sep 16 |
Human Auditory Sensation |
Zhiyao Duan |
Yost: Hearing Book Chapter 13 |
Shamma: Encoding Sound Timbre in the Auditory System
Wang & Shamma: Spectral Shape Analysis |
|
|
4 |
Thu |
Sep 18 |
Rhythm Analysis |
Zhiyao Duan |
Mueller: Fundamentals of Music Processing Book Chapter 6
Ellis: Beat Tracking by Dynamic Programming |
Klapuri et al: Meter Analysis
Heydari et al: BeatNet
Zhao et al: Beat Transformer
Foscarin et al: Beat This!
Desblancs et al.: Zero Note Mamba
Chang & Su: Beast |
HW3 |
HW2 |
5 |
Tue |
Sep 23 |
Timbre Representation |
Huiran |
Herrera-Boyer et al: Signal Processing Methods for Music Transcription Book Chapter 6 |
Childers et al.: The Cepstrum;
Davis & Mermelstein: MFCC
Huang et al: Music Timbre Style Transfer |
|
|
5 |
Thu |
Sep 25 |
Timbre Representation |
Huiran |
Tzanetakis: Music Data Mining Book Chapter 2 |
Hermansky: PLP;
Hermansky & Morgan: RASTA
Wu et al: Transplayer |
|
|
6 |
Tue |
Sep 30 |
NMF Audio Modeling |
Zhiyao Duan |
Smaragdis & Brown: NMF Polyphonic Music Transcription |
Lee & Seung: NMF |
HW4 |
HW3 |
6 |
Thu |
Oct 2 |
More on NMF |
Zhiyao Duan |
Smaragdis et al.: PLCA |
Virtanen: Monaural Sound Source Separation |
|
Paper Review 2 (due Fri) |
7 |
Tue |
Oct 7 |
HMM Audio Modeling |
Zhiyao Duan |
Rabiner: HMM |
Mysore: PhD Thesis Chapter 2 |
|
|
7 |
Thu |
Oct 9 |
Deep Learning for Audio
CIRC Intro; Bluehive Cheat Sheet |
Zhiyao Duan |
Goodfellow et al.: Deep Learning Book Chapter 6 |
Hinton et al.: DNN for Speech Recognition; |
HW5 |
HW4 |
8 |
Tue |
Oct 14 |
NO CLASS: Fall Break |
|
How to write a paper?
How to give a talk?
How to make a poster? |
|
|
|
8 |
Thu |
Oct 16 |
Deep Learning Implementation
PyTorch 101
|
TBD |
Goodfellow et al.: Deep Learning Book Chapter 9 |
DNN for Speech Separation;
Huang et al: Singing Voice Separation by RNN |
|
Project Proposal |
9 |
Tue |
Oct 21 |
BP derivation |
Zhiyao Duan |
Goodfellow et al.: Deep Learning Book Chapter 14 |
Schluter & Bock: Onset Detection by CNN;
Hamel & Eck: Music Feature Learning with DBN;
|
|
|
9 |
Thu |
Oct 23 |
Speech Technology |
Zhiyao Duan |
Ravanelli et al: SpeechBrain, Park et al: Review of Speaker Diarization
ASVSpoof2019 |
Extended Reading
Kassis & Hengartner: Breaking Voice Authentication |
HW6 |
HW5 |
10 |
Tue |
Oct 28 |
Voice Conversion |
Zhiyao Duan |
Sisman et al.: Overview |
Qian et al.: AutoVC; Sun et al: PPG; Li et al.: StarGANv2-VC |
|
Paper Review 3 |
10 |
Thu |
Oct 30 |
Multi-pitch Analysis |
Zhiyao Duan |
Cheveigne: CASA Book Chapter 2 |
Klapuri: Harmonicity and Spectral Smoothness
Duan et al: Peak and Non-peak Region |
|
|
11 |
Tue |
Nov 4 |
Multi-pitch Analysis |
Zhiyao Duan |
Duan et al: Multi-pitch Streaming |
Poliner & Ellis: Discriminative Model;
Sigtia et al.: Neural Network for Piano Transcription |
|
HW6 |
11 |
Thu |
Nov 6 |
Self Supervised Learning for Music Understanding |
Frank Cwitkowitz |
|
|
|
|
12 |
Tue |
Nov 11 |
Score-Informed Source Separation |
Zhiyao Duan |
Dannenberg & Raphael: Alignment and Accompaniment;
Ewert et al: SISS Overview |
Ewert & Muller: Score-informed NMF;
Duan et al: Soundprism |
|
|
12 |
Thu |
Nov 13 |
Audio-Visual Scene Understanding |
Zhiyao Duan |
Arandjelovic & Zisserman: Objects that Sound; Owens & Efros: AV Scene Analysis |
Arandjelovic & Zisserman: Look Listen and Learn; Zhao et al.: Sounds of Pixels |
|
|
13 |
Tue |
Nov 18 |
Project Status Update in Zhiyao's Office |
Students |
Check Google Doc for Schedule |
|
|
Project Status Update |
13 |
Thu |
Nov 20 |
Room Acoustics and Spatial Audio |
Zhiyao Duan |
Machine Learning in Acoustics
Neural IIR Filter Field
|
Novel-View Acoustic Synthesis
HRTF Estimation in the Wild
|
|
|
14 |
Tue |
Nov 25 |
Multi-channel Source Localization and Separation |
Zhiyao Duan |
Stern et al: CASA Book Chapter 5;
Yilmaz & Rickard: DUET |
Woodruff & Wang: Binaural Localization Reverberant Noisy |
|
|
14 |
Thu |
Nov 27 |
NO CLASS: Happy Thanksgiving!
| |
|
|
|
|
15 |
Tue |
Dec 2 |
Interactive Music Systems |
Zhiyao Duan |
Gifford et al.: Computational Systems for Music Improvisation |
Tatar & Pasquier: Music Agents |
|
|
15 |
Thu |
Dec 4 |
Music Generation |
TBD |
Benetatos et al.: BachDuet; Dhawiwal et al.: Jukebox |
Hadjeres et al.: DeepBach; Roberts et al.: Hierarchical Latent Vector Model; Jaques et al.: Generating Music with Reinforcement Learning; |
|
|
16 |
TBD |
TBD |
Project Oral Presentations |
Students |
TBD |
|
|
Project Report Final;
Slides Final |