ECE 477 - Calendar

Course Calendar (Subject to Change)

Week	Day	Date	Topic	Speaker	Required Reading	Extended Reading	Assigned	Due
1	Tue	Aug 27	Introduction	Zhiyao Duan (Zoom)	Lyon: Machine Hearing: An Emerging Field		Course Project; HW0
1	Thu	Aug 29	Auditory Scene Analysis	Zhiyao Duan (Zoom)	Bregman: ASA Book Chapter 1	Wang & Brown: CASA Book Chapter 1	HW1
2	Tue	Sep 3	Signal Processing Review	Huiran Yu	Mueller: Fundamentals of Music Processing Book Chapter 2
2	Thu	Sep 5	Python Programming for Audio	Huiran Yu	librosa; Audio Input Representations	MIR with Python; Python for Scientific Audio
3	Tue	Sep 10	Single Pitch Detection	Zhiyao Duan	Cheveigne: CASA Book Chapter 2.1-2.3	Cheveigne & Kawahara: YIN; Kim et al: CREPE	HW2	HW1
3	Thu	Sep 12	Human Auditory Sensation	Zhiyao Duan	Yost: Hearing Book Chapter 11	Patterson: Auditory Images; Lyon et al: Sparse Auditory Representations
4	Tue	Sep 17	Human Auditory Sensation	Zhiyao Duan	Yost: Hearing Book Chapter 13	Shamma: Encoding Sound Timbre in the Auditory System Wang & Shamma: Spectral Shape Analysis
4	Thu	Sep 19	Rhythm Analysis	Zhiyao Duan	Mueller: Fundamentals of Music Processing Book Chapter 6 Ellis: Beat Tracking by Dynamic Programming	Klapuri et al: Meter Analysis Heydari et al: BeatNet	HW3	HW2
5	Tue	Sep 24	Timbre Representation	Zhiyao Duan	Herrera-Boyer et al: Signal Processing Methods for Music Transcription Book Chapter 6	Childers et al.: The Cepstrum; Davis & Mermelstein: MFCC
5	Thu	Sep 26	Timbre Representation	Zhiyao Duan	Tzanetakis: Music Data Mining Book Chapter 2	Hermansky: PLP; Hermansky & Morgan: RASTA
6	Tue	Oct 1	NMF Audio Modeling	Zhiyao Duan	Smaragdis & Brown: NMF Polyphonic Music Transcription	Lee & Seung: NMF	HW4	HW3
6	Thu	Oct 3	More on NMF	Zhiyao Duan	Smaragdis et al.: PLCA	Virtanen: Monaural Sound Source Separation
7	Tue	Oct 8	HMM Audio Modeling	Zhiyao Duan	Rabiner: HMM	Mysore: PhD Thesis Chapter 2
7	Thu	Oct 10	Deep Learning for Audio CIRC Intro; Bluehive Cheat Sheet	Zhiyao Duan	Goodfellow et al.: Deep Learning Book Chapter 6	Hinton et al.: DNN for Speech Recognition;	HW5	HW4
8	Tue	Oct 15	NO CLASS: Fall Break		How to write a paper? How to give a talk? How to make a poster?
8	Thu	Oct 17	Deep Learning Implementation PyTorch 101	Xingjian Du	Goodfellow et al.: Deep Learning Book Chapter 9	DNN for Speech Separation; Huang et al: Singing Voice Separation by RNN		Project Proposal
9	Tue	Oct 22	BP derivation	Zhiyao Duan	Goodfellow et al.: Deep Learning Book Chapter 14	Schluter & Bock: Onset Detection by CNN; Hamel & Eck: Music Feature Learning with DBN;
9	Thu	Oct 24	Speech Technology	Zhiyao Duan	Ravanelli et al: SpeechBrain, Park et al: Review of Speaker Diarization ASVSpoof2019	Extended Reading Kassis & Hengartner: Breaking Voice Authentication
10	Tue	Oct 29	Multi-pitch Analysis	Frank Cwitkowitz	Cheveigne: CASA Book Chapter 2	Klapuri: Harmonicity and Spectral Smoothness Duan et al: Peak and Non-peak Region		HW5
10	Thu	Oct 31	Multi-pitch Analysis	Zhiyao Duan	Duan et al: Multi-pitch Streaming	Poliner & Ellis: Discriminative Model; Sigtia et al.: Neural Network for Piano Transcription	HW6
11	Tue	Nov 5	Score-Informed Source Separation	Zhiyao Duan	Dannenberg & Raphael: Alignment and Accompaniment; Ewert et al: SISS Overview	Ewert & Muller: Score-informed NMF; Duan et al: Soundprism
11	Thu	Nov 7	Interactive Music Systems	Zhiyao Duan	Gifford et al.: Computational Systems for Music Improvisation	Tatar & Pasquier: Music Agents
12	Tue	Nov 12	Voice Conversion	Melissa Chen	Sisman et al.: Overview	Qian et al.: AutoVC; Sun et al: PPG; Li et al.: StarGANv2-VC		HW6
12	Thu	Nov 14	Room Acoustics and Spatial Audio	Neil Zhang	Machine Learning in Acoustics Neural IIR Filter Field	Novel-View Acoustic Synthesis HRTF Estimation in the Wild
13	Tue	Nov 19	Multi-channel Source Localization and Separation	Zhiyao Duan	Stern et al: CASA Book Chapter 5; Yilmaz & Rickard: DUET	Woodruff & Wang: Binaural Localization Reverberant Noisy
13	Thu	Nov 21	Audio-Visual Scene Understanding	Zhiyao Duan	Arandjelovic & Zisserman: Objects that Sound; Owens & Efros: AV Scene Analysis	Arandjelovic & Zisserman: Look Listen and Learn; Zhao et al.: Sounds of Pixels
14	Tue	Nov 26	Project Status Update	Students
14	Thu	Nov 28	NO CLASS: Happy Thanksgiving!
15	Tue	Dec 3	Self Supervised Learning for Music Understanding	Frank Cwitkowitz
15	Thu	Dec 5	Music Generation	Moji Heydari	Benetatos et al.: BachDuet; Dhawiwal et al.: Jukebox	Hadjeres et al.: DeepBach; Roberts et al.: Hierarchical Latent Vector Model; Jaques et al.: Generating Music with Reinforcement Learning;
16	Sun	Dec 15	Project Oral Presentations	Students	7:15-10:15 PM at CSB 601			Project Report Final; Slides Final

ECE 477 - Computer Audition

Fall 2024 - ECE Department - University of Rochester

(Cross listed as AME/ECE 277, CSC 264/464, TEE 477)

Course Calendar (Subject to Change)