Audio Signal Processing for Music Applications

Product type
Logo Coursera (CC)
Provider rating: starstarstarstar_halfstar_border 7.2 Coursera (CC) has an average rating of 7.2 (out of 6 reviews)

Need more information? Get more details on the site of the provider.

Description

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

About this course: In this course you will learn about audio signal processing methodologies that are specific for music and of use in real applications. We focus on the spectral processing techniques of relevance for the description and transformation of sounds, developing the basic theoretical and practical knowledge with which to analyze, synthesize, transform and describe audio signals in the context of music applications. The course is based on open software and content. The demonstrations and programming exercises are done using Python under Ubuntu, and the references and materials for the course come from open online repositories. We are also distributing with open licenses the s…

Read the complete description

Frequently asked questions

There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.

Didn't find what you were looking for? See also: Audio, Music, Python, Programming (general), and Web Accessibility.

When you enroll for courses through Coursera you get to choose for a paid plan or for a free plan

  • Free plan: No certicification and/or audit only. You will have access to all course materials except graded items.
  • Paid plan: Commit to earning a Certificate—it's a trusted, shareable way to showcase your new skills.

About this course: In this course you will learn about audio signal processing methodologies that are specific for music and of use in real applications. We focus on the spectral processing techniques of relevance for the description and transformation of sounds, developing the basic theoretical and practical knowledge with which to analyze, synthesize, transform and describe audio signals in the context of music applications. The course is based on open software and content. The demonstrations and programming exercises are done using Python under Ubuntu, and the references and materials for the course come from open online repositories. We are also distributing with open licenses the software and materials developed for the course.

Who is this class for: This course is primary aimed at advanced undergraduate or master students, along with professionals, interested in signal processing, programming and music.

Created by:  Universitat Pompeu Fabra of Barcelona, Stanford University
  • Taught by:  Xavier Serra, Associate Professor

    Dept. of Information and Communication Technologies, UPF
  • Taught by:  Prof Julius O Smith, III, Professor of Music and (by courtesy) Electrical Engineering

    CCRMA
Level Intermediate Commitment 10 weeks of study, 8 hours/week Language English How To Pass Pass all graded assignments to complete the course. User Ratings 4.9 stars Average User Rating 4.9See what learners said Coursework

Each course is like an interactive textbook, featuring pre-recorded videos, quizzes and projects.

Help from your peers

Connect with thousands of other learners and debate ideas, discuss course material, and get help mastering concepts.

Universitat Pompeu Fabra of Barcelona Pompeu Fabra University (UPF) is a modern public university, conveniently located in the centre of Barcelona (Spain) with the aim of providing top quality education and standing out as a research-based university. UPF is both a specialised university with a unique teaching model and a cutting-edge research institution. UPF places a strong emphasis on quality teaching, based on comprehensive education and student-centred learning, and innovation in the learning processes. UPF’s MOOCs are produced within this general goal. Stanford University The Leland Stanford Junior University, commonly referred to as Stanford University or Stanford, is an American private research university located in Stanford, California on an 8,180-acre (3,310 ha) campus near Palo Alto, California, United States.

Syllabus


WEEK 1


Introduction



Introduction to the course, to the field of Audio Signal Processing, and to the basic mathematics needed to start the course. Introductory demonstrations to some of the software applications and tools to be used. Introduction to Python and to the sms-tools package, the main programming tool for the course.


11 videos, 1 reading expand


  1. Video: Teaser
  2. Video: Welcome
  3. Video: Introduction to Audio Signal Processing
  4. Video: Course outline
  5. Video: Basic mathematics
  6. Video: Introduction to Audacity
  7. Video: Introduction to SonicVisualizer
  8. Video: Introduction to sms-tools
  9. Video: Introduction to Python
  10. Video: Python and sounds
  11. Video: sms-tools software
  12. Reading: Advanced readings and videos

Graded: Basics
Graded: Python and sound

WEEK 2


Discrete Fourier transform



The Discrete Fourier Transform equation; complex exponentials; scalar product in the DFT; DFT of complex sinusoids; DFT of real sinusoids; and inverse-DFT. Demonstrations on how to analyze a sound using the DFT; introduction to Freesound.org. Generating sinusoids and implementing the DFT in Python.


6 videos, 1 reading expand


  1. Video: DFT 1
  2. Video: DFT 2
  3. Video: Analyzing a sound
  4. Video: Introduction to Freesound
  5. Video: Sinusoids
  6. Video: DFT
  7. Reading: Advanced readings and videos

Graded: DFT
Graded: Sinusoids and DFT

WEEK 3


Fourier theorems



Linearity, shift, symmetry, convolution; energy conservation and decibels; phase unwrapping; zero padding; Fast Fourier Transform and zero-phase windowing; and analysis/synthesis. Demonstration of the analysis of simple periodic signals and of complex sounds; demonstration of spectrum analysis tools. Implementing the computation of the spectrum of a sound fragment using Python and presentation of the dftModel functions implemented in the sms-tools package.


7 videos, 1 reading expand


  1. Video: Fourier properties 1
  2. Video: Fourier properties 2
  3. Video: Periodic signals
  4. Video: Complex sounds
  5. Video: Spectrum
  6. Video: Fourier properties
  7. Video: dftModel
  8. Reading: Advanced readings and videos

Graded: Fourier properties
Graded: Fourier Properties

WEEK 4


Short-time Fourier transform



STFT equation; analysis window; FFT size and hop size; time-frequency compromise; inverse STFT. Demonstration of tools to compute the spectrogram of a sound and on how to analyze a sound using them. Implementation of the windowing of sounds using Python and presentation of the STFT functions from the sms-tools package, explaining how to use them.


6 videos, 1 reading expand


  1. Video: STFT 1
  2. Video: STFT 2
  3. Video: Spectrogram
  4. Video: Analyzing a sound
  5. Video: Windows
  6. Video: STFT
  7. Reading: Advanced readings and videos

Graded: Short-time Fourier transform
Graded: Short-time Fourier Transform (STFT)

WEEK 5


Sinusoidal model



Sinusoidal model equation; sinewaves in a spectrum; sinewaves as spectral peaks; time-varying sinewaves in spectrogram; sinusoidal synthesis. Demonstration of the sinusoidal model interface of the sms-tools package and its use in the analysis and synthesis of sounds. Implementation of the detection of spectral peaks and of the sinusoidal synthesis using Python and presentation of the sineModel functions from the sms-tools package, explaining how to use them.


8 videos, 1 reading expand


  1. Video: Sinusoidal model 1
  2. Video: Sinusoidal model 2
  3. Video: Sinusoidal model 3
  4. Video: Sinusoidal model
  5. Video: Analyzing a sound
  6. Video: Peak detection
  7. Video: Sinusoidal synthesis
  8. Video: sineModel
  9. Reading: Advance reading

Graded: Sinusoidal model
Graded: Sinusoidal model

WEEK 6


Harmonic model



Harmonic model equation; sinusoids-partials-harmonics; polyphonic-monophonic signals; harmonic detection; f0-detection in time and frequency domains. Demonstrations of pitch detection algorithm, of the harmonic model interface of the sms-tools package and of its use in the analysis and synthesis of sounds. Implementation of the detection of the fundamental frequency in the frequency domain using the TWM algorithm in Python and presentation of the harmonicModel functions from the sms-tools package, explaining how to use them.


7 videos, 1 reading expand


  1. Video: Harmonic model
  2. Video: F0 detection
  3. Video: Pitch detection
  4. Video: Harmonic model
  5. Video: Analyzing a sound
  6. Video: F0 detection
  7. Video: harmonicModel
  8. Reading: Advanced readings

Graded: Harmonic model
Graded: Harmonic Model

WEEK 7


Sinusoidal plus residual model



Stochastic signals; stochastic model; stochastic approximation of sounds; sinusoidal/harmonic plus residual model; residual subtraction; sinusoidal/harmonic plus stochastic model; stochastic model of residual. Demonstrations of the stochastic model, harmonic plus residual, and harmonic plus stochastic interfaces of the sms-tools package and of its use in the analysis and synthesis of sounds. Presentation of the stochasticModel, hprModel and hpsModel functions implemented in the sms-tools package, explaining how to use them.


8 videos, 1 reading expand


  1. Video: Stochastic model
  2. Video: Sinusoidal plus residual modeling
  3. Video: Stochastic model
  4. Video: Harmonic plus residual model
  5. Video: Harmonic plus stochastic model
  6. Video: stochasticModel
  7. Video: hprModel
  8. Video: hpsModel
  9. Reading: Advanced readings

Graded: Sinusoidal plus residual model
Graded: Sinusoidal plus residual

WEEK 8


Sound transformations



Filtering and morphing using the short-time Fourier transform; frequency and time scaling using the sinusoidal model; frequency transformations using the harmonic plus residual model; time scaling and morphing using the harmonic plus stochastic model. Demonstrations of the various transformation interfaces of the sms-tools package and of Audacity. Presentation of the stftTransformations, sineTransformations and hpsTransformations functions implemented in the sms-tools package, explaining how to use them.


9 videos, 1 reading expand


  1. Video: Sounds transformations 1
  2. Video: Sounds transformations 2
  3. Video: Morphing with STFT
  4. Video: Time scaling
  5. Video: Pitch changes
  6. Video: Morphing with HPS
  7. Video: stftTransformations
  8. Video: sineTransformations
  9. Video: hpsTransformations
  10. Reading: Advanced readings

Graded: Sound transformations
Graded: Transformations

WEEK 9


Sound and music description



Extraction of audio features using spectral analysis methods; describing sounds, sound collections, music recordings and music collections. Clustering and classification of sounds. Demonstration of various plugins from SonicVisualiser to describe sound and music signals and demonstration of some advance features of freesound.org. Presentation of Essentia, a C++ library for sound and music description, explaining how to use it from Python. Programming with the Freesound API in Python to download sound collections and to study them.


6 videos expand


  1. Video: Audio features
  2. Video: Sound and music description
  3. Video: Sound descriptors
  4. Video: Freesound
  5. Video: Intro to Essentia
  6. Video: Freesound API

Graded: Sound and music description
Graded: Sound and music description

WEEK 10


Concluding topics



Audio signal processing beyond this course. Beyond audio signal processing. Review of the course topics. Where to learn more about the topics of this course. Presentation of MTG-UPF. Demonstration of Dunya, a web browser to explore several audio music collections, and of AcousticBrainz, a collaborative initiative to collect and share music data.


6 videos, 1 reading expand


  1. Video: Beyond audio processing
  2. Video: Review
  3. Video: MTG-UPF
  4. Video: Goodbye
  5. Video: Dunya
  6. Video: AcousticBrainz
  7. Reading: Advanced readings

Graded: Concluding topics
Graded: A music piece combining sounds and their transformations
There are no reviews yet.

Share your review

Do you have experience with this course? Submit your review and help other people make the right choice. As a thank you for your effort we will donate $1.- to Stichting Edukans.

There are no frequently asked questions yet. If you have any more questions or need help, contact our customer service.