Tampere University of Technology

TUTCRIS Research Portal

Nonnegative Matrix Factorization

Research output: Chapter in Book/Report/Conference proceedingChapterScientific

Details

Original languageEnglish
Title of host publicationAudio Source Separation and Speech Enhancement
EditorsEmmanuel Vincent, Tuomas Virtanen, Sharon Gannot
PublisherWiley
ISBN (Electronic)978-1-119-27986-0
ISBN (Print)978-1-119-27989-1
DOIs
Publication statusPublished - 3 Aug 2018
Publication typeB2 Part of a book or another research book

Abstract

Nonnegative matrix factorization (NMF) is a very powerful model for representing speech and music data. In this chapter, we present the mathematical foundations, and describe several probabilistic frameworks and various algorithms for computing an NMF. We also describe some advanced NMF models that are able to more accurately represent audio signals, by enforcing properties such as sparsity, harmonicity and spectral smoothness, and by taking the non‐stationarity of the data into account. We show that coupled factorizations make it possible to exploit some extra information we may have about the observed signal, including the musical score. Finally, we present several methods that perform dictionary learning for NMF, and we conclude about the main benefits and downsides of NMF models.

Field of science, Statistics Finland