Inversion of Magnitude Spectrograms with Adaptive Window Lenghts
Presented at the 34th International Conference of Audio, Speech, and Signal Processing (ICASSP-09), April 19-24, 2009, Taipei, Taiwan (ROC). Conference homepageAbstract
In this paper, we extend the Real-Time Iterative Spectrogram Inversion method (RTISI) for generating a time-domain audio signal from a magnitude spectrogram such that it can handle changing spectrogram window lengths. For each desired window length, we use a separate buffer structure and synchronize the buffers each time the window length changes. This way, the proposed method helps to improve the time/frequency-resolution trade-off for algorithms that operate on magnitude-only spectra.Paper
PDFPoster
PDF (2 MB)Sound Examples
Mix of a castanets and double bass signal. Original sources: EBU-SQAM
All sound examples are stored in the FLAC format (Free Lossless Audio Coding). Details and decoding software can be found here.
Original | Single-resolution Phase Estimation | Multi-Res. Phase Estim. | ||
512 Samples | 1024 Samples | 2048 Samples | 512/2048 Samples | |
FLAC | FLAC | FLAC | FLAC | FLAC |