Abstract: A new neural network architecture is proposed that can be used to convert Mel spectrograms into an audio signal. The architecture is designed from the ground up to be run on a mobile device, ...
Abstract: The quality of raw audio waveform generated by a vocoder could affect various audio generative tasks. In recent years, the dominance of source-filter vocoders was greatly challenged by ...
References [1] High-Fidelity and Low-Latency Universal Neural Vocoder based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling [2] Low-latency real-time ...