🚀 The feature
Ability to provide an initial phase to Griffin-Lim for phase reconstruction.
Motivation, pitch
torchaudio has a Griffin-Lim implementation that reconstructs audio from spectrogram magnitudes. The phase is currently estimated entirely inside torchaudio's Griffin-Lim implementation, with no way to supply a starting estimate.
https://pytorch.org/audio/main/generated/torchaudio.transforms.GriffinLim.html
In tasks such as source separation, the reconstruction often converges much faster, or to a better result, when the mixture phase is used as the starting point for estimating the source audio. It would be useful to add an optional parameter that lets the caller pass in an initial guess of the phase.
This functionality exists in librosa's implementation of Griffin-Lim: if the `init` parameter is set to `None`, the phase is initialized from the input spectrogram (which must then contain phase, i.e. be complex-valued).
https://librosa.org/doc/latest/generated/librosa.griffinlim.html
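To make the request concrete, here is a rough sketch of the behaviour I have in mind. It is written against plain `torch.stft`/`torch.istft`, not torchaudio's actual implementation. The function name `griffin_lim_with_init` and the `init_phase` parameter are hypothetical. It also leaves out the momentum term for brevity, and the `0.5` scaling is only a stand-in for a separation model's output.

```python
import math
import torch


def griffin_lim_with_init(magnitude, init_phase=None, n_fft=400, hop_length=100,
                          n_iter=32, length=None):
    """Hypothetical helper: basic Griffin-Lim with an optional initial phase.

    magnitude:  (freq, time) non-negative magnitude spectrogram.
    init_phase: (freq, time) phase in radians, or None for a random start.
    """
    window = torch.hann_window(n_fft, dtype=magnitude.dtype)
    if init_phase is None:
        init_phase = 2 * math.pi * torch.rand_like(magnitude)
    # Unit-modulus complex numbers carrying the current phase estimate.
    angles = torch.polar(torch.ones_like(magnitude), init_phase)

    for _ in range(n_iter):
        # Go to the time domain and back so the estimate becomes a valid STFT.
        audio = torch.istft(magnitude * angles, n_fft, hop_length=hop_length,
                            window=window, length=length)
        rebuilt = torch.stft(audio, n_fft, hop_length=hop_length, window=window,
                             return_complex=True)
        # Keep only the phase of the re-analysed signal.
        angles = rebuilt / rebuilt.abs().clamp_min(1e-8)

    return torch.istft(magnitude * angles, n_fft, hop_length=hop_length,
                       window=window, length=length)


# Usage sketch: start from the mixture phase instead of a random phase.
sr = 16000
t = torch.arange(sr) / sr
mixture = torch.sin(2 * math.pi * 440 * t) + 0.5 * torch.sin(2 * math.pi * 880 * t)
mix_spec = torch.stft(mixture, 400, hop_length=100,
                      window=torch.hann_window(400), return_complex=True)
source_mag = 0.5 * mix_spec.abs()  # stand-in for a model's magnitude estimate
estimate = griffin_lim_with_init(source_mag, init_phase=mix_spec.angle(),
                                 length=mixture.numel())
```

In torchaudio this could presumably be an extra optional argument that defaults to the current behaviour, so existing callers are unaffected.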
Alternatives
No response
Additional context
No response