I would like to convert an audio file or waveform to a spectrogram image where the frequency axis runs from 20 Hz up to the maximum frequency the audio can contain. I would also like to set the scale of this axis: linear, logarithmic, or a custom function such as f(p) = 2p, where p is the n-th pixel (from 0 to the maximum height of the image) and f(p) is the frequency at that pixel. The image would be something like 720*480.
So is there a Python library/package that I can install, or should I calculate it manually by transforming the time-domain waveform to the frequency domain using the Fast Fourier Transform?
Check the librosa library; it should contain all that you need. For instance: https://librosa.org/doc/main/generated/librosa.stft.html
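To make the custom-axis part concrete, here is a rough sketch of one way it could be done. It is only an illustration under some assumptions: it uses plain NumPy with a hand-rolled STFT (the complex matrix from `librosa.stft` has the same frequency-by-frames layout, so its magnitude could be dropped in instead), the names `stft_magnitude`, `spectrogram_image` and `freq_of_pixel` are made up for this example, and the FFT size, hop length and 80 dB display range are arbitrary choices. Each pixel row p is mapped to a frequency f(p), and the spectrum is linearly interpolated at that frequency.

```python
import numpy as np


def stft_magnitude(y, n_fft=2048, hop=512):
    """Magnitude STFT with a Hann window, shape (n_fft // 2 + 1, n_frames)."""
    y = np.asarray(y, dtype=float)
    if len(y) < n_fft:  # pad very short signals to one full frame
        y = np.pad(y, (0, n_fft - len(y)))
    window = np.hanning(n_fft)
    n_frames = 1 + (len(y) - n_fft) // hop
    frames = np.stack(
        [y[i * hop : i * hop + n_fft] * window for i in range(n_frames)], axis=1
    )
    return np.abs(np.fft.rfft(frames, axis=0))


def spectrogram_image(y, sr, width=720, height=480, freq_of_pixel=None,
                      n_fft=2048, hop=512):
    """Return a (height, width) uint8 image; row 0 is the top of the image.

    freq_of_pixel maps a NumPy array of pixel rows p (0 = bottom) to
    frequencies in Hz. If omitted, a log scale from 20 Hz to sr / 2 is used.
    """
    mag = stft_magnitude(y, n_fft, hop)
    nyquist = sr / 2.0
    if freq_of_pixel is None:
        def freq_of_pixel(p):
            return 20.0 * (nyquist / 20.0) ** (p / (height - 1))

    # Frequency wanted at every pixel row, limited to what the audio can hold.
    p = np.arange(height, dtype=float)
    target = np.clip(freq_of_pixel(p), 0.0, nyquist)

    # Linear interpolation between the two nearest FFT bins.
    idx = target / (sr / n_fft)  # fractional bin index
    lo = np.floor(idx).astype(int)
    hi = np.minimum(lo + 1, mag.shape[0] - 1)
    frac = (idx - lo)[:, None]
    img = (1.0 - frac) * mag[lo] + frac * mag[hi]

    # Stretch or shrink the time axis to the requested width (nearest frame).
    cols = np.round(np.linspace(0, mag.shape[1] - 1, width)).astype(int)
    img = img[:, cols]

    # Convert to decibels, keep the top 80 dB, and scale to 0..255.
    db = 20.0 * np.log10(np.maximum(img, 1e-10))
    db = np.clip(db, db.max() - 80.0, db.max())
    span = max(db.max() - db.min(), 1e-12)
    img8 = np.round((db - db.min()) / span * 255.0).astype(np.uint8)

    return img8[::-1]  # flip so that low frequencies are at the bottom


if __name__ == "__main__":
    sr = 22050
    t = np.arange(sr) / sr
    tone = np.sin(2 * np.pi * 500 * t)  # 1 second test tone at 500 Hz
    image = spectrogram_image(tone, sr, freq_of_pixel=lambda p: 2.0 * p)
    print(image.shape, image.dtype)
```

The returned array can be written to disk with any image library (for example Pillow's `Image.fromarray`). Note that with f(p) = 2p and a height of 480 the image only covers 0 to 958 Hz, so a real custom function would probably need to reach further.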