Monophonic resampling¶
The following code block demonstrates how to resample an audio signal.
We use librosa for loading the audio, but this is purely for ease of demonstration. resampy does not depend on librosa.
1import librosa
2import resampy
3
4# Load in librosa's example audio file at its native sampling rate
5x, sr_orig = librosa.load(librosa.util.example_audio_file(), sr=None)
6
7# x is now a 1-d numpy array, with `sr_orig` audio samples per second
8
9# We can resample this to any sampling rate we like, say 16000 Hz
10y_low = resampy.resample(x, sr_orig, 16000)
11
12# That's it!
Stereo and multi-dimensional data¶
The previous example operates on monophonic signals, but resampy also supports stereo resampling, as demonstrated below.
1import librosa
2import resampy
3
4# Load in librosa's example audio file at its native sampling rate.
5# This time, also disable the stereo->mono downmixing
6x, sr_orig = librosa.load(librosa.util.example_audio_file(), sr=None, mono=False)
7
8# x is now a 2-d numpy array, with `sr_orig` audio samples per second
9# The first dimension of x indexes the channels, the second dimension indexes
10# samples.
11# x[0] is the left channel, x[1] is the right channel.
12
13# We can again resample. By default, resample assumes the last index is time.
14y_low = resampy.resample(x, sr_orig, 16000)
15
16# To be more explicit, provide a target axis
17y_low = resampy.resample(x, sr_orig, 16000, axis=1)
The next block illustrates resampling along an arbitrary dimension.
1import numpy as np
2import resampy
3
4# Generate 4-dimensional white noise. The third axis (axis=2) will index time.
5sr_orig = 22050
6x = np.random.randn(10, 3, sr_orig * 5, 2)
7
8# x is now a 10-by-3-by-(5*22050)-by-2 tensor of data.
9
10# We can resample along the time axis as follows
11y_low = resampy.resample(x, sr_orig, 11025, axis=2)
12
13# y_low is now a 10-by-3-(5*11025)-by-2 tensor of data
Advanced filtering¶
resampy allows you to control the design of the filters used in resampling operations.
1import numpy as np
2import scipy.signal
3import librosa
4import resampy
5
6# Load in some audio
7x, sr_orig = librosa.load(librosa.util.example_audio_file(), sr=None, mono=False)
8
9# Resample to 22050Hz using a Hann-windowed sinc-filter
10y = resampy.resample(x, sr_orig, sr_new, filter='sinc_window', window=scipy.signal.hann)
11
12# Or a shorter sinc-filter than the default (num_zeros=64)
13y = resampy.resample(x, sr_orig, sr_new, filter='sinc_window', num_zeros=32)
14
15# Or use the pre-built high-quality filter
16y = resampy.resample(x, sr_orig, sr_new, filter='kaiser_best')
17
18# Or use the pre-built fast filter
19y = resampy.resample(x, sr_orig, sr_new, filter='kaiser_fast')
Benchmarking¶
Benchmarking resampy is relatively simple, using ipython’s %timeit
magic.
The following example demonstrates resampling a monophonic signal of 400000 samples from
22.05 KHz to 16 KHz using both resampy and scipy.signal.resample.
In [1]: import numpy as np
In [2]: import scipy
In [3]: import resampy
In [4]: x = np.random.randn(400000)
In [5]: sr_in, sr_out = 22050, 16000
In [6]: %timeit resampy.resample(x, sr_in, sr_out, axis=-1)
1 loop, best of 3: 199 ms per loop
In [7]: %timeit scipy.signal.resample(x,
...: int(x.shape[-1] * sr_out / float(sr_in)),
...: axis=-1)
1 loop, best of 3: 6min 5s per loop