TacotronSTFT
Bases: Module
Source code in training/preprocess/tacotron_stft.py
__init__(filter_length, hop_length, win_length, n_mel_channels, sampling_rate, center, mel_fmax, mel_fmin=0.0)
TacotronSTFT module that computes mel-spectrograms from a batch of waves.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
filter_length | int | Length of the filter window. | required |
hop_length | int | Number of samples between successive frames. | required |
win_length | int | Size of the STFT window. | required |
n_mel_channels | int | Number of mel bins. | required |
sampling_rate | int | Sampling rate of the input waveforms. | required |
mel_fmin | int or None | Minimum frequency for the mel filter bank. | 0.0 |
mel_fmax | int or None | Maximum frequency for the mel filter bank. | required |
center | bool | Whether to pad the input signal on both sides. | required |
dynamic_range_compression_torch(x, C=1, clip_val=1e-05)
Applies dynamic range compression to x.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
x | Tensor | Input tensor. | required |
C | float | Compression factor. | 1 |
clip_val | float | Clipping value. | 1e-05 |
Returns:
Type | Description |
---|---|
Tensor | torch.Tensor: Output tensor. |
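As a rough illustration of what the `C` and `clip_val` parameters do: dynamic range compression in Tacotron-style pipelines is commonly a clamped logarithm, `log(max(x, clip_val) * C)`. The sketch below assumes that convention (this page does not show the formula, so it is an assumption) and uses plain Python floats in place of tensors. The name `compress` is hypothetical and not part of the module.

```python
import math


def compress(values, C=1.0, clip_val=1e-5):
    """Element-wise log compression: log(max(v, clip_val) * C).

    Assumed formula (a common Tacotron-style convention), not confirmed
    by this page. Operates on a list of floats rather than a tensor.
    """
    # Clamp from below so that zeros never reach the logarithm.
    return [math.log(max(v, clip_val) * C) for v in values]


# Values at or below clip_val collapse to the same floor, log(clip_val * C).
compressed = compress([0.0, 1e-5, 1.0, 10.0])
```

Under this assumption, `clip_val` sets the lowest value the output can take, and `C` scales the input before the logarithm is applied.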
forward(y)
Computes mel-spectrograms from a batch of waves.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
y | FloatTensor | Input waveforms with shape (B, T) in range [-1, 1] | required |
Returns:
Type | Description |
---|---|
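Tensor | torch.FloatTensor: Spectrogram of shape (B, n_spec_channels, T), with T here counting STFT frames rather than input samples |
Tensor | torch.FloatTensor: Mel-spectrogram of shape (B, n_mel_channels, T), with T here counting STFT frames rather than input samples |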
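The output shapes follow from the constructor arguments. The helper below is a hypothetical sketch (`expected_shapes` is not part of the module) that assumes the transform follows `torch.stft` conventions: `filter_length // 2 + 1` frequency bins for a one-sided transform, `1 + T // hop_length` frames when the signal is padded by `filter_length // 2` on both sides (`center=True`), and `1 + (T - filter_length) // hop_length` frames without padding. An implementation that pads the waveform manually before the transform may produce a different frame count.

```python
def expected_shapes(batch, num_samples, filter_length, hop_length,
                    n_mel_channels, center):
    """Return (spectrogram_shape, mel_shape) under torch.stft conventions.

    Hypothetical helper for illustration only; the frame-count rule is an
    assumption about how the module calls the STFT.
    """
    # One-sided FFT keeps the non-negative frequencies only.
    n_freq = filter_length // 2 + 1
    if center:
        # Signal is padded by filter_length // 2 on each side.
        n_frames = 1 + num_samples // hop_length
    else:
        # Only windows that fit entirely inside the signal are used.
        n_frames = 1 + (num_samples - filter_length) // hop_length
    return (batch, n_freq, n_frames), (batch, n_mel_channels, n_frames)


spec_shape, mel_shape = expected_shapes(
    batch=4, num_samples=22050, filter_length=1024,
    hop_length=256, n_mel_channels=80, center=True,
)
# spec_shape == (4, 513, 87), mel_shape == (4, 80, 87)
```

Both outputs share the same frame axis; only the channel axis differs, because the mel filter bank projects the linear frequency bins onto `n_mel_channels` bands.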
linear_spectrogram(y)
Computes the linear spectrogram of a batch of waves.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
y | Tensor | Input waveforms. | required |
Returns:
Type | Description |
---|---|
Tensor | torch.Tensor: Linear spectrogram. |
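A linear spectrogram is the magnitude of the short-time Fourier transform. The sketch below is a dependency-free illustration for a single waveform, assuming a periodic Hann window and no padding. The function name `magnitude_stft` is hypothetical, and the module itself presumably relies on PyTorch's STFT and may differ in windowing, padding, and numerical details.

```python
import cmath
import math


def magnitude_stft(wave, filter_length, hop_length):
    """Naive magnitude STFT of one waveform (list of floats).

    Returns a list of frames, each a list of filter_length // 2 + 1
    magnitudes. Note the (frames, bins) layout here; the documented
    tensors use (bins, frames) per batch item.
    """
    # Periodic Hann window (an assumption about the window type).
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / filter_length)
              for n in range(filter_length)]
    n_bins = filter_length // 2 + 1
    frames = []
    # Slide over the signal, keeping only windows that fit entirely.
    for start in range(0, len(wave) - filter_length + 1, hop_length):
        segment = [wave[start + n] * window[n] for n in range(filter_length)]
        # Direct DFT of the windowed segment, one-sided, magnitude only.
        frames.append([
            abs(sum(segment[n] * cmath.exp(-2j * math.pi * k * n / filter_length)
                    for n in range(filter_length)))
            for k in range(n_bins)
        ])
    return frames


# A sine with exactly 4 cycles per 16-sample window peaks at bin 4.
wave = [math.sin(2 * math.pi * 4 * n / 16) for n in range(64)]
spec = magnitude_stft(wave, filter_length=16, hop_length=8)
```

The direct DFT costs O(N^2) per frame and is only meant to make the definition concrete; a real implementation would use an FFT.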
spectral_normalize_torch(magnitudes)
Applies dynamic range compression to magnitudes.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
magnitudes | Tensor | Input magnitudes. | required |
Returns:
Type | Description |
---|---|
Tensor | torch.Tensor: Output magnitudes. |