Tools
calc_same_padding(kernel_size)
Calculates the necessary padding for 'same' padding in convolutional operations.
For 'same' padding, the output size is the same as the input size for stride=1
. This function returns
two integers, representing the padding to be added on either side of the input to achieve 'same' padding.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
kernel_size |
int
|
Size of the convolving kernel. |
required |
Returns:
Type | Description |
---|---|
int
|
Tuple[int, int]: A tuple of two integers representing the number of padding elements to be applied on |
int
|
left and right (or top and bottom for 2D) of the input tensor respectively. |
Source code in models/helpers/tools.py
get_device()
Function returns the device where the model and tensors should be placed.
Returns torch.device: The device where the model and tensors should be placed.
Source code in models/helpers/tools.py
get_mask_from_lengths(lengths)
Generate a mask tensor from a tensor of sequence lengths.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
lengths |
Tensor
|
A tensor of sequence lengths of shape: (batch_size, ) |
required |
Returns:
Type | Description |
---|---|
Tensor
|
torch.Tensor: A mask tensor of shape: (batch_size, max_len) where max_len is the maximum sequence length in the provided tensor. The mask tensor has a value of True at each position that is more than the length of the sequence (padding positions). |
Example
lengths: torch.tensor([2, 3, 1, 4])
Mask tensor will be: torch.tensor([
[False, False, True, True],
[False, False, False, True],
[False, True, True, True],
[False, False, False, False]
])
Source code in models/helpers/tools.py
initialize_embeddings(shape)
Initialize embeddings using Kaiming initialization (He initialization).
This method is specifically designed for 2D matrices and helps to avoid the vanishing/exploding gradient problem in deep neural networks. This is achieved by keeping the variance of the outputs of a layer to be the same as the variance of its inputs.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
shape |
Tuple[int, ...]
|
The shape of the embedding matrix to create, denoted as a tuple of integers. The shape should comprise 2 dimensions, i.e., (embedding_dim, num_embeddings). |
required |
Raises:
Type | Description |
---|---|
AssertionError
|
if the provided shape is not 2D. |
Returns:
Type | Description |
---|---|
Tensor
|
torch.Tensor: the created embedding matrix. |
Source code in models/helpers/tools.py
pad(input_ele, max_len)
Takes a list of 1D or 2D tensors and pads them to match the maximum length.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
input_ele |
List[Tensor]
|
The list of tensors to be padded. |
required |
max_len |
int
|
The length to which the tensors should be padded. |
required |
Returns:
Type | Description |
---|---|
Tensor
|
torch.Tensor: A tensor containing all the padded input tensors. |
Source code in models/helpers/tools.py
stride_lens_downsampling(lens, stride=2)
Function computes the lengths of 1D tensor when applying a stride for downsampling.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
lens |
Tensor
|
Tensor containing the lengths to be downsampled. |
required |
stride |
int
|
The stride to be used for downsampling. Defaults to 2. |
2
|
Returns:
Type | Description |
---|---|
Tensor
|
torch.Tensor: A tensor of the same shape as the input containing the downsampled lengths. |