Understanding concatenate (cat), stack, vstack, hstack

If we want to put arrays together, we can typically do so using numpy’s concatenate, stack, vstack, or hstack. In pytorch, we can use cat or stack. concatenate or cat allow us to concatenate 2 or more arrays by expanding an existing dimension and require all other dimensions to match across the arrays. stack allows us to stack 2 or more arrays by inserting a new dimension and requires the arrays to have the same exact shape. vstack allows us to concatenate arrays vertically and requires all non-vertical dimensions to match across the arrays. hstack allows us to concatenate arrays horizontally and requires all non-horizontal dimensions to match across the arrays. For detailed examples, read below.

Concatenation of batches vertically using concatenate (cat) and axis=0 or vstack
Concatenation of batches horizontally using concatenate (cat) and axis=1 or hstack
Concatenation of instance features horizontally using concatenate (cat) and axis=0 or hstack
Stacking of instances vertically using stack and axis=0

Concatenation of batches vertically using concatenate (cat) and axis=0 or vstack

Concatenate can be used to join 2 or more arrays along an existing dimension.

Let’s say we have two arrays of shape batch size (2) x number of features (3).

import numpy as np
a_bf1 = np.array([[1, 2, 3],
                  [4, 5, 6]])
a_bf2 = np.array([[7, 8, 9],
                  [10, 11, 12]])

Now, let’s say we want to join these 2 arrays as to form one array containing 4 rows. In this case, we already have a batch-size dimension and simply want to expand this existing 0th dimension from size 2 to size 4. For this scenario, we use concatenate. We specify axis=0 since the batch dimension we want to concatenate along is the 0th dimension. The result is an array of shape 4 x 2. Note that we can apply the concatenation in any of the suggested approaches below.

a_cat1 = np.concatenate((a_bf1, a_bf2), axis=0)
a_cat1 = np.concatenate((a_bf1, a_bf2), 0)
a_cat1 = np.concatenate((a_bf1, a_bf2))
a_cat1

The output is:

array[[ 1,  2,  3],
      [ 4,  5,  6],
      [ 7,  8,  9]
      [10, 11, 12]])

The pytorch equivalent of concatenate is cat. With cat, the axis parameter can be specified or not in the same way as with numpy’s concatenate.

import torch
from torch import tensor
torch.cat((tensor(a_bf1), tensor(a_bf2)), 0)

The output is:

tensor[[ 1,  2,  3],
       [ 4,  5,  6],
       [ 7,  8,  9]
       [10, 11, 12]])

Alternatively, we can use vstack to vertically concatenate batches. There is no vstack in pytorch.

np.vstack((a_bf1, a_bf2))

The output is the same as earlier.

array[[ 1,  2,  3],
      [ 4,  5,  6],
      [ 7,  8,  9]
      [10, 11, 12]])

Note that this vertical stacking requires the arrays have the same number of columns, i.e., the 1st dimension should be of the same size so that the arrays can be placed on top of one another vertically. More generally, vertical concatenation of batches requies all dimensions of the concatenated arrays to match aside from the 0th dimension along which the concatenation takes place.

Concatenation of batches horizontally using concatenate (cat) and axis=1 or hstack

Let’s say we have two arrays of number of batches x number of features with the first array containing the first 2 features, and the second containing the subsequent 3 features for the same rows.

a_bf1 = np.array([[1, 2],
                  [3, 4]])
a_bf2 = np.array([[5, 6, 7],
                  [8, 9, 10]])

Let’s say we want to concatenate these arrays as to have one array of shape number of batches (2) x number of features (5). In this scenario, we can use concatenate again, but with axis=1, since we want to concatenate along the 1st dimension (number of features), ie, we want to expand the existing 1st dimension.

np.concatenate((a_bf1, a_bf2), 1)

The output is:

array([[1,  2,  5,  6,  7],
       [3,  4,  8,  9, 10]])

The horizontal concatenation of batches can also be done using numpy’s hstack. Additionally, we can use pyotorch’s cat, which works in the same way as numpy’s concatenate.

print(np.hstack((a_bf1, a_bf2)))
print(torch.cat((tensor(a_bf1), tensor(a_bf2)), dim=1))

The output is:

[[ 1  2  5  6  7]
 [ 3  4  8  9 10]]
tensor([[ 1,  2,  5,  6,  7],
        [ 3,  4,  8,  9, 10]])

Note that this horizontal stacking requires the arrays have the same number of rows, i.e., the 0th dimension should be of the same size so that the arrays can be placed next to one another horizontally. More generally, horiontal concatenation of batches requies all dimensions of the concatenated arrays to match aside from the 1st dimension along which the concatenation takes place.

Concatenation of instance features horizontally using concatenate (cat) and axis=0 or hstack

Let’s say we have the first 2 features for a given instance in one array, and its subsequent 3 features in another array and we wish to put them together.

a_f1 = np.array([1, 2])
a_f2 = np.array([3, 4, 5])

We effectively want to expand the 0th dimension (features) and so can use concatenate with `axis=0

np.concatenate((a_f1, a_f2))

The output is an array of shape (5,).

array([1, 2, 3, 4, 5])

Alternatively, we can use numpy’s hstack to carry out the horizontal concatenation.

np.hstack((a_f1, a_f2))

We get the same output as with did with concatenate.

array([1, 2, 3, 4, 5])

Stacking of instances vertically using stack and axis=0

Now, let’s say we have 2 arrays, each corresponding to one instance, and we wish to form one array containing the two rows. In this case, an instance could be the representation of an RGB 2x3 image and thus each array is of shape 3x2x3.

a_chw1 = np.array([[[ 1,  2,  3],
                    [ 4,  5,  6]],
                   [[ 7,  8,  9],
                    [10, 11, 12]],
                   [[13, 14, 15],
                    [16, 17, 18]]
                  ])
a_chw2 = np.array([[[19, 20, 21],
                    [22, 23, 24]],
                   [[25, 26, 27],
                    [28, 29, 30]],
                   [[31, 32, 33],
                    [34, 35, 36]]
                  ])

To form the array containing the 2 images, we need to insert a new 0th dimension at the front corresponding to the number of images. To achieve this, we can use stack with axis=0, since we’re introducing a new 0th dimension along which to stack the images. The stack function could be called in any of the listed approaches below.

a_chw3 = np.stack((a_chw1, a_chw2), axis=0)
a_chw3 = np.stack((a_chw1, a_chw2), 0)
a_chw3 = np.stack((a_chw1, a_chw2))
a_chw3

The output is the expected array of shape number of images (2) x channels (3) x rows (2) x columns (3).

array([[[[ 1,  2,  3],
         [ 4,  5,  6]],

        [[ 7,  8,  9],
         [10, 11, 12]],

        [[13, 14, 15],
         [16, 17, 18]]],


       [[[19, 20, 21],
         [22, 23, 24]],

        [[25, 26, 27],
         [28, 29, 30]],

        [[31, 32, 33],
         [34, 35, 36]]]])

We can carry out the same type of transformation using pytorch’s stack.

torch.stack((tensor(a_chw1), tensor(a_chw2)), 0)

The output is:

tensor([[[[ 1,  2,  3],
          [ 4,  5,  6]],

         [[ 7,  8,  9],
          [10, 11, 12]],

         [[13, 14, 15],
          [16, 17, 18]]],


        [[[19, 20, 21],
          [22, 23, 24]],

         [[25, 26, 27],
          [28, 29, 30]],

         [[31, 32, 33],
          [34, 35, 36]]]])

Note that this stacking of instances (using stack) requires that the stacked arrays have the exact same shape so that they could be stacked together.