net_blocks
Functions
|
Initialise Feed-Forward network with one of activation functions: "swiglu", "relu", "gelu". |
Classes
|
Class to introduce learnable positional embeddings. |
|
Feed-Forward network to introduce nonlinearity into the transformer model. |
|
Base class for positional encoding. |
|
Pre-LN Transformer Layer as described in "On Layer Normalization in the Transformer Architecture" https://arxiv.org/pdf/2002.04745 |
|
Pre-LN Transformer blocks. |
|
Feed-Forward network to introduce nonlinearity into the transformer model. |
|
Base class for transformer layers. |