Dense layers

class lasagne.layers.DenseLayer(incoming, num_units, W=lasagne.init.GlorotUniform(), b=lasagne.init.Constant(0.), nonlinearity=lasagne.nonlinearities.rectify, num_leading_axes=1, **kwargs)

A fully connected layer.

Parameters:
incoming : a Layer instance or a tuple

The layer feeding into this layer, or the expected input shape

num_units : int

The number of units of the layer

W : Theano shared variable, expression, numpy array or callable

Initial value, expression or initializer for the weights. This should be a matrix with shape (num_inputs, num_units), where num_inputs is the product of the sizes of all non-leading input axes. See lasagne.utils.create_param() for more information.

b : Theano shared variable, expression, numpy array, callable or None

Initial value, expression or initializer for the biases. If set to None, the layer will have no biases. Otherwise, biases should be a 1D array with shape (num_units,). See lasagne.utils.create_param() for more information.

nonlinearity : callable or None

The nonlinearity that is applied to the layer activations. If None is provided, the layer will be linear.

num_leading_axes : int

Number of leading axes to distribute the dot product over. These axes will be kept in the output tensor; the remaining axes will be collapsed and multiplied against the weight matrix. A negative number gives the (negated) number of trailing axes to involve in the dot product. The default of 1 treats the first axis as a batch axis and flattens all remaining axes.

Examples

>>> from lasagne.layers import InputLayer, DenseLayer
>>> l_in = InputLayer((100, 20))
>>> l1 = DenseLayer(l_in, num_units=50)

If the input has more than two axes, by default all trailing axes will be flattened into a single axis. This is useful when a dense layer follows a convolutional layer.

>>> l_in = InputLayer((None, 10, 20, 30))
>>> DenseLayer(l_in, num_units=50).output_shape
(None, 50)
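
This flattening is the same as collapsing the trailing axes yourself before the dense layer; a minimal equivalent sketch using FlattenLayer (10*20*30 = 6000 flattened inputs):

>>> from lasagne.layers import FlattenLayer
>>> l_flat = FlattenLayer(l_in)
>>> l_flat.output_shape
(None, 6000)
>>> DenseLayer(l_flat, num_units=50).output_shape
(None, 50)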

Using the num_leading_axes argument, you can keep more than just the first axis. E.g., to apply the same dot product to each step of a batch of time sequences, you would want to keep the first two axes.

>>> DenseLayer(l_in, num_units=50, num_leading_axes=2).output_shape
(None, 10, 50)
>>> DenseLayer(l_in, num_units=50, num_leading_axes=-1).output_shape
(None, 10, 20, 50)
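
As a minimal end-to-end sketch (variable names are illustrative), the parameter options above can be combined and the layer compiled into a Theano function; b=None and nonlinearity=None yield a purely linear, bias-free projection:

>>> import numpy as np
>>> import theano
>>> from lasagne import init
>>> from lasagne.layers import InputLayer, DenseLayer, get_output
>>> l_in = InputLayer((None, 20))
>>> l_lin = DenseLayer(l_in, num_units=50, W=init.Normal(0.01),
...                    b=None, nonlinearity=None)  # linear, no bias
>>> f = theano.function([l_in.input_var], get_output(l_lin))
>>> x = np.zeros((100, 20), dtype=theano.config.floatX)
>>> f(x).shape
(100, 50)
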
class lasagne.layers.NINLayer(incoming, num_units, untie_biases=False, W=lasagne.init.GlorotUniform(), b=lasagne.init.Constant(0.), nonlinearity=lasagne.nonlinearities.rectify, **kwargs)

Network-in-network layer. Like DenseLayer, but broadcasting across all trailing dimensions beyond the 2nd. This results in a convolution operation with filter size 1 on all trailing dimensions. Any number of trailing dimensions is supported, so NINLayer can be used to implement 1D, 2D, 3D, ... convolutions.

Parameters:
incoming : a Layer instance or a tuple

The layer feeding into this layer, or the expected input shape

num_units : int

The number of units of the layer

untie_biases : bool

If False, the layer has a single bias vector, as in a dense layer. If True, a separate bias is learned for every position along the trailing dimensions beyond the 2nd (see the shape of b below).

W : Theano shared variable, expression, numpy array or callable

Initial value, expression or initializer for the weights. This should be a matrix with shape (num_inputs, num_units), where num_inputs is the size of the second dimension of the input. See lasagne.utils.create_param() for more information.

b : Theano shared variable, expression, numpy array, callable or None

Initial value, expression or initializer for the biases. If set to None, the layer will have no biases. Otherwise, biases should be a 1D array with shape (num_units,) for untie_biases=False, and a tensor of shape (num_units, input_shape[2], ..., input_shape[-1]) for untie_biases=True. See lasagne.utils.create_param() for more information.

nonlinearity : callable or None

The nonlinearity that is applied to the layer activations. If None is provided, the layer will be linear.

References

[1] Lin, Min, Qiang Chen, and Shuicheng Yan (2013): Network in Network. arXiv preprint arXiv:1312.4400.

Examples

>>> from lasagne.layers import InputLayer, NINLayer
>>> l_in = InputLayer((100, 20, 10, 3))
>>> l1 = NINLayer(l_in, num_units=5)
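
A short sketch of the resulting shapes, continuing the example above: the second axis becomes num_units while all trailing axes are preserved, and untying the biases gives one bias per unit and trailing position. Since the operation amounts to a convolution with filter size 1, a Conv2DLayer with 1x1 filters is shown for shape comparison only:

>>> l1.output_shape
(100, 5, 10, 3)
>>> l2 = NINLayer(l_in, num_units=5, untie_biases=True)
>>> l2.b.get_value().shape  # one bias per unit and trailing position
(5, 10, 3)
>>> from lasagne.layers import Conv2DLayer
>>> Conv2DLayer(l_in, num_filters=5, filter_size=1).output_shape
(100, 5, 10, 3)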