The advantages of using Rectified Linear Units (ReLU) in neural networks are:
- ReLU is the hard max function max(0, x); using it as the activation function induces sparsity in the hidden units, since all negative pre-activations are mapped to exactly zero (see the first sketch after this list).
- ReLU does not suffer from the vanishing gradient problem the way sigmoid and tanh do, because its gradient is 1 for any positive input (see the second sketch after this list). It has also been shown that deep networks can be trained efficiently with ReLU even without pre-training.
- ReLU can be used in Restricted Boltzmann machines to model real- or integer-valued inputs.
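
A minimal sketch of the sparsity point, assuming pre-activations drawn from a standard normal distribution as a stand-in for a hidden layer's inputs (the random data and NumPy usage are my own illustration, not from any specific model):

```python
import numpy as np

# Hypothetical pre-activations for a batch of 1000 examples and 256 hidden units.
rng = np.random.default_rng(0)
pre_activations = rng.standard_normal((1000, 256))

relu_out = np.maximum(0.0, pre_activations)            # ReLU: max(0, x)
sigmoid_out = 1.0 / (1.0 + np.exp(-pre_activations))   # sigmoid for comparison

# ReLU zeroes out roughly half of the units here, giving a genuinely sparse
# representation; sigmoid outputs can be small but are almost never exactly zero.
print("fraction of exact zeros (ReLU):   ", np.mean(relu_out == 0.0))
print("fraction of exact zeros (sigmoid):", np.mean(sigmoid_out == 0.0))
```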
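
And a sketch of the gradient comparison, again just an illustration with a few hand-picked input values:

```python
import numpy as np

def relu_grad(x):
    # Derivative of max(0, x): 1 for positive inputs, 0 otherwise.
    return (x > 0).astype(float)

def sigmoid_grad(x):
    s = 1.0 / (1.0 + np.exp(-x))
    # Derivative of the sigmoid: s * (1 - s), which is at most 0.25.
    return s * (1.0 - s)

x = np.array([-6.0, -2.0, 0.5, 2.0, 6.0])
print("ReLU gradients:   ", relu_grad(x))     # stays 1 for any positive input
print("sigmoid gradients:", sigmoid_grad(x))  # shrinks toward 0 for large |x|

# Backpropagation multiplies these local gradients across layers: a product of
# sigmoid gradients (each <= 0.25) shrinks toward zero as depth grows, while
# ReLU's gradient of 1 on the active path passes the error signal through intact.
```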