What
is Activation Function?
It’s
just a thing function that you use to get the output of node. It is also known
as Transfer
Function.
Why
we use Activation functions with Neural Networks?
It
is used to determine the output of neural network like yes or no. It maps the
resulting values in between 0 to 1 or -1 to 1 etc. (depending upon the
function).
The
Activation Functions can be basically divided into 2 types-
1. Linear Activation Function
2. Non-linear Activation Functions
Linear
or Identity Activation Function
As you can see the function is a line or linear. Therefore, the output of the functions will not be confined between any range.
Equation : f(x) = x
Range : (-infinity
to infinity)
It
doesn’t help with the complexity or various parameters of usual data that is
fed to the neural networks.
Non-linear Activation Function
The Nonlinear Activation Functions are the most used activation functions. Nonlinearity helps to makes the graph look something like this
It
makes it easy for the
model to generalize or adapt with variety of data and to differentiate between
the output.
The
main terminologies needed to understand for nonlinear functions are:
Derivative or Differential: Change
in y-axis w.r.t. change in x-axis.It is also known as slope.
Monotonic function: A
function which is either entirely non-increasing or non-decreasing.
The
Nonlinear Activation Functions are mainly divided on the basis of their range
or curves-
1. Sigmoid or Logistic Activation Function
The Sigmoid Function curve looks like a S-shape.
Fig: Sigmoid Function
The
main reason why we use sigmoid function is because it exists between (0
to 1). Therefore,
it is especially used for models where we have to predict
the probability as
an output.Since probability of anything exists only between the range of 0
and 1, sigmoid
is the right choice.
The
function is monotonic but function’s
derivative is not.
The
logistic sigmoid function can cause a neural network to get stuck at the
training time.
The softmax
function is a more generalized logistic activation
function which is used for multiclass classification.
0 Comments