Activation Functions
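$$\mathrm{Sigmoid}(x) = \frac{e^{x}}{e^{x} + 1} = \frac{1}{1 + e^{-x}}$$
$$\mathrm{Tanh}(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}}$$
$$\mathrm{ReLU}(x) = \max(0, x)$$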
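A minimal sketch of the three activations above in plain Python, operating on scalars. The function names are illustrative, and the branch on the sign of x in `sigmoid` is an added assumption to avoid overflow in `math.exp` for large negative inputs; it is algebraically the same as the two equivalent forms of the formula.

```python
import math


def sigmoid(x: float) -> float:
    """Sigmoid(x) = 1 / (1 + e^-x), evaluated without overflowing exp."""
    if x >= 0:
        # e^-x is at most 1 here, so this form is safe.
        return 1.0 / (1.0 + math.exp(-x))
    # For negative x use the equivalent form e^x / (e^x + 1).
    z = math.exp(x)
    return z / (z + 1.0)


def tanh(x: float) -> float:
    """Tanh(x) = (e^x - e^-x) / (e^x + e^-x); delegated to the stdlib."""
    return math.tanh(x)


def relu(x: float) -> float:
    """ReLU(x) = max(0, x)."""
    return max(0.0, x)


if __name__ == "__main__":
    for value in (-2.0, 0.0, 2.0):
        print(value, sigmoid(value), tanh(value), relu(value))
```

Sigmoid maps to (0, 1), Tanh maps to (-1, 1), and ReLU passes positive inputs unchanged while zeroing negative ones.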
Loss Functions
1. Mean Absolute Error
$$\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} \lvert y_i - \hat{y}_i \rvert$$
2. Mean Squared Error
$$\mathrm{MSE} = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2$$
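As a rough illustration, MAE and MSE can be computed directly from their definitions. The function names, the length check, and the sample values below are assumptions made for this sketch, not part of the original notes.

```python
from typing import Sequence


def _check_lengths(y_true: Sequence[float], y_pred: Sequence[float]) -> int:
    """Return n, the number of samples, after validating the inputs."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    return len(y_true)


def mae(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute error: average of |y_i - y_hat_i|."""
    n = _check_lengths(y_true, y_pred)
    return sum(abs(y - y_hat) for y, y_hat in zip(y_true, y_pred)) / n


def mse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean squared error: average of (y_i - y_hat_i)^2."""
    n = _check_lengths(y_true, y_pred)
    return sum((y - y_hat) ** 2 for y, y_hat in zip(y_true, y_pred)) / n


if __name__ == "__main__":
    # Made-up targets and predictions, for illustration only.
    targets = [3.0, -0.5, 2.0, 7.0]
    predictions = [2.5, 0.0, 2.0, 8.0]
    print(mae(targets, predictions))  # 0.5
    print(mse(targets, predictions))  # 0.375
```

Because MSE squares each error, the single error of 1.0 contributes 1.0 of the 1.5 total squared error, while in MAE it contributes only half of the total absolute error. This is why MSE is more sensitive to outliers.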
3. KL Divergence
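Measures how much a distribution P(x) diverges from a reference distribution Q(x). It is not a true distance: it is asymmetric, so the divergence of P from Q generally differs from the divergence of Q from P.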
$$D_{\mathrm{KL}}(P \parallel Q) = \sum_{x} P(x) \cdot \log\left(\frac{P(x)}{Q(x)}\right)$$
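A small sketch of the discrete KL divergence, assuming P and Q are given as lists of probabilities over the same outcomes, in the same order. The function name and the handling of zeros are assumptions of this example: terms with P(x) = 0 are skipped (the usual convention that 0 · log 0 = 0), and the result is infinite when Q(x) = 0 while P(x) > 0. The natural logarithm is used, so the result is in nats.

```python
import math
from typing import Sequence


def kl_divergence(p: Sequence[float], q: Sequence[float]) -> float:
    """D_KL(P || Q) = sum over x of P(x) * log(P(x) / Q(x)), in nats."""
    if len(p) != len(q):
        raise ValueError("p and q must cover the same outcomes")
    total = 0.0
    for p_x, q_x in zip(p, q):
        if p_x == 0.0:
            continue  # convention: 0 * log(0 / q) contributes nothing
        if q_x == 0.0:
            return math.inf  # P puts mass where Q has none
        total += p_x * math.log(p_x / q_x)
    return total


if __name__ == "__main__":
    # Illustrative distributions over two outcomes.
    p = [0.5, 0.5]
    q = [0.25, 0.75]
    print(kl_divergence(p, q))  # D_KL(P || Q)
    print(kl_divergence(q, p))  # D_KL(Q || P), a different value
    print(kl_divergence(p, p))  # 0.0
```

The two printed values for swapped arguments differ, which illustrates that KL divergence is not symmetric.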