• [Image: softmax formula, $\mathrm{softmax}(z)_i = \frac{e^{z_i}}{\sum_j e^{z_j}}$]
  • Why does it take this form?
    • We want the outputs to sum to 1, so each output is expressed as a fraction of the total sum.
    • Additionally, we want each term to be positive regardless of the sign of the input z, so we use the exponential $e^{z}$.
  • The result has the same format as a one-hot vector.
    • A one-hot vector can be seen as a softmax output in which one class has probability 100% and all the others have 0%.
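
The points above can be checked with a minimal sketch in plain Python. The function name `softmax` and the example inputs are chosen here for illustration and do not come from the original notes. Subtracting the maximum before exponentiating is a common numerical-stability step added as an assumption; it does not change the result.

```python
import math

def softmax(z):
    """Map a list of real numbers to positive values that sum to 1."""
    # Assumption (not in the notes): shift by the max to avoid overflow
    # in exp(); the shift cancels out in the ratio below.
    shift = max(z)
    # exp() makes every term positive, whatever the sign of the input.
    exps = [math.exp(v - shift) for v in z]
    total = sum(exps)
    # Each output is its term's fraction of the total sum.
    return [e / total for e in exps]

# Mixed-sign inputs still give positive outputs that sum to 1.
probs = softmax([2.0, -1.0, 0.5])

# When one input dominates, the output approaches a one-hot vector.
peaked = softmax([100.0, 0.0, 0.0])
```

Here `probs` has the same length as the input and its largest entry sits at the position of the largest input, while `peaked` is numerically indistinguishable from the one-hot vector with a 1 in the first position.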