The softmax function, also known as softargmax[1]: 184 or normalized exponential function,[2]: 198 converts a vector of K real numbers into a probability distribution of K possible outcomes. It is a generalization of the logistic function to multiple dimensions, and used in multinomial logistic regression. The softmax function is often used as the last activation function of a neural network to
![Softmax function - Wikipedia](https://cdn-ak-scissors.b.st-hatena.com/image/square/1c654a58bb741aab520f6495ea459aea8836fa18/height=288;version=1;width=512/https%3A%2F%2Fupload.wikimedia.org%2Fwikipedia%2Fcommons%2Fthumb%2F0%2F02%2FNeural_network_with_dark_background.png%2F1200px-Neural_network_with_dark_background.png)