Posts by Tags

activation

Introduction to the Softmax Function

2 minute read

Published:

I first encountered the softmax function while learning about convolutional neural networks and transformers, where it’s used as a crucial activation function. At the time, I didn’t fully grasp what this function does. In this post, I aim to break down the softmax function, explaining what it is, how it works, and why it’s important.