In my post on image captioning and visualising decoding algorithms, we used beam search as one of the decoding algorithm to generate captions. In both posts, we only saw the end result i.e the caption generated by the algorithm but we did not discuss the impact of various algorithm specific parameters on the end result. This page is my attempt to address that by showing the impact of following parameters on captions generated using beam search decoder:
- Beam Width: How many words to keep track of at every step
- Max Hypotheses: What is the maximum number of hypotheses after which the algorithm stops
- Max Steps: What is the maximum number of steps after which the algorithm stops