This is a nice semi-technical article: a historical treatment of branch prediction and its performance vs. cost tradeoffs, covering schemes that range from no prediction at all to the hybrid approaches seen from 1998 to 2010. More recent branch prediction technology is even more arcane and not necessarily public.
https://danluu.com/branch-prediction/
With diagrams too!
Earliest referenced machine 1977, latest 2010.