.
Found this interesting presentation (20 page PDF)
ARM810: Dancing to the Beat of a Different Drum by Guy Larri
(Mentioned on
Who are the Computer Architects? by Mark Smotherman)
with this pair of pipeline diagrams:
Attachment:
Screenshot 2020-01-08 at 12.01.10.png
The ARM7's 3 stage pipeline is extended to 5 stages, there's a double-pumped read port to the cache, and the clocks-per-instruction improves from 1.9 to 1.4