Do all FPU instructions complete in a single SYSCLK cycle?
This should be simple, but I can't find it documented anywhere. I am running an STM32L4P5 at its maximum clock speed (120 MHz), and I have moved all code to SRAM so that it runs with zero wait states. I have an extensive calculation to do on the FPU, and I am optimizing it by keeping constants and other reused values resident in the FPU registers.
I am wondering if any of the FPU instructions take longer than a single clock tick to execute. Divide? Square root? Are there any inserted wait states?
If there is a solid answer and it is in the documentation, please point me to where it is documented. Thanks!
