How is CMSIS DSP and NN library implement using SIMD instruction and FPU?
I came across a documentation stating that arm m4+ processors have CMSIS DSP api that is implemented with SIMD instructions. When we try to take the library file to use it there are 2 variants single precision and doubles precision FPU. I assumed that FPU is a hardware unit with its own instructions to accelerate calculations , however after reading about SIMD I am confused how SIMD and FPU are related to CMSIS DSP library.
