Skip to main content
Graduate
August 25, 2025
Question

Transformer models inference on STM32MP257F using its NPU

  • August 25, 2025
  • 1 reply
  • 896 views

Is NPU of STM32MP257F capable of running inference of Transformers models like Small VLMs on this device using its NPU? what frameworks are supported like execuTorch or onnxruntime ?

    This topic has been closed for replies.

    1 reply

    ST Employee
    August 27, 2025

    Hello @ramkumarkoppu 

     

    I recommend you to have a look to the expansion package X-LINUX-AI.

    On MPU, ST provide the STAI_mpu api which unified different framework: ONNX, TFLite™, and Network Binary Graph (NBG).

    You will find in the wiki different reference for implementation to benchmarking:

    https://wiki.st.com/stm32mpu/wiki/Category:X-LINUX-AI_expansion_package

     

    Regards,