NanoEdge AI Studio 5.0.2 high offline accuracy; poor on STEVAL-STWINBX1
- February 25, 2026
- 1 reply
- 183 views
Dear ST Support Team,
I am currently working on a binary voice classification task on the STEVAL-STWINBX1 (for example, “echo” vs “other”) using NanoEdge AI Studio 5.0.2, and I would like to ask for your advice.
My workflow
Data collection
I use STEVAL-STWINBX1 to collect voice data.
Each sample is 1 second long.
I collected more than 200 samples per class.
Data acquisition firmware:
fp-sns-datalog2\fp-sns-datalog2\STM32CubeFunctionPack_DATALOG2_V3.1.0\Projects\STM32U585AI-STWIN.box\Applications\DATALOG2
Dataset conversion
I use batch_to_nanoedge.bat to batch-convert the collected files into NanoEdge-compatible format.
Training in NanoEdge AI Studio 5.0.2
I import the converted data into NanoEdge AI Studio 5.0.2
Perform Data Management (DM)
Train a classification model
Many generated models show accuracy above 97% in Studio
Deployment to MCU
I deploy the generated library to the MCU following this ST wiki page:
I integrate the library into:
FP-AI-MONITOR2_16_3\FP-AI-MONITOR2_V1.0.0_RC8\FP-AI-MONITOR2_V1.0.0\Middlewares\ST\NanoEdge_AI_Library
Problem
Although the classification accuracy in NanoEdge AI Studio is very high (often >97%), the real-time classification accuracy on the MCU is very poor.
My question / suspicion
I suspect that the data used by NanoEdge AI Studio for training/classification may not be the same representation as the raw data sent to the NanoEdge library on the MCU. For example:
Studio-side data may be normalized, or
converted using microphone sensitivity scaling
while the MCU-side classifier may be receiving raw sensor data directly.
This possible mismatch might explain the large accuracy gap between Studio and MCU deployment.
Questions
Is there any issue with the workflow I am using?
For voice classification on STEVAL-STWINBX1, what is the recommended way to ensure that the training data format and MCU runtime input format are strictly consistent?
Does NanoEdge AI Studio expect raw sensor samples, sensitivity-scaled values, or normalized inputs for this type of workflow?
Have you seen similar cases (high Studio accuracy but poor MCU accuracy), and what are the common causes / best practices to solve them?
Any guidance would be greatly appreciated.
Thank you very much for your support.
Best regards,
