STM32L4 + Cellular Modem: IWDG vs Long Blocking Operations Dilemma

Question

Hello STM32 Community,I'm developing a cellular IoT device using STM32L4 and facing an architectural challenge regarding IWDG implementation with long cellular communication timeouts. I'd appreciate your expertise and recommendations.System Overview:Hardware: STM32L4 custom board with cellular modem (Quectel BG95/BG96)Communication: UART-based AT commands with IDLE line interrupt (no DMA)Application: Periodic sensor data transmission to server via UDPPower Management: Device enters STOP2 mode between measurement cyclesIWDG Configuration:Clock Source: LSI (32kHz)Timeout: ~32.8 seconds maximum (IWDG_PRESCALER_256, Reload=4095)Behavior: Once enabled, cannot be disabled; pauses during STOP modesThe Challenge: My low-level AT command function implements timeout-based communication to prevent infinite loops:int <function_name>(const char *command, const char *expected_response, 
 /* other params */, int timeout_ms) {
 uint32_t start_time = HAL_GetTick();
 
 // Send AT command via UART
 
 while ((HAL_GetTick() - start_time) < timeout_ms) {
 if (rx_data_ready_flag) {
 // Copy UART response to buffer
 // Check for expected response patterns
 }
 
 if (strstr(response_buffer, expected_response)) {
 // Parse response and return success
 break;
 }
 
 // Handle error responses, retries, etc.
 }
 
 return result;
}The Problem: Some legitimate cellular operations require timeouts exceeding IWDG maximum:Network Time Protocol (NTP) synchronization: Up to 125 secondsNetwork registration in poor coverage: Up to 60 secondsCurrently, IWDG triggers reset during these legitimate operations, causing communication failures.Questions for the Community:Error Masking Concern: If I add HAL_IWDG_Refresh(&hiwdg) inside the timeout loop every 25 seconds, could this mask genuine errors? For example, if strstr() encounters issues, or if the system enters an unexpected state but continues looping?Architecture Recommendations: What's the best practice for handling this scenario?Implement progress-based IWDG refresh (only refresh when receiving data)?Real-World Experience: For those working with cellular IoT applications, how do you balance IWDG protection with legitimate long network operations?Debugging Implications: If I implement periodic IWDG refresh, what debugging strategies would you recommend to ensure I'm not masking critical issues?Current Timeout Examples:Standard AT commands: 300-5000msNetwork registration: 60,000msNTP time sync: 125,000msModem initialization: 30,000msThe system works reliably when IWDG is disabled during development, but I need it enabled for production deployment to handle potential firmware hangs, memory corruption, or hardware issues. The device operates in remote locations where manual recovery isn't feasible, making robust watchdog implementation critical. However, cellular connectivity can be unpredictable, and legitimate operations sometimes require extended timeouts. Any insights, experiences, or architectural recommendations would be greatly appreciated!Best regards,NG

Andrew Neil · Accepted Answer

Using a State Machine should make it easy to use non-blocking delays!

@GR88_gregni wrote:
The blocking delays are necessary because if the modem doesn't successfully obtain an IPv4/IPv6 address, the system cannot proceed to the next step.

That doesn't follow at all - that can certainly be handled without blocking delays.

@GR88_gregni wrote:
if the first chunk times out, I need to ensure the low-level function continues listening for the response rather than giving up. This would require substantial code restructuring and make the implementation much more complex and less readable.

It shouldn't do; eg,

 while ((HAL_GetTick() - start_time) < timeout_ms) {
 if (rx_data_ready_flag) {
 // Copy UART response to buffer
 // Check for expected response patterns
 }
 
 if (strstr(response_buffer, expected_response)) {
 // Parse response and return success
 break;
 }

 if( time_to_update_wd() )
 {
 // Update the WD
 }
 
 // Handle error responses, retries, etc.
 }

Andrew Neil · Answer

Don't use blocking delays. If you really must use blocking delays, divide them into "chunks" of less than the IWDG timeout.

Sign up

Login with SSO

Login to the community

Login with SSO

Scanning file for viruses.

This file cannot be downloaded