I'm afraid that not so simple, you can't work without a running Linux on the platform.
There is some mandatory learning curve related to Linux system. Overall, MPUs are more complex to use than STM32 MCUs.
anyway, you could start with with a device tree update form DK2 using CubeIDE (new STM32 project from board DK1), then add the peripherals you need for Cortex-M4 side (as many init are done by Linux, you need to give it the information in the DT).
Then compile/update TF-A/uBoot(FIP)/Kernel device tree by following https://wiki.st.com/stm32mpu/wiki/How_to_compile_the_device_tree_with_the_Developer_Package
Flash the SD-Card with new files as you do for getting started (using CubeProgrammer with USB) .
Then you could use CubeIDE to create/update/debug the M4 FW
https://wiki.st.com/stm32mpu/wiki/Getting_started/STM32MP1_boards/STM32MP157x-DK2/Develop_on_Arm%C2%AE_Cortex%C2%AE-M4/Modify,_rebuild_and_reload_a_firmware
Once working, the M4 FW should be placed in the SD-card in a linux folder (e.g. using scp from Linux PC to DK2 Linux connected in 'Ethernet over USB')
Then M4 FW could be started, see https://wiki.st.com/stm32mpu/wiki/Linux_remoteproc_framework_overview#Remote_processor_boot_through_sysfs
Regards.