It can be stretching at the instant where you have pointed.
If you refer to $FX3_SDK_Installation_path$\EZ-USB FX3 SDK\1.3\firmware\fx3_sdk_1_3_4_src\sdk\firmware\src\lpp\cyu3i2c.c,
you can see that in CyU3PI2cReceiveBytes() API, there is a do..while loop. In that loop, there is a while loop which retrieves the data from the FIFO. After the Step 8 in your comments is reached successfully, FX3 waits for the data from the slave. There is a timeout set for read operation as a whole. There is a counter 'glI2cReadTimeout' which is initialized to 0xFFFFF. If the CPU is running at 201.6MHz then the minimum timeout duration would be 5.2ms. So, if clock stretching is involved between steps 8 and 11, then it is good to complete the read operation within 5.2ms.