That MIPI bitrate is huge. You will need something with custom hardware, like the CPUs which have built-in ETH controllers which do all the dirty work and have a dedicated DMA shoving the data into buffers from where you can extract it at your leisure.
Also, unless you structure your code in a very specialised way, your processing will be very slow. Even a 168MHz ARM32 will be slower than you think unless you really optimise it. As an example I am currently getting 140kbytes/sec ETH transmit speed on mine, LWIP, FreeRTOS, FatFS reading of > 1MB/sec. The bottleneck is probably LWIP but it shows how much care you will need.
Cameras all use specialised processors.