I’ve noticed that outside of the stock use cases and combinations of them, it is surprisingly difficult to make the ESP32 do anything. For example, I wanted to crop and apply simple filters to video being streamed from an ESP32-CAM. That should be doable on a setup that can do facial recognition and AI stuff, right? Clearly not without writing your own libraries.
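
To make "crop and apply simple filters" concrete, here is a minimal sketch in plain C of what the per-frame work could look like. It assumes the frame is already available as a raw 8-bit grayscale buffer (not JPEG), and the names `crop_gray8` and `threshold_gray8` are hypothetical helpers, not part of any ESP32 library. Getting frames out of the camera in that format and back into the stream is not covered here.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: copy a w x h window starting at (x0, y0) out of an
 * 8-bit grayscale frame of size src_w x src_h into dst (w * h bytes).
 * Returns 0 on success, -1 if the window does not fit inside the frame. */
int crop_gray8(const uint8_t *src, size_t src_w, size_t src_h,
               size_t x0, size_t y0, size_t w, size_t h, uint8_t *dst)
{
    if (x0 > src_w || y0 > src_h || w > src_w - x0 || h > src_h - y0) {
        return -1;
    }
    for (size_t row = 0; row < h; row++) {
        /* Each output row is a contiguous slice of one source row. */
        memcpy(dst + row * w, src + (y0 + row) * src_w + x0, w);
    }
    return 0;
}

/* Hypothetical helper: a very simple in-place filter (binary threshold).
 * Pixels at or above cutoff become 255, all others become 0. */
void threshold_gray8(uint8_t *buf, size_t len, uint8_t cutoff)
{
    for (size_t i = 0; i < len; i++) {
        buf[i] = (buf[i] >= cutoff) ? 255 : 0;
    }
}
```

A caller would run `crop_gray8` on each incoming frame and then pass the cropped buffer (length `w * h`) to `threshold_gray8` before encoding or sending it on.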