Improving training and quality idea
As everyone knows a common issue with AI is that it often screws things up, whether that is faces at an angle or people/objects at a distance.
So had this idea that I would like to hear whether people think could work or not. The overall idea is to improve on these issues, while at the same time being able to do this using fairly low-quality training data and especially videos for training without ruining the image quality.
The idea is to have a separate model containing only perspective information, so as you train the AI using a video, let's say a random movie, the AI will analyze each frame and create a mesh by identifying key features, like distance between eyes, mouth and nose etc.

The idea would then be that this information could be used with any human as these features don't really change that much between humans. Obviously, it should allow for deviation.
And this could be done for any identified object in the movie, like cars etc. The important thing here is not quality, but purely information about what objects look like from all kinds of angles, which is why low-quality videos would work fine.
Eventually, one would then use this data for actual creation to guide the AI so it gets angles and distant objects more correct, while also reducing the amount of training data needed, as this could be used on all faces or trained data. Furthermore, the AI would be able to generate people at a distance and could simply scale down the perspective data thereby maintaining the correct proportions. The data could also potentially help the AI identify which features should be hidden, if behind other objects or even where they should be correctly rather than creating a third leg or arm.
An obvious question is how would it handle unknown objects, like a robot or science fiction spaceship, which it would still have to generate using whatever perspective data would make the most sense, so if it is a humanoid robot, it could draw on that data etc.
Do people think something like that could improve it? or wouldn't it work at all?