I assume it relies partly on a sequence of frames to build up that kind of information. Just as conventional supersampling relies on you moving your head to achieve better resolution, this relies on the image moving from frame to frame. Well, that's just my guess.
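A toy sketch of that guess, in 1D: if each low-res frame samples the scene at a slightly different sub-pixel offset, then several frames together hold detail that no single frame does. Everything here (the `scene` values, `render_low_res`, `accumulate`) is made up for illustration, and it is plain interleaving, not the neural method from the article.

```python
# Toy 1D illustration of multi-frame accumulation (an assumption about the
# general idea, not the technique described in the linked article).

def render_low_res(scene, scale, offset):
    """Point-sample every `scale`-th value of `scene`, starting at `offset`."""
    return scene[offset::scale]

def accumulate(frames, scale):
    """Interleave frames back into one high-res row.

    Assumes frame k was sampled at sub-pixel offset k.
    """
    width = sum(len(frame) for frame in frames)
    out = [None] * width
    for offset, frame in enumerate(frames):
        out[offset::scale] = frame
    return out

# Hypothetical high-res row with fine stripes (think thin lines on a flag).
scene = [0, 9, 0, 9, 5, 5, 0, 9]
scale = 2

# One low-res frame per sub-pixel offset, as if the view shifted between frames.
frames = [render_low_res(scene, scale, k) for k in range(scale)]
recovered = accumulate(frames, scale)

print(frames[0])   # a single frame misses the stripes entirely
print(recovered)   # the combined frames contain them
```

A single frame here never sees the 9s at all, so no amount of sharpening on that one frame could bring the stripes back; they only show up once the second, shifted frame is merged in.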
How, for example, does an AI conclude things like the flag should have a diamond in the middle and thin green lines?
This is interesting: https://uploadvr.com/facebook-neural-supersampling/