I'd go with a command-line interface (CLI).
A CLI lets users type a command, expressed as a sequence of characters, in a console or terminal window and then press the Enter key to execute it. In this case, Sarah is typing an “open document 3” command into the command-line interface to open a file on her computer.
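To make that concrete, here is a minimal Python sketch of how a command-line program might handle a typed line such as "open document 3". The function names `parse_command` and `run_command` are hypothetical, made up for illustration, and the sketch only reports what it would open rather than touching a real file.

```python
def parse_command(line):
    """Split a typed line into a command name and its arguments."""
    parts = line.strip().split()
    if not parts:
        return None, []
    return parts[0].lower(), parts[1:]


def run_command(line):
    """Interpret one line of input, as if the user had just pressed Enter."""
    name, args = parse_command(line)
    if name is None:
        return "no command entered"
    if name == "open":
        if not args:
            return "open: nothing to open"
        # Assumption: a real CLI would open the file here; this sketch only reports it.
        return "opening " + " ".join(args)
    return "unknown command: " + name


if __name__ == "__main__":
    print(run_command("open document 3"))
```

The point is just that the CLI receives plain text, splits it into a command and its arguments, and then decides what to do with them.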
Have one person take the photo from down on the ground. From that low angle, the person jumping will look like they are jumping higher than they really are.
I hope this helps.
I would use a top-down approach: start from the top and work your way down to the bottom.