As you seem to appreciate them, here's another detailed answer:
It's impossible to make a room which is acoustically perfect and it's incredibly expensive to make a room which is even very good. Adding picture to the equation just complicates the problem further and requires even more compromises. This means that you generally have to prioritise the compromises biased towards either picture or sound, depending on the work you specialise in. That's one of the reasons why in TV and Film post production you have separate video editing suites and sound editing/mix rooms.
The problem you've always got is that the speakers have to point towards the monitoring position and therefore also at the wall behind you. The signal from the speakers reflects off the back wall, comes back into the room and interacts with the signal coming out of the speakers. Depending on where you're sitting, these reflections are either going to summing with the signals from the speakers or cancelling with them. This means that there are going to be a bunch of frequencies which will appear to sound either much louder than they should or much quieter. In a small room the problem is worse because the amplitude (level) of the reflections is going to be higher relative to the signal from the speakers and therefore the interaction greater, both in terms of the amount of summing and cancelling at individual frequencies and in terms of the number of frequencies affected.
When we process audio we do so on the basis of what sounds best, which means if we are doing our job well, we're going to be (subconsciously) applying EQ, choosing effects and relative levels which counteract or minimise the undesirable summing and cancelling caused by the room's acoustics. The result might sound good in your room but would sound completely different in any other room (except one with perfectly identical acoustic properties). In your case, the EQ'ing (sweetening) you do may have the opposite effect when your mix is played in another room, your levels could be anywhere from slightly wrong to severely wrong and there maybe artefacts from your noise reduction which you can't hear or conversely, artefacts which you feel are unacceptable which would be unnoticeable elsewhere.
Now, if you put your nearfields on your desk and your desk is near the back wall, your speakers are going to be say around 6' from the back wall causing two problems: 1. Your sound is going to be originating from a different place to your picture (on the front wall) which will disassociate the sound from the picture and sound weird and 2. It's going to increase even further the amount of reflection from the back wall and therefore the amount of summing/cancellation and the number of frequencies affected.
Using broadband absorption on the rear wall would appear to solve our problem by eliminating the reflections in the first place but the simple fact is that even the most expensive/sophisticated absorption is only partially effective and then only partially effective in certain frequency bands. You will hear a dramatic improvement with a lot of high quality acoustic absorption on the wall behind you but so severe are the problems to start with, there's almost no chance this improvement will be enough to make the room usable to produce professional quality results.
Going back to your original suggestion of the nearfields on the wall next to the video monitor. The speakers are now much further away from the rear wall, which would reduce the problems from the wall behind you, providing you are sitting closer to the speakers than the rear wall, which you have indicated will not be the case.

So in this case, the improvement from having the speakers further away from the rear wall will be negligible. Plus, this setup will severely increase the effect of a number of other problems as there will now be a lot more reflective surfaces between you and the speakers (side walls, ceiling and floor) as well as a boundary effect caused by the speakers being just a few inches from the front wall. And, due to the design of nearfields, not only are you going to have severe frequency inaccuracies, you're also likely to have quite severe inaccuracies in the stereo sound-field and even potentially have to overdrive your speakers to produce the sound pressure levels required by the calibration (probably 79dBSPL as you would be sitting further away).
I'm not saying that absolutely, definitely your room will be unusable but with your combination of speakers, room size and required monitoring position, in my opinion you are going to need to get a professional acoustician in, to stand even a slim chance of achieving accurate enough sound reproduction to allow you to consistently produce broadcast quality mixes.
G