Correct me if I'm wrong, but when you record a command buffer, those commands are not executed immediately.
Rather, they are executed when the command buffer is submitted to the render queue. Only then does the driver process it and send it to the graphics card.
So command buffers do more than make it possible to "build sequences of commands asynchronously on multiple threads": the driver also has more knowledge about "what to draw".
One more plus is that a command buffer can be baked once and then submitted repeatedly without being recorded again. That is a gain for the CPU.
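To make the record-once, submit-many idea concrete, here is a toy sketch in plain C++. It is not the Vulkan or GL_NV_command_list API: CommandBuffer, Queue, run_demo and DemoResult are hypothetical names made up for illustration, and a real driver may do part of its work at record time. It only models that recording stores commands and that submission is what runs them.

```cpp
#include <cstddef>
#include <functional>
#include <utility>
#include <vector>

// Toy stand-in for a command buffer (hypothetical, not a real graphics API):
// recording only stores the work, nothing is executed yet.
class CommandBuffer {
public:
    void record(std::function<void()> cmd) { commands_.push_back(std::move(cmd)); }
    std::size_t size() const { return commands_.size(); }
    void execute() const {
        for (const auto& cmd : commands_) cmd();
    }
private:
    std::vector<std::function<void()>> commands_;
};

// Toy stand-in for a queue: submission is the point where commands run.
class Queue {
public:
    void submit(const CommandBuffer& cb) { cb.execute(); }
};

// Counter values observed at each stage of the demo.
struct DemoResult {
    int after_record;
    int after_first_submit;
    int after_second_submit;
};

DemoResult run_demo() {
    int draws = 0;  // stands in for "work done by the GPU"

    // Record two "draw" commands once.
    CommandBuffer cb;
    cb.record([&draws] { ++draws; });
    cb.record([&draws] { ++draws; });

    DemoResult r{};
    r.after_record = draws;  // still 0: recording did not execute anything

    Queue queue;
    queue.submit(cb);        // first submission runs both commands
    r.after_first_submit = draws;

    queue.submit(cb);        // same buffer again, no re-recording needed
    r.after_second_submit = draws;
    return r;
}
```

Calling run_demo() shows the counter unchanged after recording and advancing by two on each submission, which is the CPU saving I mean: the recording cost is paid once.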
When I read GL_NV_command_list, I see very similar ideas in it. So is this not a good direction for development?
I'm not saying that GL_NV_command_list must be added to the core, but something based on it could be.