Just look at it from a business POV: when all benchmarks and games use shaders exclusively, and CAD apps in FF mode aren't fillrate-limited, do you invest transistors for FF or for even more programmable power?
To keep the CAD market, some rasterizer and triangle-indexing features need to be kept: wide lines, smooth lines/poly, edge-flags (for quads in wireframe). And then you also make low-precision versions of arithmetic instructions, as FF can easily stay limited to them. (note how the GLSL precision-qualifiers are more meaningful on geforces) .
Then, you create 2 versions of the drivers - one for gamers (geforce), and one optimized for pushing more geometry from glBegin() sysram to vram (quadro) .
Any remaining doubts

?