What about the feature of direct frame buffer access? SDL (what is used by GE) has got the feature of getting the frame buffer pointer to individual manipulate pixels. Maybe that feature in GE would be very interesting, and it would make thinks like the "RayCastle" faster
It's also useful is people want to write custom draw functions for example to draw special custom shapes quickly.