Maybe they are linearizing the gain of the 3-transistor Darlington vs. load current by putting a diode/resistor network in there which increases the gain at higher currents where beta is falling off. With more current through the parallel CR5||R1||R1X combination you get less voltage across it...