Oops, more specifically, that should say "as fast as you can get [from the gate]". Output risetime generally depends on input risetime (less so with buffered families), so the general idea is that if you chain enough gates together, the risetime saturates at whatever value makes the input and output risetimes equal. Sure, it might be a little faster if you drove the gate with something screaming fast, but where are you going to get that, right?
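To put rough numbers on the saturation idea, here is a minimal sketch. It assumes a made-up linear model where each gate's output risetime is an intrinsic term plus a fraction k (less than 1) of its input risetime. The model, the function names, and the 0.5 ns / 0.3 values are illustrative assumptions, not datasheet figures for LVC or any real family.

```python
def chain_rise_times(t_in_ns, stages, t_intrinsic_ns=0.5, k=0.3):
    """Output risetime (ns) after each stage of a chain of identical gates.

    Assumed toy model: t_out = t_intrinsic + k * t_in, with 0 < k < 1.
    """
    times = []
    t = t_in_ns
    for _ in range(stages):
        t = t_intrinsic_ns + k * t  # each gate sharpens (or slows) the edge
        times.append(t)
    return times


def saturated_rise_time(t_intrinsic_ns=0.5, k=0.3):
    """Fixed point of the toy model, where input and output risetimes are equal."""
    return t_intrinsic_ns / (1.0 - k)


slow = chain_rise_times(10.0, 30)   # start from a lazy 10 ns edge
fast = chain_rise_times(0.05, 30)   # start from a hypothetical 50 ps edge

print("saturated:", saturated_rise_time())
print("from slow input:", slow[:4], "...", slow[-1])
print("from fast input:", fast[:4], "...", fast[-1])
```

In this toy model both chains end up at the same risetime. Only the first stage or two of the fast-driven chain comes out quicker than the saturated value, which is the "a little faster" case above.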
Even among CMOS, there are certainly faster families, and other devices (like comparators) that are even better examples than LVC, yes.
I would shy away from ECL, because the voltages are weird (DC bias) and the swing is small. Of course, that's fine if you don't need much voltage, and as long as all you're looking at is the step or a short pulse, you can remove the bias with a coupling capacitor.
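For a feel of how short "short" needs to be, here is a rough sketch that treats the coupling capacitor and the load as a single-pole RC high-pass and estimates how much the flat top of a pulse droops. The 50 ohm load, 100 nF capacitor, and 10 ns pulse are assumed example values, and the function name is hypothetical.

```python
import math


def droop_fraction(pulse_width_s, r_ohms, c_farads):
    """Fraction of the step amplitude lost by the end of the pulse.

    Assumes an ideal step into a single-pole RC high-pass:
    v(t) = V * exp(-t / (R * C)), so droop = 1 - exp(-T / (R * C)).
    """
    tau = r_ohms * c_farads
    return -math.expm1(-pulse_width_s / tau)


# Assumed example: 10 ns pulse, 100 nF coupling cap, 50 ohm load
d = droop_fraction(10e-9, 50.0, 100e-9)
print(f"droop: {d * 100:.3f} % of the step")
```

As long as the pulse is much shorter than R*C, the droop is roughly T/(R*C) and the capacitor only removes the bias.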
Tim