Note that you only ever use gm OR r_e, never both. That would double count it! r_e is the hybrid-pi model equivalent of gm. That's all.
Perhaps a counterpoint would help: what if the device takes 20V input drive to make 10V out? How about 100V in? What does it matter?
If the input current is approximately zero (many times smaller than the output current, anyway), there is very little power required to drive the outputs, and efficiency only depends on the output voltage range (in terms of total supply voltage).
Think about what an amplifying device is, as an ideal transistor or what have you. It is a device which draws some current through an output circuit, dependent on some input (voltage or current).
Suppose it's linear, so Ic = Vbe * gm, and if you like, Ib = Ic / hFE (where hFE is constant). Which means Vbe/Ib == Rb, a constant base resistance (for small signal purposes, in the hybrid-pi model, this is called r_π). Suppose there's no Early effect, so the collector output curves are flat horizontal. Then r_e = 1/gm, as well.* With no limit to voltage or current (it's a linear device with no bounds), you can pick any arbitrary area on the output curve, draw a load line across it, and drive that load with power equal to the area under the line. Now we can look at power output and device dissipation -- efficiency -- for a general case, with no quirks of an underlying device to worry about!
*Alternately, we might choose vertical curves, so that Vce = Vbe * mu (i.e., a voltage gain mu), or an intermediate case where the curves have a finite slope, or a variable resistance ("triode region"). For the finite slope case, there is the relationship: gm * h_oe = mu (h_oe is the slope of the output curve, the output resistance -- normally given by Early effect). If you like, the constant-current case has infinite mu, while the constant-voltage case has infinite gm. Note that the constant-current family of curves with parallel output resistance, is equivalent to the constant-voltage family of curves with series output resistance, by the Thevenin-Norton source theorem.
Load lines -- for further reading, google the term. It's a geometric technique that's as old as, well, tubes.
It's not important -- not taught -- today, because transistor curves are less reliable in some respects (e.g., Early effect, hFE variation), more reliable in others (Ic = Is * e^(Vb/Vth)), and much more flexible in ratings (whereas a given tube is a huge waste if operated much below its ratings), but at the same time... teaching a method in the abstract isn't suitable for all readers, so it's nice to check out.
Tim