You still are missing the point....there isn't any mathematical meaning, to just sit there, transmitting, (or uttering) 'nothing'...in absense of having a qualifying clock, or flag.
For example, suppose I'm just sitting in a chair, in front of you, totally silent. That doesn't have any time quality, so not very useful. Now, suppose we agreed, that you will write down what sounds there are, if any, when my flag is raised, then dropped. That gives valid data.
Suppose then, that we change the rule, so that you sit, waiting, and then act, to write down any active utterance, simply when it comes. Doing things that way will allow communication even if the data sent is 0, or, literally due to the fact that your 'symbol' is an active thing, not just 'silence'.
At any rate, in the context here, it simply implies that you can have one of your 'tokens' or symbolic characters in place of....(in place of what?). I wrote it that way, because us humans even USE A SYMBOL, when denoting zero..or nothing...it's that little round circle ; ' 0 ' (sorry to be a little snarky).
=====================================
As to the component counts, yes they do go up quite rapidly. But that effect has its limits, enlarging each individual word (parts count), but not affecting in the larger scale. That is, with a memory requirement of approx. equiv. to 100 bytes, you might have more components per word, but still have about 100 total words.
Within the word for 16 states, you could see about 4 times more raw bulk, vs the 4 bits needed, in implementing straight conventional binary. Like I said, we may have to be dragged there, kicking and screaming ...
It DOES help eliminate the need for decoding the 'efficient' binary coded bits...every time, lol.