I searched and couldn't find much about it. We need to provide the most general support for encodings and such and want to make sure ANTLR is going to be able to handle anything we throw at it.
Java VM uses 16 bit character encoding, so MBCS
streams must be converted as they are read into the VM. That's relatively independent of ANTLR
.