Character repertoire
The Unicode and ISO/IEC 10646 character set is used for both content and markup
- Every code point must be identified; this is done with ranges
- Without the Extended Naming Rules, every single code point would have to be listed individually
- All XML processors must be internationalized
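
The point about ranges can be illustrated with a minimal sketch. It assumes the repertoire is the Char production of XML 1.0 (an illustrative choice, not stated in these notes), and the names XML_CHAR_RANGES, in_repertoire and listed_size are hypothetical:

```python
from bisect import bisect_right

# Assumption: inclusive code point ranges taken from the Char production
# of XML 1.0. Five ranges describe the whole repertoire.
XML_CHAR_RANGES = [
    (0x9, 0xA),           # tab, line feed
    (0xD, 0xD),           # carriage return
    (0x20, 0xD7FF),       # everything below the surrogate block
    (0xE000, 0xFFFD),     # rest of the BMP, minus U+FFFE and U+FFFF
    (0x10000, 0x10FFFF),  # supplementary planes
]

# Sorted range starts, used for binary search.
_STARTS = [lo for lo, _ in XML_CHAR_RANGES]


def in_repertoire(code_point: int) -> bool:
    """Return True if code_point falls inside one of the declared ranges."""
    i = bisect_right(_STARTS, code_point) - 1
    return i >= 0 and code_point <= XML_CHAR_RANGES[i][1]


def listed_size() -> int:
    """Number of entries needed if every code point were listed one by one."""
    return sum(hi - lo + 1 for lo, hi in XML_CHAR_RANGES)


if __name__ == "__main__":
    print(len(XML_CHAR_RANGES), "ranges stand in for", listed_size(), "code points")
    print("U+0041 allowed:", in_repertoire(0x41))
    print("U+D800 allowed:", in_repertoire(0xD800))
```

A handful of range entries replaces a list of over a million individual code points, which is the saving that declaring by range is meant to provide.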