Does anyone know of a great small open source Unicode handling library for C or C++? I've looked at ICU, but it seems way too big.
I need the library to support:
- all the normal encodings
- finding character types – finding if a character should be allowed in identifiers and comments
- validation – recognizing nonsense