This guide should help you understand how Unicode works in Perl 5. Feel free to improve it by opening issues or submitting a pull request.
You can read each chapter by opening it or using the following links:
- 01: Perl strings
- 02: The `unicode_strings` feature
- 03: The `-C` command-line option
- 04: Some Unicode warnings
- 05: The `utf8` pragma
- 06: The `utf8::` encoding and decoding functions
- 07: The `utf8::` upgrading and downgrading functions
- 08: The Encode module
- 09: Case-folding and the `fc` function
- 10: Graphemes and the regular expression atom `\X`
- 11: Non-characters
- 12: The regular expression flag `/a`
- 13: PerlIO layers
- 14: Standard Unicode modules
- Unicode Introduction at perltuts.com
- The Standard Documentation that ships with Perl:
- Tom Christiansen's answers page on Stack Overflow
This document is made available under the CC-BY-SA license.