“There ain’t no such thing as plain text”. Joel Spolsky (what every developer must know about unicode). This text is based on the assumption, that current locale uses UTF-8 encoding. Behavior might differ for other encodings. I use Ruby 2.0 for evaluation. I was recently doing some text parsing in my native language (which is ...