This post is a reblog.
reblogged from marco
14

Notes

I love character encoding!

marco:

Tonight’s goal: Make a simple PHP class.

  • Input: a URL pointing to an HTML document.
  • Output: a UTF-8 version, regardless of what encoding it’s really in.

[…]

This is so useful, albeit to a relatively narrow range of programmers, that I feel bad not releasing it to the world, except that I assume that someone else has already done this and I just didn’t bother looking for it. (My experiences with PHP-community code are not good, so I almost always roll my own.) Any interest?

Yes, please release it. You don’t have to support it, but helping people convert into UTF-8 is a gift to mankind. I just wish we could convince more people to set everything up as UTF-8 by default. The vast majority of people don’t even know what character encoding is and release code that is an absolute nightmare to internationalize. If UTF-8 were standard, there would no such thing as “garbage characters”.

Posted: 2009-03-19 23:01

Bjorn Stromberg

My name is Bjorn Stromberg and I have a strange fascination with octopus and I live in Taipei, Taiwan. 我會講中文也看得懂繁體跟簡體字.

You can ask me something.

Or you can catch a glimpse of my secret heart.

If you're looking for FriendFeed Savior, Tumblr Savior, Muxtaster, or Tumtaster, they're in my Greasemonkey script collection :)



Feel free to email me at bjorn <at> bjornstar <dot> com

If you're looking for my Enchanter Epic Quest Guide, it's back up and better than ever.



Bjorn Stromberg's Facebook Account

This tumblelog is powered by Tumblr.