What's new

Update Bye-bye mojibake!

thomas

Unswerving cyclist
Admin
14 Mar 2002
15,969
9,208
749
After almost 11 months of arduous, mind-numbing clean-up efforts I am very glad (and relieved) to report that the 日本語 section here at the Japan Forum, including all of its subfora, has finally been cleared of mojibake! :emoji_v:

The task - unfortunately entirely manual - not only gave us the chance to get rid of garbled Japanese text, but also to update invalid and broken links. If you happen to find any leftover traces of mojibake, please do report them in this thread.

We will continue to stomp out ���� throughout all other forum sections, although at a less intensive pace.

Thank you for your patience! 🙂:
 
Much gratitude to those who labored to make the fixes. Feel free to identify yourselves here, so that you can be given the credit you deserve.
 
Not familiar with what is going on here, was it all spam?

When we switched from vBulletin to Xenforo (our current software) in 2014 most of the Japanese-language threads and posts were not properly converted. Our vBulletin installation ran on SJIS, as UTF-8 wasn't supported back in 2002. Various upgrades resulted in a Gordian knot of encryption issues. I invested a lot of time and money in solving these issues, but wasn't satisfied with the results. In the end I decided to go ahead and clean up the garbled text manually, with the kind assistance of @Toritoribe -san and @nekojita (who mysteriously disappeared about a year ago).

お疲れさまでした、thomasさん!!:emoji_clap::emoji_clap:

こちらこそ、サポートをありがとうございました。 :emoji_slight_smile: :emoji_pray:
 
I didn't know there was a difference in Unicode encoding. I figured since it was all base 16 they only had 8 bit Unicode. I didn't even know 16 bit Unicode was a thing.
 
I didn't know there was a difference in Unicode encoding. I figured since it was all base 16 they only had 8 bit Unicode. I didn't even know 16 bit Unicode was a thing.

The problem is that PHP doesn't have native Unicode support and had to rely on extensions like mbstring and iconv. Also, UTF-8 only became the default charset as of PHP 5.4.0 if I remember correctly.
 
Back
Top Bottom