What's new

Update Bye-bye mojibake!

thomas

Unswerving cyclist
Admin
Joined
Mar 14, 2002
Messages
8,772
Ratings
1 762
After almost 11 months of arduous, mind-numbing clean-up efforts I am very glad (and relieved) to report that the 日本語 section here at the Japan Forum, including all of its subfora, has finally been cleared of mojibake! :emoji_v:

The task - unfortunately entirely manual - not only gave us the chance to get rid of garbled Japanese text, but also to update invalid and broken links. If you happen to find any leftover traces of mojibake, please do report them in this thread.

We will continue to stomp out ���� throughout all other forum sections, although at a less intensive pace.

Thank you for your patience! :emoji_smile:
 

joadbres

八方凡人
Joined
Sep 19, 2016
Messages
517
Ratings
1 53
Much gratitude to those who labored to make the fixes. Feel free to identify yourselves here, so that you can be given the credit you deserve.
 

thomas

Unswerving cyclist
Admin
Joined
Mar 14, 2002
Messages
8,772
Ratings
1 762

musicisgood

Sempai
Donor
Joined
Sep 4, 2015
Messages
547
Ratings
53

thomas

Unswerving cyclist
Admin
Joined
Mar 14, 2002
Messages
8,772
Ratings
1 762
Not familiar with what is going on here, was it all spam?
When we switched from vBulletin to Xenforo (our current software) in 2014 most of the Japanese-language threads and posts were not properly converted. Our vBulletin installation ran on SJIS, as UTF-8 wasn't supported back in 2002. Various upgrades resulted in a Gordian knot of encryption issues. I invested a lot of time and money in solving these issues, but wasn't satisfied with the results. In the end I decided to go ahead and clean up the garbled text manually, with the kind assistance of @Toritoribe -san and @nekojita (who mysteriously disappeared about a year ago).

お疲れさまでした、thomasさん!!:emoji_clap::emoji_clap:
こちらこそ、サポートをありがとうございました。 :emoji_slight_smile: :emoji_pray:
 
Joined
Oct 23, 2012
Messages
681
Ratings
53
I didn't know there was a difference in Unicode encoding. I figured since it was all base 16 they only had 8 bit Unicode. I didn't even know 16 bit Unicode was a thing.
 

thomas

Unswerving cyclist
Admin
Joined
Mar 14, 2002
Messages
8,772
Ratings
1 762
I didn't know there was a difference in Unicode encoding. I figured since it was all base 16 they only had 8 bit Unicode. I didn't even know 16 bit Unicode was a thing.
The problem is that PHP doesn't have native Unicode support and had to rely on extensions like mbstring and iconv. Also, UTF-8 only became the default charset as of PHP 5.4.0 if I remember correctly.
 
Top