Welcome to Japan Reference (JREF) - the community for all Things Japanese.

Update Bye-bye mojibake!

thomas

Unswerving cyclist
Admin
Joined
14 Mar 2002
Messages
10,047
Reaction score
1,571
After almost 11 months of arduous, mind-numbing clean-up efforts, I am very glad (and relieved) to report that the 日本語 section here at the Japan Forum, including all of its subfora, has finally been cleared of mojibake! :emoji_v:

The task - unfortunately entirely manual - not only gave us the chance to get rid of garbled Japanese text, but also to update invalid and broken links. If you happen to find any leftover traces of mojibake, please do report them in this thread.

We will continue to stomp out ���� throughout all other forum sections, although at a less intensive pace.

Thank you for your patience! 🙂
 

joadbres

八方凡人
Joined
19 Sep 2016
Messages
733
Reaction score
264
Much gratitude to those who labored to make the fixes. Feel free to identify yourselves here, so that you can be given the credit you deserve.
 
musicisgood

Sempai
Donor
Joined
4 Sep 2015
Messages
1,158
Reaction score
269

thomas

Unswerving cyclist
Admin
Joined
14 Mar 2002
Messages
10,047
Reaction score
1,571
"Not familiar with what is going on here, was it all spam?"

When we switched from vBulletin to XenForo (our current software) in 2014, most of the Japanese-language threads and posts were not properly converted. Our vBulletin installation ran on SJIS, as UTF-8 wasn't supported back in 2002. Various upgrades resulted in a Gordian knot of encoding issues. I invested a lot of time and money in solving these issues, but wasn't satisfied with the results. In the end, I decided to go ahead and clean up the garbled text manually, with the kind assistance of @Toritoribe -san and @nekojita (who mysteriously disappeared about a year ago).
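
For readers wondering how garbling like this can arise, here is a minimal Python sketch of one plausible failure mode. It is not a reconstruction of what actually happened during the migration, and the sample string and variable names are made up for illustration. It shows Shift_JIS bytes being read under the wrong charset, and why the text is only recoverable while the original bytes survive.

```python
# Arbitrary sample text (an assumption, not taken from the forum database).
original = "日本語"

# The bytes as a Shift_JIS-based installation would have stored them.
sjis_bytes = original.encode("shift_jis")

# Case 1: the bytes are misread as Latin-1. The result is mojibake,
# but every byte is preserved, so the damage can still be undone.
garbled = sjis_bytes.decode("latin-1")
recovered = garbled.encode("latin-1").decode("shift_jis")

# Case 2: the bytes are misread as UTF-8 with invalid sequences replaced.
# Invalid bytes become U+FFFD (the replacement character) and the
# original byte values are lost for good.
replaced = sjis_bytes.decode("utf-8", errors="replace")

print("stored bytes :", sjis_bytes.hex(" "))
print("as Latin-1   :", repr(garbled))
print("recovered    :", recovered)
print("as UTF-8     :", repr(replaced))
```

Once bytes have been swapped for U+FFFD, no re-encoding step can bring the original characters back, which may be one reason a clean-up like this ends up being manual.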

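"お疲れさまでした、thomasさん!!" (Thank you for all your hard work, thomas-san!!) :emoji_clap::emoji_clap:

こちらこそ、サポートをありがとうございました。 (Likewise, thank you very much for your support.) :emoji_slight_smile: :emoji_pray: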
 

OoTmaster

先輩
Joined
23 Oct 2012
Messages
738
Reaction score
119
I didn't know there was a difference in Unicode encoding. I figured since it was all base 16 they only had 8-bit Unicode. I didn't even know 16-bit Unicode was a thing.
 

thomas

Unswerving cyclist
Admin
Joined
14 Mar 2002
Messages
10,047
Reaction score
1,571
"I didn't know there was a difference in Unicode encoding. I figured since it was all base 16 they only had 8-bit Unicode. I didn't even know 16-bit Unicode was a thing."

The problem is that PHP doesn't have native Unicode support and has to rely on extensions like mbstring and iconv. Also, if I remember correctly, UTF-8 only became the default charset for functions such as htmlspecialchars() as of PHP 5.4.0.
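
To illustrate the point about different encodings, here is a small Python sketch (the sample string is arbitrary and chosen only for illustration). Unicode assigns each character a code point, while UTF-8 and UTF-16 are different ways of turning those code points into bytes. Shift_JIS is not a Unicode encoding at all; it is included only because the old installation used it.

```python
text = "日本語"  # arbitrary sample, not taken from the forum

# Unicode code points of the characters, independent of any byte encoding.
code_points = [f"U+{ord(ch):04X}" for ch in text]

# Encode the same text three ways and keep the resulting bytes.
encoded = {
    name: text.encode(name)
    for name in ("utf-8", "utf-16-le", "shift_jis")
}

print("code points:", " ".join(code_points))
for name, data in encoded.items():
    print(f"{name:10} {len(data)} bytes: {data.hex(' ')}")
```

For these three characters, UTF-8 needs three bytes each, while UTF-16 and Shift_JIS need two each, and the byte values differ between all three. Software that assumes the wrong one will display garbage.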
 