[isf-wifidog] UTF-8 codebase and database

Francois Proulx fproulx at lecameleon.net
Sam 2 Avr 11:22:45 EST 2005


WARNING TO ALL DEVELOPPERS WITH CVS ACCESS
As of yesterday (4h30pm EDT) cvs commit, the codebase and the database schema is 100% UTF-8. We did this because we were starting to have inconsistencies : I translated the e-mail messages in french and we had to make sure the whole system was either ISO-8859-1 or UTF-8. The HTML encoding is now UTF-8. Now the .po file contains UTF-8 characters ( the emails are sent with (text/plain, UTF-8 headers ).I'll have to modify the gen.sh script to match UTF-8 on monday... 

WHAT THIS MEANS, is that as of now everybody MUST use an UTF-8 compliant editor if they want to commit their work. Eclipse does this job beautifully (check Tools...Preferences menu I think). Editing .po files should be done with translation tools like KBabel to insure valid UTF-8 enconding. In the near future, I suggest we change all the HTML entities in the .po file to UTF-8 characters ( should be fairly easy with find and replace in KBabel ). Using HTML entities prevents us from using spell checking etc... and we can't use existing translations in emails, because they are sent in text/plain.

P.S: Translating the live database from ISO-8859-1 to UTF-8 meant possible password MD5 hashing matching issues ( if any user had password containing special characters ). That's why we now have a "legacy system" hack ! Duh... :-( We must recode password values we receive in UTF-8 to ISO-8859-1 to match the existing MD5 in database.

Keep in mind UTF-8 !


Plus d'informations sur la liste de diffusion WiFiDog