Loading...
 
Architecture / Installation

Architecture / Installation


I Get It, Use UTF8 - Ummm, there are only 50 to Choose from, Do they All Work?

posts: 4

Message received and believed:
AGONY will be mine if I run Tiki with other than utf8! Excellent! Tiki installation program says UGLY next to the character encoding for my DB, so I am all queued up to make a replacement. So which utf8?

The choices are many.
"UTF-8 Unicode"
utf8_bin
utf8_czech_ci
utf8_danish_ci
utf8_esperanto_ci
utf8_estonian_ci
utf8_general_ci
utf8_general_mysql500_ci
utf8_hungarian_ci
utf8_icelandic_ci
utf8_latvian_ci
utf8_lithuanian_ci
utf8_persian_ci
utf8_polish_ci
utf8_roman_ci
utf8_romanian_ci
utf8_sinhala_ci
utf8_slovak_ci
utf8_slovenian_ci
utf8_spanish2_ci
utf8_spanish_ci
utf8_swedish_ci
utf8_turkish_ci
utf8_unicode_ci
utf8mb4_bin
utf8mb4_czech_ci
utf8mb4_danish_ci
utf8mb4_esperanto_ci
utf8mb4_estonian_ci
utf8mb4_general_ci
utf8mb4_hungarian_ci
utf8mb4_icelandic_ci
utf8mb4_latvian_ci
utf8mb4_lithuanian_ci
utf8mb4_persian_ci
utf8mb4_polish_ci
utf8mb4_roman_ci
utf8mb4_romanian_ci
utf8mb4_sinhala_ci
utf8mb4_slovak_ci
utf8mb4_slovenian_ci
utf8mb4_spanish2_ci
utf8mb4_spanish_ci
utf8mb4_swedish_ci
utf8mb4_turkish_ci
utf8mb4_unicode_ci


Now, you might think I should just know which to pick but considering that for a long time the correct pick was Swedish Latin, which made no sense to me, how can I know which of these to pick?

Related... I thought utf8 is unicode. So why are only two of these named unicode. That looks like the others must not be unicode.

Not really for this forum but I do want to know: if utf8 is the thing to run, what is going on with WordPress, running mb4? Or is that utf8mb4? The only thing I know for sure is that in the last 10 years, only one of more than 1,000 databases I have worked with gave me a problem when I read the data from the database using php and mysql, and that one is WordPress database using UTF8, but a different UTF8 than what I was needing I guess.

I have every faith that using utf8 for tiki is going to save agony. I think tiki software is great.

Thank you for your assistance on making sure I don't foul it up with the wrong character encoding.

posts: 1560 Germany

Unicode or General should be the best choices.

Why you would not let Tiki do that itself and then see which UTF-8 Tiki refers?

Regards,
Torsten

posts: 4

Thank you for responding.

I did not see which Tiki referred to. I still don't.

Here are my steps:
1. Use phpmyadmin to make a database. At that time, I need to select collation. The choices look like character sets but collation is not the same as a character set.
2. The only reference I see from Tiki is

it sort of goes down hill for me after that

That said, I ran the installation and Tiki appears happy with the utf8 it got, to a extent. The detailed server report says I have Latin in my stack I should lose. But the tiki db is all in utf8 in every table. I let Tiki force things that way.


posts: 4

Off topic... but is there something I have wrong causing the image i n these posts to appear broken? 2016 07 11 01 12 00

Is drag and drop of images from the clipboard enabled or enablable?


posts: 734 United States

Bringing this back to the top of the forum, this has been a continuing conundrum for me, even after ensuring that our db is completely utf8, in tiki-check.php it will always say:

collation_server latin1_swedish_ci ugly On a fresh install everything should be set to UTF-8 to avoid unexpected results. utf8_unicode_ci is the default collation for Tiki. For further information please see Understanding Encoding.

This is frustrating, even though we are completely utf8.
Any feedback on this? Thanks!


posts: 36

Hi everyone,

Given I am going through the installation process for Tiki 20 which, at the time of writing, is the latest, I thought I'd update the installation documentation.

As far as UTF8 is concerned, I looked around and the best answer I could find is to use utf8mb4_unicode_ci.

This is based on this answer from Stack Overflow which states

utf8mb4_unicode_ci is based on the official Unicode rules for universal sorting and comparison, which sorts accurately in a wide range of languages



HTH.