Opened 3 years ago

Closed 3 years ago

Last modified 3 years ago

#5874 closed defect (bug) (duplicate)

Database character set default configuration

Reported by: anrghg's profile anrghg Owned by:
Milestone: Priority: high
Component: General Keywords: has-patch


The file wp-config-sample.php suggests to configure the database charset to utf8. The origin of the file seems unknown, its traceability starts 8 years ago, but it most probably predates the year 2010 and was not updated when MySQL released support for actual UTF-8 in 2010.

In MySQL, utf8 is a proprietary charset that only supports the Basic Multilingual Plane of Unicode, as it is limited to 3 bytes, but in UTF-8, characters from U+10000 on take up 4 bytes (starting at F0 90 80 80).

So in MySQL, utf8 is a misnomer for utf8mb3, while the real UTF-8 encoding, that utf8 should be but is not, is labeled utf8mb4.

Attachments (1)

wp-config-sample.php (2.8 KB) - added by anrghg 3 years ago.

Download all attachments as: .zip

Change History (5)

#1 @Presskopp
3 years ago

  • Resolution set to worksforme
  • Status changed from new to closed

There is a function to determine the best charset/collation.


So it is ok as it is.

#2 @Otto42
3 years ago

Also, this is the meta.trac. Issues with core should be made in instead.

#3 @SergeyBiryukov
3 years ago

  • Resolution changed from worksforme to duplicate

Hi there, welcome to WordPress Trac! Thanks for the ticket.

Just noting that there is already a ticket on Core Trac for this: #WP48285.

#4 @anrghg
3 years ago

Thanks to everybody for replies.

Sorry for not retrieving before.

Note: See TracTickets for help on using tickets.