The easiest way to determine the character count of a UTF-8 string is to pass the text through utf8_decode() first and then call strlen() on the result.
php.net
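
The approach quoted above can be sketched as follows. This is a minimal illustration, not the manual's own listing: the sample string and the variable names `$s`, `$bytes`, and `$chars` are assumptions made for the example. It relies on utf8_decode() converting each UTF-8 character into exactly one byte (the ISO-8859-1 byte, or "?" when the character has no ISO-8859-1 equivalent), so strlen() on the result counts characters rather than bytes.

```php
<?php
// Hypothetical sample string: "naïve" written with an escape so the file
// encoding does not matter. U+00EF (ï) takes two bytes in UTF-8.
$s = "na\u{00EF}ve";

// strlen() counts bytes, so the multibyte character is counted twice.
$bytes = strlen($s);

// utf8_decode() collapses every UTF-8 character to a single byte,
// so strlen() on the decoded string yields the character count.
$chars = strlen(utf8_decode($s));

echo $bytes, "\n"; // 6
echo $chars, "\n"; // 5
```

Note that utf8_decode() is deprecated as of PHP 8.2, so the call above raises a deprecation notice on current versions. Where the mbstring extension is available, mb_strlen($s, 'UTF-8') gives the same count without the intermediate conversion.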