<!--- Provide a general summary of the issue in the Title above --> ## Description <!--- Provide a more detailed introduction to the issue itself. --> <!-- Why you consider it to be an issue or a bug. --> I try to tokenize text with `ทุ๊กกโคนน` , `อือหือ`, `อย่าลืมอัพการ์ดนะจ๊ะ` using nercut tokenizer. ## Expected results <!-- Tell us what should happen. --> Something should be printed.. ## Current results <!-- Tell us what happens instead. --> <!-- You can also put screenshot here. --> Result not printed   ## Steps to reproduce <!-- Steps to reproduce the behavior. --> <!-- You can also put a source code here. --> ```python import pythainlp from pythainlp import word_tokenize word_tokenize("ทุ๊กกโคนน", engine="nercut") ``` ## Your environment * PyThaiNLP version: 3.0.5 * Python version: 3.7.3 * Operating system and version (distro, 32/64-bit): 64 * More info (Docker, VM, etc.):