I think we should remove the `clause_tokenize` function because I found that its model is fitted to a domain-specific dataset, so it can't be used for general-domain Thai text (see #1011). I propose removing `clause_tokenize` until we have a new dataset to use.