Misspellings and errors in dictionary for word tokenization #557

@bact

Description

The word tokenization dictionary contains some misspelled or erroneous entries, for example คู่ทุกขู์คู่ยาก, which appears to be a misspelling of คู่ทุกข์คู่ยาก.

This issue is open to collect reports of such entries.

Your environment

  • PyThaiNLP version: 2.3.1

Files

  • pythainlp/corpus/words_th.txt
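A rough way to surface candidates like the one above is to scan the word list for character sequences that rarely occur in correctly spelled Thai. The sketch below is only a heuristic, not part of PyThaiNLP: the names `SUSPICIOUS`, `find_suspicious`, and `load_words` are hypothetical, the three patterns are assumptions chosen for illustration, and every hit still needs manual review.

```python
import re

# Heuristic patterns (assumptions for illustration, not an exhaustive list).
# Note: sara u (U+0E38) + thanthakhat is valid in words such as "พันธุ์",
# so only sara uu (U+0E39) + thanthakhat (U+0E4C) is flagged here.
SUSPICIOUS = {
    "sara uu followed by thanthakhat": re.compile("\u0e39\u0e4c"),
    "repeated tone mark": re.compile("[\u0e48-\u0e4b]{2,}"),
    "leading combining mark": re.compile("^[\u0e31\u0e34-\u0e3a\u0e47-\u0e4e]"),
}


def find_suspicious(words):
    """Return (word, reason) pairs for entries matching any heuristic pattern."""
    hits = []
    for word in words:
        for reason, pattern in SUSPICIOUS.items():
            if pattern.search(word):
                hits.append((word, reason))
    return hits


def load_words(path):
    """Read a one-word-per-line UTF-8 word list, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]
```

Assuming a local checkout of the repository, `find_suspicious(load_words("pythainlp/corpus/words_th.txt"))` would return the flagged entries for review. The patterns are deliberately narrow to limit false positives, so they will miss most other kinds of misspelling.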

Metadata

Labels

bug, corpus
