Skip to content

Add ICU wordbreak dictionary (Thai) #877

@wannaphong

Description

@wannaphong

Since ICU are include to almost all web browser, so I think we should add ICU dictionary to PyThaiNLP to use same dictionary and can deploy any system that pythainlp/nlpo3 doesn't support.

Dictionary: https://raw.githubusercontent.com/unicode-org/icu/main/icu4c/source/data/brkitr/dictionaries/thaidict.txt

Metadata

Metadata

Assignees

No one assigned

    Labels

    corpuscorpus/dataset-related issues

    Type

    No type

    Projects

    Status

    To do

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions