-
Notifications
You must be signed in to change notification settings - Fork 274
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ModuleNotFoundError when calling crfcut
engine in sent_tokenize
function
#859
Comments
|
I looking new model to removed all crfsuite model but these models are quite efficient and therefore not worth replacing. Deep learning model are not much better. |
I'm confused. If PyThaiNLP uses pycrfsuite (and it does), why not just add it to the list of requirements? |
Hello! Many functions can work without pycrfsuite if it doesn't use pycrfsuite model. Newest sent_tokenize engine can work without using pycrfsuite. We has the plan to remove all python-crfsuite models from PyThaiNLP #655 but the processing is slow. (I and other main contributors are still is not enough free time.) We was survey pythainlp user. We found many user use word tokenizer is the main feature and python-crfsuite dependency fails to build under python 3.10 #626, so we remove pycrfsuite from the list of requirements and add it to extra packages to avoid dependency problem. |
I will close this issue. If you have any question, you can create new issue. |
i've try the
crfcut
engine insent_tokenize
function in stable release version of PyThaiNLP viathis is what i expected
however, i got this as an output instead
since it is a missing package problem, it can be solved by
pip install python-crfsuite
in order to make it compatible to be used. However, is it better to fix it so that the user has no need to take an extra step to install crfsuite everytime they want to use an engine, or we can just leave it as usual here. What do you think ?The text was updated successfully, but these errors were encountered: