$ source venv/bin/activate$ pip install mecab-python3==0.996.5$ pip freeze > requirements.txt$ pip install -e git+https://github.com/nishio/rich_tokenizer#egg=rich_tokenizer[Ah, it would be, nice, if, you could, use, this, system, with, voice, input, so, I'd, like, to, test, voice, input, on].:
In [20]: for x in xs:
... : if x.feature.startswith("verb"):.
...: print(x, x.feature)
...:
...:
Use verb,independent,*,*,one-step,continuous,usable,tsukae,tsukae
say verb,independent,*,*,5dan,wagyo urinative particle,basic form,say,iu,iu
There is verb,independent,*,*,one-step,basic form,there are,il,il
Shi verb, self-supporting,*,*,sa variant, suru, continuous form, to do, shi, shi
mi verb, non-self-standing,*,*,one-stanza, continuous form, miru, mi, mi
思い verb, independent,*,*,五段・ワ行促音便,連用形,思う,オモイ,オモイ
- The only one of these that sounds like a good keyword is "use".
- I could simply filter on a single verb word, but the fact that it is a form of "it would be nice to use" is highly valued, and even if "to do" by itself is not good, if it were "to test" it would be a good keyword, so I would incorporate the surrounding several words into the feature.
Right now I'm taking a walk and using my AirPods for voice input.
0.1667
negative(z), neutral(x), positive(c), quit(q)>z
Right now I'm taking a walk and using my AirPods for voice input.
1.1429
negative(z), neutral(x), positive(c), quit(q)>z
>>> get_ugoki("If you want to do it with audio, the system should repeat back with audio only and not look at the screen.")
(0.99963593, 'repeat back with audio only')
(0.9981591, 'repeat back')
(0.9901821, 'Should be in the system')
(0.95455027, 'would like to')
(0.9340714, 'do')
I noticed that the new method of extracting keywords that include verbs, if we simply connect them, the number of keywords extracted will naturally increase, and then all the places that require a certain number of keywords to be extracted will be triggered earlier than they are now. - Keyword appearance speed
done
I decided to play what I could on a rule basis, such as particles appearing on the boundaries.
This page is auto-translated from /nishio/動きの抽出作業メモ using DeepL. If you looks something interesting but the auto-translated English is not good enough to understand it, feel free to let me know at @nishio_en. I'm very happy to spread my thought to non-Japanese readers.