On the unigram language model in SentencePiece
Taku Kudo. 2018. Subword regularization: Improving neural network translation models with multiple subword candidates. In Proc. of ACL. https://aclweb.org/anthology/P18-1007
Taku Kudo, John Richardson. 2018. SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing. arXiv:1808.06226 (submitted 19 Aug 2018). https://arxiv.org/abs/1808.06226