polyglot.tokenize package
Subpackages
Submodules
polyglot.tokenize.base module
Basic text segmenters.
class polyglot.tokenize.base.SentenceTokenizer(locale='en')
Bases: polyglot.tokenize.base.Breaker
Segment text into sentences.
class polyglot.tokenize.base.WordTokenizer(locale='en')
Bases: polyglot.tokenize.base.Breaker
Segment text into words or tokens.
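The two classes listed for this module share a base class and take a single locale argument. As a rough illustration of that design, the following is a stand-in sketch in plain Python. It uses regular expressions and is not polyglot's implementation; every name in it (RegexBreaker, RegexSentenceTokenizer, RegexWordTokenizer, segment) is hypothetical and chosen only for this example, and the locale is stored but not used to vary the rules.

```python
import re


class RegexBreaker:
    """Hypothetical stand-in for a breaker base class (not polyglot's Breaker)."""

    # Subclasses set this to a compiled pattern that matches one segment.
    pattern = None

    def __init__(self, locale="en"):
        # Kept to mirror the documented signature; this sketch applies
        # the same rules regardless of locale.
        self.locale = locale

    def segment(self, text):
        """Return the non-empty matches of `pattern` in `text`, whitespace-stripped."""
        pieces = (m.group().strip() for m in self.pattern.finditer(text))
        return [p for p in pieces if p]


class RegexSentenceTokenizer(RegexBreaker):
    # A sentence: a run of non-terminator characters plus any trailing . ! ?
    pattern = re.compile(r"[^.!?]+[.!?]*")


class RegexWordTokenizer(RegexBreaker):
    # A token: a run of word characters, or one punctuation character.
    pattern = re.compile(r"\w+|[^\w\s]")


text = "Hello world. How are you?"
sentences = RegexSentenceTokenizer(locale="en").segment(text)
words = RegexWordTokenizer(locale="en").segment(text)
print(sentences)
print(words)
```

Putting the segmentation rule in the subclass and the shared logic in the base keeps each tokenizer to a one-line definition. A real locale-aware segmenter would need far more than these two patterns, for example to handle abbreviations or scripts that do not separate words with spaces.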
Module contents
class polyglot.tokenize.WordTokenizer(locale='en')
Bases: polyglot.tokenize.base.Breaker
Segment text into words or tokens.
class polyglot.tokenize.SentenceTokenizer(locale='en')
Bases: polyglot.tokenize.base.Breaker
Segment text into sentences.