rOpenSci: The tokenizers package
Overview. This R package offers functions with a consistent interface to convert natural language text into tokens. It includes tokenizers for shingled n-grams, skip n-grams, words, word stems, sentences, paragraphs, characters, shingled characters, lines, tweets, Penn Treebank, and regular expressions, as well as functions for counting characters, words, and sentences, and a function for ...
DA: 29 PA: 47 MOZ Rank: 45 Up or Down: Up