docNgram2 {RMeCab} | R Documentation
docNgram2
Returns a data frame of N-grams from a file or from all
files in a given directory. Each word of an N-gram occupies one column.
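The "one column per word" layout can be illustrated without MeCab. The following is only a sketch in base R: the token vector and the column names are hypothetical, and it shows the shape of such a table, not how docNgram2 tokenizes text or what other columns it returns.

```r
# Hypothetical, already-tokenized input; docNgram2 itself obtains its units via MeCab.
tokens <- c("text", "mining", "with", "R")
N <- 2

# embed() returns rows of N consecutive indices in reverse order, so flip the columns.
idx <- embed(seq_along(tokens), N)[, N:1, drop = FALSE]

# One row per N-gram, one column per word of the N-gram (column names are illustrative).
ngrams <- as.data.frame(matrix(tokens[idx], ncol = N), stringsAsFactors = FALSE)
names(ngrams) <- paste0("word", seq_len(N))
print(ngrams)
```

With N = 2 each row holds one bigram; raising N adds one column per extra word.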
docNgram2(directory, type = 0, pos = "Default", minFreq = 1, N = 2, kigo = 0, weight = "no", dic = "", mecabrc = "", etc = "")
directory | Directory path or a filename (may include a path).
type | Default is 0.
pos | Parts of speech to extract. Default is nouns and adjectives.
minFreq | Words appearing fewer than minFreq times within a document are ignored. Default is 1.
N | Size of the N-gram. Default is 2.
kigo | Set kigo = 1 if the total must include the number of symbols. Default is 0.
weight | Weighting to apply. Default is "no".
dic | User dictionary to use, e.g. ishida.dic.
mecabrc | MeCab resource file to use.
etc | Other options passed to MeCab.
Returns a data frame.
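A minimal usage sketch follows, assuming RMeCab and a working MeCab installation are available. The sample sentence and the temporary file are made up for illustration; the argument names come from the usage line, and apart from being spelled out they keep their documented defaults.

```r
# Write a small sample text to a temporary file (the sentence is invented for illustration).
sample_file <- tempfile(fileext = ".txt")
con <- file(sample_file, open = "w", encoding = "UTF-8")
writeLines("今日は良い天気です。明日も良い天気でしょう。", con)
close(con)

# Only call docNgram2 when the RMeCab package can be loaded.
if (requireNamespace("RMeCab", quietly = TRUE)) {
  # Argument names are taken from the usage line; the values are the documented defaults.
  res <- RMeCab::docNgram2(sample_file, type = 0, N = 2, minFreq = 1)
  print(class(res))  # the help page states that a data frame is returned
  print(head(res))
} else {
  message("RMeCab is not installed; skipping the docNgram2 call.")
}
```

Passing a directory path instead of a single filename should process every file in that directory, as described for the directory argument.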
Motohiro ISHIDA ishida.motohiro@gmail.com
Motohiro Ishida (2008). Introduction to Text Mining with R (Rによるテキストマイニング入門). Morikita Publishing.