docNgramDF {RMeCab}R Documentation

docNgramDF

Description

docNgramDF returns returns data frame of N-gram from a file or all files in a given dataframe. Each word of N-gram makes one column.

Usage

  docNgramDF(mojiVec = "MeCab", type = 0, pos = "Default", baseform =0, minFreq = 1, N = 1, kigo = 0, weight = "no", co = 0 , dic = "", mecabrc = "", etc = "" )


Arguments

mojiVec column of dataframe
type Default being 0.
pos argument3. Default being noun and adjective.
baseform
minFreq words of a document appearing less than minDocFreq within that document will be ignored.
N N-gram. Default being 1
kigo if total must include number of symbols, set sym = 1. Default being 0
weight see weight.R
co to make co-occurence matrix.
dic to specify user dictionary, e.x. ishida.dic
mecabrc to specify mecab resource file
etc other options to mecab

Details

If necessary, more details than the description above

Value

returns a data frame.

Author(s)

Motohiro ISHIDA ishida.motohiro@gmail.com

References

石田基広『Rによるテキストマイニング入門』森北出版 2008

See Also

objects to See Also as help,


[Package RMeCab version 0.91 Index]