docNgramDF {RMeCab}    R Documentation

docNgramDF

Description

docNgramDF returns a data frame of N-grams extracted from a column of a given data frame. Each word of an N-gram makes one column.

Usage

  docNgramDF(mojiVec = "MeCab", type = 0, pos = "Default", baseform = 0,
             minFreq = 1, N = 1, kigo = 0, weight = "no", co = 0,
             dic = "", mecabrc = "", etc = "")
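
The call below is a minimal sketch of how the usage line might be applied, not an official example. The data frame `dat`, its column `opinion`, and the sample sentences are hypothetical; it assumes that both the RMeCab package and a working MeCab installation are available, and it skips the analysis otherwise.

```r
# Hypothetical input: a data frame with one free-text column.
dat <- data.frame(
  id = 1:2,
  opinion = c("今日は良い天気です。", "明日は雨が降るでしょう。"),
  stringsAsFactors = FALSE
)

res <- NULL
if (requireNamespace("RMeCab", quietly = TRUE)) {
  # Pass the text column as mojiVec; N = 2 requests bigrams.
  # All other arguments keep the defaults shown in the usage line.
  res <- tryCatch(
    RMeCab::docNgramDF(dat[, "opinion"], N = 2),
    error = function(e) NULL  # e.g. MeCab itself is not installed
  )
}

# Inspect the structure of the returned data frame, if any.
if (!is.null(res)) str(res)
```

Calling `str()` on the result is a quick way to see how rows and columns are laid out before further processing.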


Arguments

mojiVec

a column of a data frame.

type

Default being 0.

pos

parts of speech to extract. Default being noun and adjective.

baseform

Default being 0.

minFreq

words of a document appearing less than minFreq times within that document will be ignored. Default being 1.

N

N-gram. Default being 1

kigo

if the total must include the number of symbols, set kigo = 1. Default being 0.

weight

see weight.R

co

to make a co-occurrence matrix. Default being 0.

dic

to specify a user dictionary, e.g. ishida.dic

mecabrc

to specify mecab resource file

etc

other options to mecab
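
To make the meaning of N concrete, here is a small base R sketch that builds N-grams from an already tokenized vector. The helper `make_ngrams` and the `"-"` separator are hypothetical and are not part of RMeCab; the sketch only illustrates the sliding-window idea and says nothing about how docNgramDF labels its output.

```r
# Hypothetical helper, not part of RMeCab: slide a window of width n
# over a token vector and join each window into one string.
make_ngrams <- function(tokens, n = 1) {
  if (length(tokens) < n) return(character(0))
  starts <- seq_len(length(tokens) - n + 1)
  vapply(
    starts,
    function(i) paste(tokens[i:(i + n - 1)], collapse = "-"),
    character(1)
  )
}

tokens <- c("text", "mining", "with", "R")
unigrams <- make_ngrams(tokens, n = 1)  # one entry per token
bigrams  <- make_ngrams(tokens, n = 2)  # one entry per adjacent pair
print(bigrams)
```

With N = 1 every token stands alone; with N = 2 each adjacent pair of tokens forms one unit, so a sequence of four tokens yields three bigrams.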

Value

returns a data frame.

Author(s)

Motohiro ISHIDA <ishida.motohiro@gmail.com>

References

Motohiro Ishida, 『Rによるテキストマイニング入門』 (Introduction to Text Mining with R), Morikita Publishing, 2008.


[Package RMeCab version 0.97 Index]