docNgram2 {RMeCab}    R Documentation

docNgram2

Description

docNgram2 returns a data frame of N-grams extracted from a single file or from all files in a given directory. Each word of an N-gram occupies its own column.

Usage

  docNgram2(directory, type = 0, pos = "Default", minFreq = 1, N = 2, kigo = 0, weight = "no", dic = "", mecabrc = "", etc = "")


Arguments

directory

directory path or a filename (may include path).

type

Defaults to 0.

pos

parts of speech to extract. Defaults to nouns and adjectives.

minFreq

words that appear fewer than minFreq times within a document are ignored for that document. Defaults to 1.

N

the N of the N-gram, i.e. the number of words per N-gram. Defaults to 2.

kigo

if the total must include the number of symbols, set kigo = 1. Defaults to 0.

weight

weighting option. Defaults to "no".

dic

to specify a user dictionary, e.g. ishida.dic

mecabrc

to specify a MeCab resource file

etc

other options to mecab
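
As a rough illustration of how the arguments above fit together, the following sketch calls docNgram2 on a single temporary file. It assumes that the RMeCab package and a working MeCab installation are available; the file and its contents are made up for this example, and the call is wrapped in try() because it cannot succeed without MeCab.

```r
# Hypothetical input file, created only for this sketch.
tmp <- tempfile(fileext = ".txt")
writeLines("今日は良い天気です。明日も良い天気でしょう。", tmp)

res <- NULL
if (requireNamespace("RMeCab", quietly = TRUE)) {
  # Argument names follow the Usage line; all other arguments keep their defaults.
  res <- try(RMeCab::docNgram2(tmp, minFreq = 1, N = 2), silent = TRUE)
}

# Show the first rows only if the call actually produced a data frame.
if (is.data.frame(res)) print(head(res))
```

Passing a directory path instead of a file name should process every file in that directory, as stated for the directory argument.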


Value

A data frame of N-grams, with each word of an N-gram in its own column.
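
To make the one-column-per-word layout concrete, here is a minimal base R sketch that builds such a data frame from an already tokenised word vector. It does not use RMeCab or MeCab; the input words and the column names Ngram1, Ngram2 and Freq are assumptions chosen for illustration and may differ from what docNgram2 actually returns.

```r
# Hypothetical, pre-tokenised input; docNgram2 itself tokenises text with MeCab.
words <- c("the", "cat", "sat", "on", "the", "cat")
N <- 2

# Column k holds the k-th word of every N-gram (the word vector shifted by k - 1).
grams <- sapply(seq_len(N), function(k) words[k:(length(words) - N + k)])
df <- as.data.frame(grams, stringsAsFactors = FALSE)
names(df) <- paste0("Ngram", seq_len(N))

# Count how often each distinct N-gram occurs.
counts <- aggregate(list(Freq = rep(1L, nrow(df))), by = df, FUN = sum)
print(counts)
```

Keeping each word in a separate column makes it straightforward to filter or join on a single position of the N-gram, for example all bigrams whose first word is a given term.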

Author(s)

Motohiro ISHIDA <ishida.motohiro@gmail.com>

References

Ishida, Motohiro (2008). Rによるテキストマイニング入門 [Introduction to Text Mining with R]. Morikita Publishing.



[Package RMeCab version 0.97 Index]