RCaBoChaMx {RCaBoCha} | R Documentation |
Creates a document-term matrix from a single file or from all text files in a given directory.
RCaBoChaMx(directory, str2 = "", rmT = "KIGO", minFreq = 1, weight = "no")
directory |
directory path or a filename (may include path). |
str2 |
a Japanese word |
rmT |
specifies which parts of speech should be removed. |
minFreq |
words appearing fewer than minFreq times within a document are ignored for that document. |
weight |
weighting option for the document-term matrix (default "no"). |
All text files in the specified directory are read and combined into a single matrix. Each cell of the matrix holds the raw frequency of a word in a document.
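To make the shape of the result concrete, here is a minimal base-R sketch of how such a frequency matrix can be assembled. It is not the package's implementation: `toy_dtm` and the sample tokens are hypothetical, the documents are assumed to be already tokenized (the real function parses Japanese text with CaBoCha), and both the orientation (terms in rows, documents in columns) and the way `minFreq` is applied are assumptions made for illustration.

```r
# Hypothetical, pre-tokenized documents standing in for parsed text files.
docs <- list(
  doc1 = c("neko", "inu", "neko"),
  doc2 = c("inu", "tori", "inu", "inu")
)

# Hypothetical helper (not part of RCaBoCha): builds a term-by-document
# count matrix and applies a per-document minimum frequency.
toy_dtm <- function(docs, minFreq = 1) {
  terms <- sort(unique(unlist(docs)))
  # Count every term in every document; factor() keeps zero counts
  # for terms that a document does not contain.
  counts <- vapply(
    docs,
    function(tokens) as.numeric(table(factor(tokens, levels = terms))),
    numeric(length(terms))
  )
  m <- matrix(counts, nrow = length(terms),
              dimnames = list(terms, names(docs)))
  # Assumed reading of minFreq: counts below the threshold are zeroed,
  # and terms left with no counts at all are dropped.
  m[m < minFreq] <- 0
  m[rowSums(m) > 0, , drop = FALSE]
}

toy_dtm(docs)               # raw frequencies
toy_dtm(docs, minFreq = 2)  # drops words seen only once per document
```

With the default `minFreq = 1` nothing is filtered, so every cell is simply the number of times the word occurs in that document.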
RCaBoChaMx |
the document-term matrix |
Motohiro ISHIDA ishida.motohiro@gmail.com