docMatrix               package:RMeCab               R Documentation

_d_o_c_M_a_t_r_i_x

_D_e_s_c_r_i_p_t_i_o_n:

     Creates a document-term matrix from all text files in a given
     directory.

_U_s_a_g_e:

     docMatrix(mydir, pos = "Default", minFreq = 1, weight = "no", kigo = 0, co = 0)
     docVector(filename, pos, posN, minFreq, kigo)

_A_r_g_u_m_e_n_t_s:

filename: name of the text file (may include a path).

   mydir: path to the directory containing the text files.

     pos: specifies which parts of speech should be extracted. The
          default is nouns and adjectives.

    posN: specifies the number of parts of speech to be extracted.

 minFreq: words appearing fewer than 'minFreq' times within a document
          are ignored for that document.

  weight: specifies the weighting applied to the document-term matrix.
          The default, "no", leaves the frequencies unweighted.

    kigo: set kigo = 1 if the totals should include the number of
          symbols. The default is 0.

      co: requests a term co-occurrence matrix. The default is 0.
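
     As a hedged illustration of how the arguments above might be
     used, the sketch below writes two small files into a temporary
     directory and calls 'docMatrix()' on it with the defaults listed
     under Usage. The file names and contents are made up for the
     example, UTF-8 input is an assumption that may not suit every
     platform, and the call is skipped when RMeCab (and with it MeCab)
     is not installed.

```r
# Hypothetical input: two tiny Japanese text files in a fresh temporary directory
mydir <- tempfile("docmatrix_demo")
dir.create(mydir)
writeLines(enc2utf8("私は本を読む。"),
           file.path(mydir, "doc1.txt"), useBytes = TRUE)
writeLines(enc2utf8("彼は新しい本を買う。"),
           file.path(mydir, "doc2.txt"), useBytes = TRUE)

# Run only when the RMeCab package can be loaded
if (requireNamespace("RMeCab", quietly = TRUE)) {
  # All arguments other than 'mydir' keep the defaults shown under Usage
  m <- RMeCab::docMatrix(mydir)
  print(dim(m))
}
```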

_D_e_t_a_i_l_s:

     All text files in the specified directory are read in and combined
     into a single matrix. Each cell of the matrix holds the raw
     frequency of one word in one document.
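
     The structure of such a matrix can be sketched in base R without
     MeCab. The token lists below are invented stand-ins for the output
     of morphological analysis, and placing terms in rows and documents
     in columns is a choice made for this illustration only, not a
     statement about the layout 'docMatrix()' returns.

```r
# Hypothetical pre-tokenized documents (a real run would obtain tokens from MeCab)
docs <- list(
  doc1 = c("cat", "dog", "cat"),
  doc2 = c("dog", "bird")
)

# Vocabulary: every distinct term across all documents
terms <- sort(unique(unlist(docs)))

# Count each term per document; fixed factor levels keep zero counts
dtm <- vapply(
  docs,
  function(tokens) as.integer(table(factor(tokens, levels = terms))),
  integer(length(terms))
)
rownames(dtm) <- terms

print(dtm)
```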

     'docVector()' is a supporting function that creates a
     document-term frequency list.
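
     A per-document frequency list with a minimum-frequency cutoff can
     be approximated in base R as follows. This is only a sketch of the
     idea: the tokens are invented, 'min_freq' is a hypothetical
     stand-in for the 'minFreq' argument, and the code does not
     reproduce the internals of 'docVector()'.

```r
# Hypothetical tokens from a single document
tokens <- c("cat", "dog", "cat", "bird", "cat", "dog")
min_freq <- 2

# Frequency of every distinct token in the document
freq <- table(tokens)

# Drop tokens that appear fewer than min_freq times
freq_kept <- freq[freq >= min_freq]

print(freq_kept)
```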

_V_a_l_u_e:

docMatrix: the document-term matrix 

_A_u_t_h_o_r(_s):

     Motohiro ISHIDA <ishida.motohiro@gmail.com>

