日録2010年12月

Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers. Your borwser is not supporting object tag. Please use one of the latest browsers.

_ 2010 年 12 月 31 日（金）晴

07:30 起床
- 外は雪が積もっている．寒い
06:50 研究室

午前
- 大掃除
- stepAICでの変数選択で，複数のタームを指定して，これをモデル残す方法についての質問をみかけたのだが，コメントする暇がなかった．

Month と Day を単なる乱数にする
airquality$Month <- rnorm(153)
airquality$Day <- rnorm(153)

x <- lm( Ozone ~ Solar.R +Wind +Temp +Month +Day,
  data = airquality )

# 単なる乱数にかえた Month と Day は当然，除かれる
stepAIC(x)

# この二つを残す
stepAIC(x, scope = list(lower = ~ Month + Day))

午後 Revolutions blog: 2010 statistics

↑

_ 2010 年 12 月 30 日（木）曇晴

05:30 起床
- 寒い
06:50 研究室

午前
- C&R 社の仕事を進める
- 午後
- C&R 社の仕事を進める
- ちょっとメモ

x <- 1
if(x) print ("X")
else print("Z")
エラー：   予想外の 'else' です  ( " else" の) 
x <- 1
{ if(x) print ("X")
  else print("Z")
}
[1] "X"

↑

_ 2010 年 12 月 29 日（水）曇晴

05:30 起床
- 寒い
06:50 研究室

午前
- C&R 社の仕事を進める
- The Complete Joy of Stats
- 午後
- C&R 社の仕事を進める

↑

_ 2010 年 12 月 28 日（火）晴

05:30 起床
- 寒い
06:50 研究室

午前
- C&R 社の仕事を進める
- 午後
- C&R 社の仕事を進める

↑

_ 2010 年 12 月 27 日（月）晴

06:00 起床
07:50 研究室

午前
- C&R 社の仕事を進める
- The Popularity of Data Analysis Software

午後

ちょっとメモ

>  x <- 1:3
> # ラベルを加えない
> cbind(x, 4:6, deparse.level = 0)
     [,1] [,2]
[1,]    1    4
[2,]    2    5
[3,]    3    6
> # シンボルとして意味があればラベル化する（デフォルト）
> cbind(x, 4:6, deparse.level = 1)
     x  
[1,] 1 4
[2,] 2 5
[3,] 3 6
> # シンボル以外（定数など）でもラベル化する
> cbind(x, 4:6, deparse.level = 2)
     x 4:6
[1,] 1   4
[2,] 2   5
[3,] 3   6
>

↑

_ 2010 年 12 月 26 日（日）晴

06:00 起床
07:50 研究室

午前
- C&R 社の仕事を進める

↑

_ 2010 年 12 月 25 日（土）晴

06:50 起床
08:50 研究室

午前
- C&R 社の仕事を進める
- Merry Christmas from Revolution Analytics!

↑

_ 2010 年 12 月 24 日（金）晴

04:20 起床
05:50 研究室

午前
- C&R 社の仕事を進める
午後．来客と食事
- C&R 社の仕事を進める

↑

_ 2010 年 12 月 23 日（木）晴

04:20 起床
05:50 研究室

午前
- C&R 社の仕事を進める
- Why the other line moves faster

午後
- Forbes: R is a name you need to know in 2011

↑

_ 2010 年 12 月 22 日（水）晴

04:50 起床
06:40 研究室

午前
- C&R 社の仕事を進める
- How Orbitz uses Hadoop and R to optimize hotel search
午後
- C&R 社の仕事を進める

↑

_ 2010 年 12 月 21 日（火）雨

05:30 起床
07:00 研究室

午前
- C&R 社の仕事を進める
午後
- 言語オブジェクトのあたりが，うまく項目分けできないでいる
- Hadley’s tutorials on R Visualization
- A Taxonomy of Data Science
- 充電

↑

_ 2010 年 12 月 20 日（月）晴

06:00 起床
07:30 研究室

午前
- C&R 社の仕事を進める
午後
- C&R 社の仕事を進める

↑

_ 2010 年 12 月 19 日（日）晴

06:00 起床
07:30 研究室

午前
- C&R 社の仕事を進める
午後
- Machine Learning: A Love Story

↑

_ 2010 年 12 月 18 日（土）晴

07:00 起床
08:50 研究室

午前
- C&R 社の仕事を進める
- RMeCab _0.93を用意した

↑

_ 2010 年 12 月 17 日（金）晴

04:30 起床
06:00 研究室

午前
- R 2.12.1 is out
午後
- C&R 社の仕事にとりかかる

↑

_ 2010 年 12 月 16 日（木）晴

04:30 起床
06:00 研究室

午前
- googleVis is an R package providing an interface between R and the Google Visualisation API を入れてみたけど，いろいろ大げさだ
午後
- の下訳見直しを終える
- C&R 社の仕事にとりかかる

↑

_ 2010 年 12 月 15 日（水）晴

04:30 起床
05:40 研究室

午前
- 08:40 - 10:10 演習
- の下訳見直し．
午後
- Facebook's Social Network Graph with R and Hive
- 14:40 - 16:05 演習

↑

_ 2010 年 12 月 14 日（火）雨/晴

03:50 起床
05:20 研究室

午前
- の下訳見直し．
午後
- Machine Learning and Data Mining with R

↑

_ 2010 年 12 月 13 日（月）雨

06:30 起床
07:50 研究室

午前
- の下訳見直し．
午後
- Murrelの R Graphics の画像ページをちょっと確認．

↑

_ 2010 年 12 月 12 日（日）晴

06:30 起床
07:50 研究室

午前
- の下訳が完了したので見直し．
午後 Deducer: R and ggplot2 GUI

↑

_ 2010 年 12 月 11 日（土）晴

06:30 起床
07:50 研究室

午前
- An R interface to the Google Prediction APIっていうのが便利そう
午後
- すごすぎる．GNU Emacs tablet pc touchscreen gesture demo on Ubuntu 10.10

の翻訳作業
の下訳が完了した．

↑

_ 2010 年 12 月 10 日（金）晴

05:00 起床
06:20 研究室

午前
- の翻訳作業
- RColorBrewer っての Neuwirth が作者だったのか

↑

_ 2010 年 12 月 09 日（木）晴

05:00 起床
06:20 研究室

午前
- の翻訳作業
午後
- 12:50-14:25 演習
- 15:00-18:30 会議
  - 長すぎる．．．
- 充電

↑

_ 2010 年 12 月 08 日（水）晴

04:50 起床
06:30 研究室

午前
- 08:40-10:10 演習
午後
- 14:35-16:00 講義

↑

_ 2010 年 12 月 07 日（火）雨・曇

04:50 起床
06:30 研究室

午前
- 自宅のノートパソコンのUbuntuを再構築したのだが，その際，RMeCab の返値がおかしかったので，R-2.12.0 対策が足りていないのか，と考え，研究室の Ubuntu で試したところ，何の異常もない．そこで気が付いた．昨日，MeCab? を導入する際に，コンパイルオプションでの文字コード指定を忘れていたのであった，

$ ./configure --with-charset=utf8

『R初心者のためのABC』翻訳用のコードを見直し，ここに公開した

午後
- の翻訳作業を進めているのだが，こんな一節がある．

There are two built-in themes. The default, \f{theme_gray}, uses a very light grey background with white gridlines. ... We can still see the gridlines to aid in the judgement of position ..., but they have little visual impact and we can easily ``tune'' them out. The grey background gives the plot a similar colour (in a typographical sense) to the remainder of the text, ensuring that the graphics fit in with the flow of a text without jumping out with a bright white background. Finally, the grey background creates a continuous field of colour which ensures that the plot is perceived as a single visual entity.

実は，ggplot2 のこのグラフも，最初はなじめないテーマ設定だなぁ，と感じていたが，慣れてきた今では，上の説明もなるほどと思う．

↑

_ 2010 年 12 月 06 日（月）晴

07:50 起床
09:00 研究室

午前
- の翻訳作業
午後

↑

_ 2010 年 12 月 05 日（日）晴

自宅での翻訳作業

↑

_ 2010 年 12 月 04 日（土）晴

06:10 起床
07:30 研究室

午前
- The Joy of Stats

午後
- の翻訳作業

↑

_ 2010 年 12 月 03 日（金）晴

06:10 起床
風が強い
07:30 研究室

午前
- の翻訳作業
- C&R社向け原稿
午後．来客と食事
- ヤマダ電機で Pocket Wifi を検討
- の翻訳作業

↑

_ 2010 年 12 月 02 日（木）晴

06:10 起床
07:30 研究室

午前
- の翻訳作業
午後
- 12:50-14:25 演習
- の翻訳作業
- 16:30 会議

↑

_ 2010 年 12 月 01 日（水）晴

06:10 起床
07:30 研究室

午前
- 08:40-10:10 演習
- Rでグラフを作成する際，色覚に問題がある人にも優しい色を選ぶにはdichromatパッケージを使います．
午後
- 14:35-16:05
- の翻訳作業
- RMeCabのdocMatrix2の出力を頻度順に並び替えるコードを泥縄で書いたが，もう少し，ましな方法があるわな

res2 <- docMatrix2("writers", pos = c("名詞","形容詞"),  minFreq = 5)

#　各行（ターム）の頻度合計を列に追加したデータフレームを作成する
#  あるいはオブジェクトを上書きしてしまう．
#　ついでに降順に並び替える

z <- data.frame(term = rownames(res2)[ order(rowSums(res2), decreasing= T ) ] ,
                   sums = rowSums(res2)[order(rowSums(res2), decreasing  = T )] )
# 頻度別の単語数は
table(z$sums)
# 頻度が100を超えるタームを抽出
z[z$sums > 100,]
# Termが"の"の行を確認
res[rownames(res)  == "の", ]

RcppGSL extends Rcpp in the direction of the GNU Scientific Library
Wubiのgrub2が逝かれたのを修復しようとしてVistaのMBRを破壊してしまった．ここからVistaのBootISOを取得して起動．システム回復コンソールから

Bootrec /FixMbr

で復活させた．

添付ファイル:

plot.jpg 1441件 [詳細]

Link: RMeCab(897d) 和布蕪他インストール(5132d) 過去の日録(5192d)

Last-modified: 2011-01-01 (土) 08:55:24 (5313d)

アールメカブ