Ching-Chuan Chen's Blogger

Statistics, Machine Learning and Programming


This is a post I originally wrote on PTT, introducing dplyr in detail. The main text follows.

This post focuses on dplyr.

Read more »

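This is a post I originally wrote on PTT, introducing data.table in detail. The main text follows.

data.table contains a lot of functionality,

but much of it can be replaced by functions from plyr and dplyr,

so there are many data.table functions that I am not very familiar with.

Here is a brief introduction to data.table.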

Read more »

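This is a post I originally wrote on PTT, introducing magrittr in detail. The main text follows.

At andrew's suggestion, and since I happen to have some free time, I am writing a series of posts on data-wrangling packages.

Quite a few people on this board know these packages better than I do, so if anything is missing, please help fill it in.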

Read more »

I had been searching for a way to use Rcpp functions in snowfall or doSNOW for a long time, but could not find a solution. I recently came up with an idea to try. Since the error is raised when exporting the Rcpp function to the nodes, I compile the Rcpp function on the nodes instead. Surprisingly, it worked.

Read more »

A simple log of running a MapReduce job in Python.

We implement word count using Hadoop streaming. Create two new Python script files, named mapper.py and reducer.py, respectively.
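As a rough idea of what the two scripts do, here is a minimal sketch of the word-count logic, not necessarily the code used in the full post. The function names `map_lines`, `reduce_lines`, and `run_local` are hypothetical, and the sketch assumes tab-separated `word<TAB>count` records and reducer input sorted by key, which is how Hadoop streaming normally delivers it.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper logic: emit one 'word<TAB>1' record per whitespace-separated word."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer logic: sum the counts of each word.

    Assumes the records arrive sorted by word, so equal keys are adjacent.
    """
    # Skip any line without a tab so malformed records do not break the split.
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines if "\t" in line)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def run_local(lines):
    """Simulate map -> sort (shuffle) -> reduce in one process for quick checks."""
    return list(reduce_lines(sorted(map_lines(lines))))
```

In this sketch, mapper.py would print each record from `map_lines(sys.stdin)` and reducer.py each record from `reduce_lines(sys.stdin)`. `run_local` only imitates the sort step between them so the logic can be checked without a cluster.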

Read more »