mahler-5's Bookmarks / April 12, 2015 - Hatena Bookmark

mahler-5 id:mahler-5

Bookmarks for April 12, 2015 (3 items)

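Mastering dplyr! Basics - Qiita
Introduction: It's April, and with new graduates arriving and members who have never used R joining the team, I'm going to write up how to use the extremely handy dplyr over several posts. These are aimed at people who don't know R but have some experience with SQL or other programming languages. I have also written installments of the "Mastering dplyr!" series beyond the basics, linked here: * Mastering dplyr! Window Functions * Mastering dplyr! JOINs. dplyr is a package specialized for manipulating data frames. R is generally not very fast, but dplyr is written in C++, so it runs quite quickly. It also makes source code more readable, so unless you are forbidden from using it for religious reasons, there is a lot to gain. Rough guide to the data sizes it can handle: this is only a gut feeling that depends heavily on my own environment, but around 10 million rows, about 100 MB of data…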
mahler-5 2015/04/12
Easy to understand

R

dplyr

Read later

RStudio
Jupyter Notebook Viewer
This notebook compares pandas and dplyr. The comparison is just on syntax (verbiage), not performance. Whether you're an R user looking to switch to pandas (or the other way around), I hope this guide will help ease the transition. We'll work through the introductory dplyr vignette to analyze some flight data. I'm working on a better layout to show the two packages side by side. But for now I'm jus
mahler-5 2015/04/12
python

R
7 Command-Line Tools for Data Science | Jeroen Janssens
Data science is OSEMN (pronounced as awesome). That is, it involves Obtaining, Scrubbing, Exploring, Modelling, and iNterpreting data. As a data scientist, I spend quite a bit of time on the command-line, especially when there’s data to be obtained, scrubbed, or explored. And I’m not alone in this. Recently, Greg Reda discussed how the classics (e.g., head, cut, grep, sed, and awk) can be used for
mahler-5 2015/04/12
json