サクサク読めて、アプリ限定の機能も多数!
トップへ戻る
アメリカ大統領選
varianceexplained.org
Last week I shared a thought on Twitter: When you’ve written the same code 3 times, write a function When you’ve given the same in-person advice 3 times, write a blog post — David Robinson (@drob) November 9, 2017 Ironically, this tweet hints at a piece of advice I’ve given at least 3 dozen times, but haven’t yet written a post about. I’ve given this advice to almost every aspiring data scientist
A lot of data takes the form of these success/total counts, where you want to estimate a “proportion of success” for each instance. Each row might represent: An ad you’re running: Which of your ads have higher clickthrough rates, and which have lower? (Note that I’m not talking about running an A/B test comparing two options, but rather about ranking and analyzing a large list of choices.) A user
David Robinson Director of Data Scientist at Heap, works in R. Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront I don’t normally post about politics (I’m not particularly savvy about polling, which is where data science has had the largest impact on politics). But this weekend I saw a hypothesis about Donald Trump’s twitter account
David Robinson Director of Data Scientist at Heap, works in R. Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront At Stack Overflow we’ve always been committed to sharing data: all content contributed to the site is CC-BY-SA licensed, and we release regular “data dumps” of our entire history of questions and answers. I’m excited to a
David Robinson Director of Data Scientist at Heap, works in R. Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront One day in January 2013 I found myself wasting time on the internet. This wasn’t a good idea: I was as busy as anyone 2.5 years into their PhD. I had to finish a presentation on some yeast genetics research, I was months
Bar plot The most common way a pie chart can be improved is by turning it into a bar chart, with categories on the x axis and percentages on the y-axis. This doesn’t apply to all plots, but it does to this one. library(ggplot2) theme_set(theme_bw()) ggplot(d, aes(Hours, Percentage)) + geom_bar(stat = "identity") + facet_wrap(~Task) Note that much like the original pie chart, we “faceted” (divided
David Robinson Director of Data Scientist at Heap, works in R. Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront If you’ve read my blog, taken one of my classes, or sat next to me on an airplane, you probably know I’m a big fan of Hadley Wickham’s ggplot2 package, especially compared to base R plotting. Not everyone agrees. Among th
David Robinson Director of Data Scientist at Heap, works in R. Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront I recently came across this question on Cross Validated, and I thought it offered a great opportunity to use R and ggplot2 to explore, in depth, the assumptions underlying the k-means algorithm. The question, and my respo
このページを最初にブックマークしてみませんか?
『varianceexplained.org』の新着エントリーを見る
j次のブックマーク
k前のブックマーク
lあとで読む
eコメント一覧を開く
oページを開く