サクサク読めて、アプリ限定の機能も多数!
トップへ戻る
やろう!確定申告
varianceexplained.org
Last week I shared a thought on Twitter: When you’ve written the same code 3 times, write a function When you’ve given the same in-person advice 3 times, write a blog post — David Robinson (@drob) November 9, 2017 Ironically, this tweet hints at a piece of advice I’ve given at least 3 dozen times, but haven’t yet written a post about. I’ve given this advice to almost every aspiring data scientist
A lot of data takes the form of these success/total counts, where you want to estimate a “proportion of success” for each instance. Each row might represent: An ad you’re running: Which of your ads have higher clickthrough rates, and which have lower? (Note that I’m not talking about running an A/B test comparing two options, but rather about ranking and analyzing a large list of choices.) A user
David Robinson Director of Engineering at Contentsquare Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront I don’t normally post about politics (I’m not particularly savvy about polling, which is where data science has had the largest impact on politics). But this weekend I saw a hypothesis about Donald Trump’s twitter account that s
David Robinson Director of Engineering at Contentsquare Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront At Stack Overflow we’ve always been committed to sharing data: all content contributed to the site is CC-BY-SA licensed, and we release regular “data dumps” of our entire history of questions and answers. I’m excited to announce
David Robinson Director of Data Scientist at Heap, works in R. Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront One day in January 2013 I found myself wasting time on the internet. This wasn’t a good idea: I was as busy as anyone 2.5 years into their PhD. I had to finish a presentation on some yeast genetics research, I was months
Bar plot The most common way a pie chart can be improved is by turning it into a bar chart, with categories on the x axis and percentages on the y-axis. This doesn’t apply to all plots, but it does to this one. library(ggplot2) theme_set(theme_bw()) ggplot(d, aes(Hours, Percentage)) + geom_bar(stat = "identity") + facet_wrap(~Task) Note that much like the original pie chart, we “faceted” (divided
David Robinson Director of Data Scientist at Heap, works in R. Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront If you’ve read my blog, taken one of my classes, or sat next to me on an airplane, you probably know I’m a big fan of Hadley Wickham’s ggplot2 package, especially compared to base R plotting. Not everyone agrees. Among th
David Robinson Director of Engineering at Contentsquare Email Twitter Github Stack Overflow Subscribe Recommended R Bloggers RStudio Blog R4Stats Simply Statistics Upfront I recently came across this question on Cross Validated, and I thought it offered a great opportunity to use R and ggplot2 to explore, in depth, the assumptions underlying the k-means algorithm. The question, and my response, fo
このページを最初にブックマークしてみませんか?
『varianceexplained.org』の新着エントリーを見る
j次のブックマーク
k前のブックマーク
lあとで読む
eコメント一覧を開く
oページを開く