Blogs I Follow

Recent Posts
 ChangRae Lee’s On Such a Full Sea is a readable dystopia that really pushes the reader to think hard
 Enjoyed Cynthia Ozick’s story “The Coast of New Zealand” in The New Yorker
 Boneland by Alan Garner
 Encadrement du responsable du centre multimédia de Houndé (CMH) sur les techniques de rédaction des livres pour enfants
 Funny in Farsi by Firoozeh Dumas
Archives
Categories
Friends of African Village Libraries (I post regularly here)
 An error has occurred; the feed is probably down. Try again later.
Category Archives: R statistics
Stata tip: Doing something conditional on existence of a variable in the dataset, using a local
In a program I am writing I want the same utility file to be run on different datasets and certain variables in the dataset. But the datasets have different sets of variables. So a nice Stata tip, adapted for my … Continue reading
Posted in R statistics
Comments Off on Stata tip: Doing something conditional on existence of a variable in the dataset, using a local
What students learning R in Econ 4142 can aspire to…
The ideal job for every Econ major! You can follow Levin on Twitter here. Matt Levin is the data dude for CALmatters. His work entails distilling complex policy topics into easily digestible charts and graphs, finding and writing original stories … Continue reading
Posted in R statistics
Comments Off on What students learning R in Econ 4142 can aspire to…
Is there a good way to download the coordinates of a path or route on google map into a file?
The folks at stackoverflow just dismissed this question. But is an honest and useful question to have answered, for some applications. And here is a good ttip from fentonsrule: This is something I learned on my own. If you create … Continue reading
Posted in R statistics
Comments Off on Is there a good way to download the coordinates of a path or route on google map into a file?
The perfect graph for every undergraduate statistics class, from Financial Times
I object to usage of “trend” to indicate “relationship” (for me “trend” implies over time) but that is just a quibble. Perfect illustration of importance of visualizing data. PS. Doug Campbell reminded me of importance of a title for the … Continue reading
Posted in Development thinking, R statistics
Comments Off on The perfect graph for every undergraduate statistics class, from Financial Times
Reading in PDF files into R to do text analysis
Turns out Clay Ford at the University of Virginia wrote a nice tutorial for this, and a package does the trick very nicely. I tested the “update” at the bottom of the post which shows how to use the pdftools … Continue reading
Posted in R statistics
Comments Off on Reading in PDF files into R to do text analysis
Stata gets a markdown package… will try this in coming weeks
Rigorous documentation of the analysis plan, procedure, and computer codes enhances the comprehensibility and transparency of data analysis. Documentation is particularly critical when the codes and data are meant to be publicly shared and examined by the scientific community to … Continue reading
Posted in R statistics
Comments Off on Stata gets a markdown package… will try this in coming weeks
Noncitizens voting? Wonderful straightforward analysis from Ansolabehere, Luks, and Schaffner
Stepping back from the immediate question of whether the CCES in fact shows a low rate of voting among noncitizens, our analysis carries a much broader lesson and caution about the analysis of big databases to study low frequency characteristics … Continue reading
Posted in R statistics
Comments Off on Noncitizens voting? Wonderful straightforward analysis from Ansolabehere, Luks, and Schaffner
Why use R instead of Excel or SPSS, for quantitative archaeology or any social science for that matter!
For a long time archaeologists had few options to deal with these problems because there were few alternative programs. The general alternative to using a pointandclick program is writing scripts to program algorithms for statistical analysis and visualisations. Writing scripts … Continue reading
Posted in R statistics
Comments Off on Why use R instead of Excel or SPSS, for quantitative archaeology or any social science for that matter!
Something I need to remember…
reading a 100M csv file into R, read.csv takes 61s, and with read_csv in readr just 3s. That’s amazing. #rstats @hadleywickham a great job via Hadley Wickham (@hadleywickham)  Twitter.
Posted in R statistics
Comments Off on Something I need to remember…
Microsoft to acquire Revolution Analytics
I’m very pleased to announce that Microsoft has reached an agreement to acquire Revolution Analytics. Revolution Analytics is the leading commercial provider of software and services for R, the world’s most widely used programming language for statistical computing and predictive … Continue reading
Posted in R statistics
Comments Off on Microsoft to acquire Revolution Analytics
Using quotes in local and global macros in Stata
For some reason I spent a lot of time yesterday doing this: turn a list of variables into a list with doublequotes for each variable, then separated by commas. I confess I still do not really understand the logic of … Continue reading
Posted in R statistics
Comments Off on Using quotes in local and global macros in Stata
Coding Stata and listening to The Format, On the Porch, over and over…
Posted in R statistics
Comments Off on Coding Stata and listening to The Format, On the Porch, over and over…