In case you missed it: December 2017 roundup

In case you missed them, here are some articles from December of particular interest to R users. Hadley Wickham’s Shiny app for making eggnog. Using R to analyze the vocal range of pop singers. A video tour of the data.table package from its creator, Matt Dowle. The European R Users Meeting (eRum) will be held in Budapest, May 14-18. Winners of the ASA Police Data Challenge student visualization contest. An introduction to seplyr, a re-skinning of the dplyr package to a standard R evaluation interface. How to run R in the Windows Subsystem for Linux, along with the rest of the Linux ecosystem. A chart of Bechdel scores, showing representation of women in movies over time. The British Ecological Society’s Guide to Reproducible Science advocates the use of R and Rmarkdown. Eight modules from the Microsoft AI School cover Microsoft R…
Original Post: In case you missed it: December 2017 roundup

In case you missed it: November 2017 roundup

In case you missed them, here are some articles from November of particular interest to R users. R 3.4.3 “Kite Eating Tree” has been released. Several approaches for generating a “Secret Santa” list with R. The “RevoScaleR” package from Microsoft R Server has now been ported to Python. The call for papers for the R/Finance 2018 conference in Chicago is now open. Give thanks to the volunteers behind R. Advice for R user groups from the organizer of R-Ladies Chicago. Use containers to build R clusters for parallel workloads in Azure with the doAzureParallel package. A collection of R scripts for interesting visualizations that fit into a 280-character Tweet. R is featured in a StackOverflow case study at the Microsoft Connect conference. The City of Chicago uses R to forecast water quality and issue beach safety alerts. A collection of…
Original Post: In case you missed it: November 2017 roundup

In case you missed it: October 2017 roundup

In case you missed them, here are some articles from October of particular interest to R users. A recent survey of competitors on the Kaggle platform reveals that Python (76%) and R (59%) are the preferred tools for building predictive models. Microsoft’s “Team Data Science Process” has been updated with new guidelines on use of the IDEAR framework for R and Python. Microsoft R Open 3.4.2 is now available for Windows, Mac and Linux. Using the foreach package to estimate bias of rpart trees via bootstrapping. Replays of webinars on the Azure Data Science VM, and on document collection analysis with Azure ML Workbench, are now available. The “officer” package makes it possible to create PowerPoint and Word documents from R, and even include editable R charts. An online book on statistical machine learning with the MicrosoftML package. An updated…
Original Post: In case you missed it: October 2017 roundup

In case you missed it: September 2017 roundup

In case you missed them, here are some articles from September of particular interest to R users. The mathpix package converts images of hand-drawn equations to their LaTeX equivalent. R 3.4.2 is released.  Applying image featurization to the problem of classifying wood knots in lumber. Microsoft ML Server 9.2, which provides operationalization for R, is released. A roundup of news stories related to R from the Ignite conference. A tutorial on launching a Spark cluster with R on HDInsight. A map of pirate attacks, and an ethics lesson on scraping data. A preview of ALTREP, the new vector implementation that promises to bring performance improvements to the R engine. The USGS used R to visualize the rainfall from hurricanes Irma and Harvey. A preview of the EARL London conference, and highlights of some R applications presented there. Microsoft R Open 3.4.1, featuring…
Original Post: In case you missed it: September 2017 roundup

In case you missed it: August 2017 roundup

In case you missed them, here are some articles from August of particular interest to R users. Using the featurizeText function in the MicrosoftML package to extract ngrams from unstructured text. A joyplot visualizes the probabilities associated with phrases like “highly likely” and “little chance” by a sample of 46 Redditors. Two examples of creating 3-D animations in R: a stereo cube, and the Charleston in motion capture data. A tutorial on creating thematic maps in R, from ComputerWorld.  Some tips on using R to query data in Power BI. Using the Rcpp package to calculate a membership matrix for fuzzy k-means clustering. A reimagining of Minard’s chart of Napoleon’s march on Russia. Rankings of gender roles for men and women in film, from an analysis of scripts with the tidytext package. Several talks at the upcoming Ignite conference feature…
Original Post: In case you missed it: August 2017 roundup

In case you missed it: July 2017 roundup

In case you missed them, here are some articles from July of particular interest to R users. A tutorial on using the rsparkling package to apply H20’s algorithms to data in HDInsight. Several exercises to learn parallel programming with the foreach package. A presentation on the R6 class system, by Winston Chang. Introducing “joyplots”, a ggplot2 extension for visualizing multiple time series or distributions (with a nod to Joy Division). SQL Server 2017, with many new R-related capabilities, is nearing release. Ali Zaidi on using neural embeddings with R and Spark to analyze Github comments. R ranks #6 in the 2017 IEEE Spectrum Top Programming Languages. Course materials on “Data Analysis for the Life Sciences”, from Rafael Irizarry. How to securely store API keys in R scripts with the “secret” package. An in-depth tutorial on implementing neural network algorithms in…
Original Post: In case you missed it: July 2017 roundup

In case you missed it: June 2017 roundup

In case you missed them, here are some articles from June of particular interest to R users. R 3.4.1 “Single Candle” has been released. The Scientific Computing Coordinator at the FDA explains how R is used at the FDA and by sponsors for clinical trial submissions. Several useful tips related to including images in Rmarkdown documents. A review of one of R’s best features — its community. It’s now possible to include interactive R visualizations in Power BI reports (like graphics created with plotly or htmlwidgets). The Azure Data Science Virtual Machine for Windows now supports GPU-based computations with Microsoft R, Tensorflow, and other included software. The 2017 Burtch Works survey of data science software popularity shows R leading, Python gaining, and SAS declining. A video presentation by Ali Zaidi on using the sparklyr package with Microsoft R Server. The…
Original Post: In case you missed it: June 2017 roundup

In case you missed it: May 2017 roundup

In case you missed them, here are some articles from May of particular interest to R users. Many interesting presentations recorded at the R/Finance 2017 conference in Chicago are now available to watch. A review of some of the R packages and projects implemented at the 2017 ROpenSci Unconference. An example of applying Bayesian Learning with the “bnlearn” package to challenge stereotypical assumptions Data from the Billboard Hot 100 chart used to find the most popular words in the titles of pop hits. Microsoft R Open 3.4.0 is now available for Windows, Mac and Linux. How to use the “tweenr” package to create smooth transitions in data animations.  A preview of some of the companies and R applications to be presented at the EARL conference in San Francisco. The AzureDSVM package makes it easy to spawn and manage clusters of…
Original Post: In case you missed it: May 2017 roundup

In case you missed it: April 2017 roundup

In case you missed them, here are some articles from April of particular interest to R users.  The rxExecBy function (in Microsoft R Server) deploys “embarassingly parallel” problems to remote compute services. An interesting population map, reminiscent of a Joy Division album cover, shows Europe’s population density using just 14 lines of R code. Financial startup dv01 uses R to bring greater transparency to consumer lending. Reproducibility with knitr: how to use the “checkpoint” package with the “Knit” feature in RStudio. A summary of the improvements in R 3.4.0.  Slides from my recent talk, “Reproducible Data Science with R”.  SQL Server 2017 will support both R and Python for in-database computation.  New features in Microsoft R Server 9.1, now available.  A sentiment analysis of Warren Buffett’s letters to shareholders.  A workshop on Artificial Intelligence, featuring Microsoft R, is being held…
Original Post: In case you missed it: April 2017 roundup

In case you missed it: December 2016 roundup

In case you missed them, here are some articles from December of particular interest to R users.  Power BI now has a gallery of custom visualizations built with R. Chicago’s Department of Public Health uses R to prioritize health inspections at restaurants. A beautiful map of Switzerland municipalities combined with a relief map of the mountains, created with R. Using the Azure Interface Tool to parallelize the problem of optimizing an R model across the hyperparameter space. A primer on Bayesian Statistics. Animating Voronoi tesselations in R to create a greeting card. The Linux Data Science Virtual Machine, which includes several R-related components, is available for a free “test drive” on Azure. The new AzureSMR package lets you manage Azure virtual machines, clusters and storage from R. Interactive decision trees in Microsoft R Server. The ompr package provides numerical optimization with…
Original Post: In case you missed it: December 2016 roundup