In case you missed it: April 2018 roundup

In case you missed them, here are some articles from April of particular interest to R users. Microsoft R Open 3.4.4, based on R 3.4.4, is now available. An R script by Ryan Timpe converts a photo into instructions for rendering it as LEGO bricks. R functions to build a random maze in Minecraft, and have your avatar solve the maze automatically. A dive into some of the internal changes bringing performance improvements to the new R 3.5.0. AI, Machine Learning and Data Science Roundup, April 2018.  An analysis with R shows that Uber has overtaken taxis for trips in New York City. News from the R Consortium: new projects, results from a survey on licenses, and R-Ladies is promoted to a top-level project. A talk, aimed at Artificial Intelligence developers, making the case for using R. Bob Rudis analyzes…
Original Post: In case you missed it: April 2018 roundup

AI, Machine Learning and Data Science Roundup: April 2018

A monthly roundup of news about Artificial Intelligence, Machine Learning and Data Science. This is an eclectic collection of interesting blog posts, software announcements and data applications I’ve noted over the past month or so. Open Source AI, ML & Data Science News An interface between R and Python: reticulate. TensorFlow Hub: A library for reusable machine learning modules. TensorFlow.js: Browser-based machine learning with WebGL acceleration.  Download data from Kaggle with the Kaggle API. Industry News Tensorflow 1.7 supports the TensorRT library for faster computation on NVIDIA GPUs. RStudio now provides a Tensorflow template in Paperspace for computation with NVIDIA GPUs. Google Cloud Text-to-Speech provides natural speech in 32 voices and 12 languages. Amazon Translate is now generally available. Microsoft News ZDNet reviews The Future Computed: “do read it to remind yourself how much preparation is required for the impact…
Original Post: AI, Machine Learning and Data Science Roundup: April 2018

In case you missed it: December 2017 roundup

In case you missed them, here are some articles from December of particular interest to R users. Hadley Wickham’s Shiny app for making eggnog. Using R to analyze the vocal range of pop singers. A video tour of the data.table package from its creator, Matt Dowle. The European R Users Meeting (eRum) will be held in Budapest, May 14-18. Winners of the ASA Police Data Challenge student visualization contest. An introduction to seplyr, a re-skinning of the dplyr package to a standard R evaluation interface. How to run R in the Windows Subsystem for Linux, along with the rest of the Linux ecosystem. A chart of Bechdel scores, showing representation of women in movies over time. The British Ecological Society’s Guide to Reproducible Science advocates the use of R and Rmarkdown. Eight modules from the Microsoft AI School cover Microsoft R…
Original Post: In case you missed it: December 2017 roundup

In case you missed it: November 2017 roundup

In case you missed them, here are some articles from November of particular interest to R users. R 3.4.3 “Kite Eating Tree” has been released. Several approaches for generating a “Secret Santa” list with R. The “RevoScaleR” package from Microsoft R Server has now been ported to Python. The call for papers for the R/Finance 2018 conference in Chicago is now open. Give thanks to the volunteers behind R. Advice for R user groups from the organizer of R-Ladies Chicago. Use containers to build R clusters for parallel workloads in Azure with the doAzureParallel package. A collection of R scripts for interesting visualizations that fit into a 280-character Tweet. R is featured in a StackOverflow case study at the Microsoft Connect conference. The City of Chicago uses R to forecast water quality and issue beach safety alerts. A collection of…
Original Post: In case you missed it: November 2017 roundup

In case you missed it: October 2017 roundup

In case you missed them, here are some articles from October of particular interest to R users. A recent survey of competitors on the Kaggle platform reveals that Python (76%) and R (59%) are the preferred tools for building predictive models. Microsoft’s “Team Data Science Process” has been updated with new guidelines on use of the IDEAR framework for R and Python. Microsoft R Open 3.4.2 is now available for Windows, Mac and Linux. Using the foreach package to estimate bias of rpart trees via bootstrapping. Replays of webinars on the Azure Data Science VM, and on document collection analysis with Azure ML Workbench, are now available. The “officer” package makes it possible to create PowerPoint and Word documents from R, and even include editable R charts. An online book on statistical machine learning with the MicrosoftML package. An updated…
Original Post: In case you missed it: October 2017 roundup

In case you missed it: September 2017 roundup

In case you missed them, here are some articles from September of particular interest to R users. The mathpix package converts images of hand-drawn equations to their LaTeX equivalent. R 3.4.2 is released.  Applying image featurization to the problem of classifying wood knots in lumber. Microsoft ML Server 9.2, which provides operationalization for R, is released. A roundup of news stories related to R from the Ignite conference. A tutorial on launching a Spark cluster with R on HDInsight. A map of pirate attacks, and an ethics lesson on scraping data. A preview of ALTREP, the new vector implementation that promises to bring performance improvements to the R engine. The USGS used R to visualize the rainfall from hurricanes Irma and Harvey. A preview of the EARL London conference, and highlights of some R applications presented there. Microsoft R Open 3.4.1, featuring…
Original Post: In case you missed it: September 2017 roundup

In case you missed it: August 2017 roundup

In case you missed them, here are some articles from August of particular interest to R users. Using the featurizeText function in the MicrosoftML package to extract ngrams from unstructured text. A joyplot visualizes the probabilities associated with phrases like “highly likely” and “little chance” by a sample of 46 Redditors. Two examples of creating 3-D animations in R: a stereo cube, and the Charleston in motion capture data. A tutorial on creating thematic maps in R, from ComputerWorld.  Some tips on using R to query data in Power BI. Using the Rcpp package to calculate a membership matrix for fuzzy k-means clustering. A reimagining of Minard’s chart of Napoleon’s march on Russia. Rankings of gender roles for men and women in film, from an analysis of scripts with the tidytext package. Several talks at the upcoming Ignite conference feature…
Original Post: In case you missed it: August 2017 roundup

In case you missed it: July 2017 roundup

In case you missed them, here are some articles from July of particular interest to R users. A tutorial on using the rsparkling package to apply H20’s algorithms to data in HDInsight. Several exercises to learn parallel programming with the foreach package. A presentation on the R6 class system, by Winston Chang. Introducing “joyplots”, a ggplot2 extension for visualizing multiple time series or distributions (with a nod to Joy Division). SQL Server 2017, with many new R-related capabilities, is nearing release. Ali Zaidi on using neural embeddings with R and Spark to analyze Github comments. R ranks #6 in the 2017 IEEE Spectrum Top Programming Languages. Course materials on “Data Analysis for the Life Sciences”, from Rafael Irizarry. How to securely store API keys in R scripts with the “secret” package. An in-depth tutorial on implementing neural network algorithms in…
Original Post: In case you missed it: July 2017 roundup

In case you missed it: June 2017 roundup

In case you missed them, here are some articles from June of particular interest to R users. R 3.4.1 “Single Candle” has been released. The Scientific Computing Coordinator at the FDA explains how R is used at the FDA and by sponsors for clinical trial submissions. Several useful tips related to including images in Rmarkdown documents. A review of one of R’s best features — its community. It’s now possible to include interactive R visualizations in Power BI reports (like graphics created with plotly or htmlwidgets). The Azure Data Science Virtual Machine for Windows now supports GPU-based computations with Microsoft R, Tensorflow, and other included software. The 2017 Burtch Works survey of data science software popularity shows R leading, Python gaining, and SAS declining. A video presentation by Ali Zaidi on using the sparklyr package with Microsoft R Server. The…
Original Post: In case you missed it: June 2017 roundup

In case you missed it: May 2017 roundup

In case you missed them, here are some articles from May of particular interest to R users. Many interesting presentations recorded at the R/Finance 2017 conference in Chicago are now available to watch. A review of some of the R packages and projects implemented at the 2017 ROpenSci Unconference. An example of applying Bayesian Learning with the “bnlearn” package to challenge stereotypical assumptions Data from the Billboard Hot 100 chart used to find the most popular words in the titles of pop hits. Microsoft R Open 3.4.0 is now available for Windows, Mac and Linux. How to use the “tweenr” package to create smooth transitions in data animations.  A preview of some of the companies and R applications to be presented at the EARL conference in San Francisco. The AzureDSVM package makes it easy to spawn and manage clusters of…
Original Post: In case you missed it: May 2017 roundup