Microsoft R Open 3.4.3 now available

Microsoft R Open (MRO), Microsoft’s enhanced distribution of open source R, has been upgraded to version 3.4.3 and is now available for download for Windows, Mac, and Linux. This update upgrades the R language engine to the latest R (version 3.4.3) and updates the bundled packages (specifically: checkpoint, curl, doParallel, foreach, and iterators) to new versions. MRO is 100% compatible with all R packages. MRO 3.4.3 points to a fixed CRAN snapshot taken on January 1, 2018, and you can see some highlights of new packages released since the prior version of MRO on the Spotlights page. As always, you can use the built-in checkpoint package to access packages from an earlier date (for reproducibility) or a later date (to access new and updated packages). MRO 3.4.3 is based on R 3.4.3, a minor update to the R engine (you can see the detailed list…
Original Post: Microsoft R Open 3.4.3 now available

Personal note on joining the Microsoft Cloud Advocates team

A quick personal note: today is my first day as a member of the Cloud Developer Advocates team at Microsoft! I’ll still be blogging and attending events related to R, and supporting the R community, but now I’ll be doing it as a member of a team dedicated to community outreach. As a bit of background, when I joined Microsoft back in 2015 via the acquisition of Revolution Analytics, I was thrilled to be able to continue my role supporting the R community. Since then, Microsoft as a whole has continued to ramp up its support of open source projects and to interact directly with developers of all stripes (including data scientists!) through various initiatives across the company. (Aside: I knew Microsoft was a big company before I joined, but even then it took me a while to appreciate the scale of the…
Original Post: Personal note on joining the Microsoft Cloud Advocates team

Vowpal Wabbit 8.5.0 & NIPS tutorial

Yesterday, I tagged VW version 8.5.0 which has many interactive learning improvements (both contextual bandit and active learning), better support for sparse models, and a new baseline reduction which I’m considering making a part of the default update rule. If you want to know the details, we’ll be doing a mini-tutorial during the Friday lunch break at the Extreme Classification workshop at NIPS. Please join us if interested.
Original Post: Vowpal Wabbit 8.5.0 & NIPS tutorial

R 3.4.3 released

R 3.4.3 has been released, as announced by the R Core team today. As of this writing, only the source distribution (for those who build R themselves) is available, but binaries for Windows, Mac and Linux should appear on your local CRAN mirror within the next day or so. This is primarily a bug-fix release. It fixes an issue with incorrect time zones on macOS High Sierra, and some issues with handling Unicode characters. (Incidentally, representing international and special characters is something that R takes great care in handling properly. It’s not an easy task: a 2003 essay by Joel Spolsky describes the minefield that is character representation, and not much has changed since then.) You can check out the complete list of changes here. Whatever your platform, R 3.4.3 should be backwards-compatible with other R versions in the R…
Original Post: R 3.4.3 released

Scale up your parallel R workloads with containers and doAzureParallel

by JS Tan (Program Manager, Microsoft)
The R language is by far the most popular statistical language, and has seen massive adoption in both academia and industry. In our new data-centric economy, the models and algorithms that data scientists build in R are not just being used for research and experimentation. They are now also being deployed into production environments, and directly into products themselves. However, taking your workload in R and deploying it at production capacity, and at scale, is no trivial matter. Because of R’s rich and robust package ecosystem, and the many versions of R, reproducing the environment of your local machine in a production setting can be challenging. Let alone ensuring your model’s reproducibility! This is why using containers is extremely important when it comes to operationalizing your R workloads. I’m happy to announce that…
Original Post: Scale up your parallel R workloads with containers and doAzureParallel

Microsoft R Open 3.4.2 now available

Microsoft R Open (MRO), Microsoft’s enhanced distribution of open source R, has been upgraded to version 3.4.2 and is now available for download for Windows, Mac, and Linux. This update upgrades the R language engine to the latest R 3.4.2 and updates the bundled packages. MRO is 100% compatible with all R packages. MRO 3.4.2 points to a fixed CRAN snapshot taken on October 15, 2017, and you can see some highlights of new packages released since the prior version of MRO on the Spotlights page. As always, you can use the built-in checkpoint package to access packages from an earlier date (for compatibility) or a later date (to access new and updated packages). MRO 3.4.2 is based on R 3.4.2, a minor update to the R engine (you can see the detailed list of updates to R here). This update is backwards-compatible with…
Original Post: Microsoft R Open 3.4.2 now available

Announcing dplyrXdf 1.0

I’m delighted to announce the release of version 1.0.0 of the dplyrXdf package. dplyrXdf began as a simple (relatively speaking) backend to dplyr for Microsoft Machine Learning Server/Microsoft R Server’s Xdf file format, but has now become a broader suite of tools to ease working with Xdf files. This update to dplyrXdf brings the following new features:
- Support for the new tidyeval framework that powers the current release of dplyr
- Support for Spark and Hadoop clusters, including integration with the sparklyr package to process Hive tables in Spark
- Integration with dplyr to process SQL Server tables in-database
- Simplified handling of parallel processing for grouped data
- Several utility functions for Xdf and file management
- Workarounds for various glitches and unexpected behaviour in MRS and dplyr
Spark, Hadoop and HDFS
New in version 1.0.0 of dplyrXdf is support for Xdf files and datasets stored…
Original Post: Announcing dplyrXdf 1.0

R 3.4.2 is released

The R Core team today announced the release of R 3.4.2. This release fixes a number of minor bugs and also includes a performance improvement to the commonly used function c when applied to vectors with a names attribute. Like all minor releases, this release is backwards compatible with prior releases in the R 3.4.x series. Binary builds of R 3.4.2 will begin to appear on your local CRAN mirror within the next day or so. This release also marks the final Windows build of R by Duncan Murdoch, who has been volunteering his time to the R Core group for more than 15 years to build R (and its companion Rtools) for Windows. Along with the rest of the team here at Microsoft, I’d like to offer a hearty THANK YOU to Duncan for his time, effort and expertise over…
Original Post: R 3.4.2 is released

Meet the new Microsoft R Server: Microsoft ML Server 9.2

Microsoft R Server has received a new name and a major update: Microsoft ML Server 9.2 is now available. ML Server provides a scalable production platform for R (and now Python) programs. The basic idea is that a local client can push R or Python code and have it operationalized on the remote server. ML Server is also included with the Data Science Virtual Machine and HDInsight Spark clusters on Azure. This video gives a high-level overview of the process, or you can see details of deploying an R model or a Python model as a web service. The related Microsoft Machine Learning Services provides similar capabilities for in-database computations within SQL Server 2017 (now with Python as well as R) and (in preview) the fully-managed Azure SQL Database. ML Services also provides real-time scoring of trained models, with predictions generated…
Original Post: Meet the new Microsoft R Server: Microsoft ML Server 9.2

Microsoft R Open 3.4.1 now available

Microsoft R Open (MRO), Microsoft’s enhanced distribution of open source R, has been upgraded to version 3.4.1 and is now available for download for Windows, Mac, and Linux. This update upgrades the R language engine to R 3.4.1 and updates the bundled packages. MRO is 100% compatible with all R packages. MRO 3.4.1 points to a fixed CRAN snapshot from September 1, 2017, and you can see some highlights of new packages released since MRO 3.4.0 on the Spotlights page. As always, you can use the built-in checkpoint package to access packages from an earlier date (for compatibility) or a later date (to access new and updated packages). MRO 3.4.1 is based on R 3.4.1, a minor update to the R engine (you can see the detailed list of updates to R here). If you’ve had problems installing packages on Windows, this update does…
Original Post: Microsoft R Open 3.4.1 now available