5 ways a SAS Health Check can simplify a transition to R

Nic Crane, Data Scientist At Mango, we’re seeing more and more clients making the decision to modernise their analytics process; moving away from SAS and on to R, Python, and other technologies. There are a variety of reasons for this, including SAS license costs, the increase of recent graduates with R and Python skills, SAS becoming increasingly uncommon, or the need for flexible technologies which have the capability for advanced analytics and quality graphics output. While such transitions are typically about much more than just technology migration, the code accounts for a significant degree of the complexity. So, in order to support our clients, we have developed a software suite to analyse the existing SAS code and simplify this process. So how can a SAS Code Health Check help you decide on how to tackle this kind of transformation? 1.…
Original Post: 5 ways a SAS Health Check can simplify a transition to R

EARL Boston round up

Now we’ve recovered from over indulging in Boston’s culinary delights, we’re ready to share our highlights from this year’s EARL Boston Conference. Day 1 highlights Stack Overflow’s David Robinson kicked off the Conference, using Stack Overflow data to perform all sorts of interesting analyses. Highlights included trends in questions mentioning specific R packages over time, leading to the identification of rival R packages. We found that R is the least disliked language (because it’s the best obviously!); although David cautioned that often people who haven’t used R before haven’t heard of it either. Richie Cotton’s talk on how DataCamp is a ‘data-inspired’ organisation was particularly entertaining and he was a really engaging speaker. It was also great to hear from Emily Riederer about tidycf; she shared a really good example of the type of data-driven revolution taking place in many…
Original Post: EARL Boston round up

Putting the cat in scatterplot

Clara Schartner, Data Scientist It will come as no surprise that cats and ggplot are among our favourite things here at Mango, luckily there is an easy way to combine both. Using the function annotation_custom in the popular ggplot2 package it is possible to display images on a plot i.e. points of a scatterplot. This way data can be displayed in a more fun, creative way. In keeping with the cat theme I have chosen a data set about cats and a cat icon based on Mango the cat. The MASS package provides a data set called cats which contains the body weight, heart weight and sex of adult cats. library(MASS) data(cats) head(cats) set.seed(1234) cats <- cats[sample(1:144, size = 40),] First a normal scatterplot is defined on which the images will be plotted later: library(ggplot2) sCATter <-ggplot(data = cats, aes(x…
Original Post: Putting the cat in scatterplot

EARL Boston must see sessions and 15% off!

We’re only a few weeks away! Don’t miss keynotes from Mara Averick, David Robinson and Tareef Kawaf! They will be joined by speakers from DataCamp, Crunch, Pfizer, GfK, Biogen, VideoBlocks, Microsoft, Oracle, TIBCO, and Sorensen Impact Center, just to name a few. The agenda covers a huge range of topics, including: Using Shiny in a commercial setting – see Dr Shatrunhai Singh’s presentation: ‘A Shiny way to operationalising analytics’. Using R to improve the community – see Joseph Ciesielski’s presentation: ‘The Tidyvese in the non-profit sector: From counting kids to changing outcomes’. The use of R in pharma – see Dr Xiao Ni’s presentation: ‘ShinyRAP: A framework for analysis and building interactive/dynamic reports using Shiny/Markdown’. R and data engineering – see Jeff Allen’s presentation: ‘Push-button publishing in RStudio Connect’. Moving from SAS to R – see Monika Wahi’s presentation: ‘Increasing…
Original Post: EARL Boston must see sessions and 15% off!

EARL London 2017 – That’s a wrap!

Beth Ashlee, Data Scientist After a successful first-ever EARL San Francisco in June, it was time to head back to the birth place EARL – London. With more abstracts submitted than ever before, the conference was made up of 54 fantastic talks and 5 key notes from an impressive selection of industries. With so many talks to pick from we thought we would summarise a few of my favourites! Day 1 highlights: After brilliant keynotes from Tom Smith (ONS) and Rstudio’s Jenny Bryan in session 1, Derek Norton and Neera Talbert from Microsoft took us through the Microsoft process of moving a company from SAS to R in session 2. They explained that with the aim of shrinking the ‘SAS footprint’, it’s important to think about the drivers behind a company leaving SAS as well as considering the impact to…
Original Post: EARL London 2017 – That’s a wrap!

Boston EARL speaker announcement: Agenda now available

Mango Solutions are delighted to announce the full EARL Boston agenda for this coming November! As always, we received more abstracts than we could accept and all were of excellent quality. Because of this, the decision-making process was not easy; however, we believe we’ve put together a great agenda that showcases how a range of industries are solving business problems with R. We’re confident that R users of all levels and from all industries will find the EARL Boston Conference useful. On to the speakers! Joining David, Mara and Tareef, are speakers from companies such as DataCamp, IBM, Microsoft, Capital One, Goldman Sachs International, Novartis and GfK. Thursday 2 November Dr Shatrunjai Singh, John Hancock Insurance Charlie Thompson, VideoBlocks Chris Campbell, Mango Solutions Sudha Subramanian, Sparkfish Joseph Ciesielski, Philadelphia Youth Network Daniel Hadley, Sorenson Impact Center Emily Riederer, Capital One…
Original Post: Boston EARL speaker announcement: Agenda now available

Less than a week to go: EARL London 2017

We’re down to single digits for the 2017 EARL London! Next Tuesday’s workshops only have limited places left, so if you had plans to attend, make sure you book now. For more information on the workshops available take a look here. Wednesday and Thursday are shaping up to be two excellent days of presentations with keynotes: Tom Smith, Managing Director at the Office for National Statistics’ Data Science Campus; Jenny Bryan, Software Engineer at RStudio; and Hilary Parker, Data Scientist at Stitch Fix. They’ll be followed by talks from R users from almost every industry, including organisations like Microsoft, RStudio, the National Audit Office, IBM, Royal London, Bank of England, AstraZeneca, uSwitch and Telegraph Media Group. Every year, the feedback we hear most from delegates at EARL is that they like being able to learn from their fellow R users’…
Original Post: Less than a week to go: EARL London 2017

DEADLINE EXTENDED: Last call for Boston EARL abstracts

Are you solving problems and innovating with R?Are you working with R in a commercial setting?Do you enjoy sharing your knowledge? If you said yes to any of the above, we want your abstract! Share your commerical R stories with your peers at EARL Boston this November. EARL isn’t about knowing the most or being the best in your field – it’s about taking what you’ve learnt and sharing it with others, so they can learn from your wins (and sometimes your failures, because we all have them!). As long as your proposed presentation is focused on the commerical use of R, any topic from any industry is welcome! The abstract submission deadline has been extended to Sunday 3 September. Join David Robinson, Mara Averick and Tareef Kawaf on 1-3 November 2017 at The Charles Hotel in Cambridge. To submit…
Original Post: DEADLINE EXTENDED: Last call for Boston EARL abstracts

Boston EARL Keynote speaker announcement: Tareef Kawaf

Mango Solutions are thrilled to announce that Tareef Kawaf, President of RStudio, will be joining us at EARL Boston as our third Keynote Speaker. Tareef is an experienced software startup executive and a member of teams that built up ATG’s eCommerce offering and Brightcove’s Online Video Platform, helping both companies grow from early startups to publicly traded companies. He joined RStudio in early 2013 to help define its commercial product strategy and build the team. He is a software engineer by training, and an aspiring student of advanced analytics and R. This will be Tareef’s second time speaking at EARL Boston and we’re big supporters of RStudio’s mission to provide the most widely used open source and enterprise-ready professional software for the R statistical computing environment, so we’re looking forward to him taking to the podium again this year. Want…
Original Post: Boston EARL Keynote speaker announcement: Tareef Kawaf

GoTr – R wrapper for An API of Ice And Fire

Ava Yang It’s Game of Thrones time again as the battle for Westeros is heating up. There are tons of ideas, ingredients and interesting analyses out there and I was craving for my own flavour. So step zero, where is the data? Jenny Bryan’s purrr tutorial introduced the list got_chars, representing characters information from the first five books, which seems not much fun beyond exercising list manipulation muscle. However, it led me to an API of Ice and Fire, the world’s greatest source for quantified and structured data from the universe of Ice and Fire including the HBO series Game of Thrones. I decided to create my own API functions, or better, an R package (inspired by the famous rwar package). The API resources cover 3 types of endpoint – Books, Characters and Houses. GoTr pulls data in JSON format…
Original Post: GoTr – R wrapper for An API of Ice And Fire