Stan 2.11 Good, Stan 2.10 Bad

Stan 2.11 Good, Stan 2.10 Bad Stan 2.11 is available for all interfaces We are happy to announce that all of the interfaces have been updated to Stan 2.11. There was a subtle bug introduced in 2.10 where a probabilistic acceptance condition was being checked twice. Sorry about that and thanks for your patience. We’ve added some additional tests to…
Original Post: Stan 2.11 Good, Stan 2.10 Bad

Mind Your Units

By JEAN STEINERRandomized A/B experiments are the gold standard for estimating causal effects. The analysis can be straightforward, especially when it’s safe to assume that individual observations of an outcome measure are independent. However, this is not always the case. When observations are not independent, an analysis that assumes independence can lead us to believe that effects are significant when…
Original Post: Mind Your Units

Even social scientists can think like pundits, unfortunately

I regularly read the Orgtheory blog which has interesting perspectives from sociologists. Today I saw this, from Sean Safford: I [Safford] actually hold to the idea that the winning candidate for President is always the one who has a clearer view of the challenges and opportunities facing the country and articulates a viable roadmap for how to navigate them. I…
Original Post: Even social scientists can think like pundits, unfortunately

Visualisation of Likert scale results

I wrote a function to visualise results of Likert scale items. Please find the function below the post. Here is an example plot: The function is called ‘plot.likert’ and takes the following arguments: – vec: The vector with the raw results – possible.values: A vector with all the possible values. This is sometimes important if not all possible responses were…
Original Post: Visualisation of Likert scale results

KDnuggets Free Bronze Pass to Strata + Hadoop World New York City, Sep 28-29, 2016

Previous post            Tweet Tags: Big Data, Business, Hadoop, New York City, NY, Strata Strata + Hadoop World is the leading event on how big data and ubiquitous, real-time computing is shaping the course of business and society. Win KDnuggets free pass to Strata + Hadoop World New York City. By Gregory Piatetsky, KDnuggets. As a media partner…
Original Post: KDnuggets Free Bronze Pass to Strata + Hadoop World New York City, Sep 28-29, 2016

Data Science of Visiting Famous Movie Locations in San Francisco

           Tweet Previous post Next post Tags: CA, Data Science, Google, IMDb, Python, San Francisco Using the Google Places API and IMDb API, we selected movie locations in The Golden City which every movie fan should visit while they are in town, and optimize sightseeing by solving the travelling salesman problem. comments By Juraj Kapasny, Knoyd. Let the…
Original Post: Data Science of Visiting Famous Movie Locations in San Francisco

What recommendations to give when a medical study is not definitive (which of course will happen all the time, especially considering that new treatments should be compared to best available alternatives, which implies that most improvements will be incremental at best)

Simon Gates writes: I thought you might be interested in a recently published clinical trial, for potential blog material. It picks up some themes that have cropped in recent months. Also, it is important for the way statistical methods influence what can be life or death decisions. The OPPTIMUM trial (http://www.thelancet.com/journals/lancet/article/PIIS0140-6736(16)00350-0/abstract) evaluated use of vaginal progesterone for prevention of preterm delivery. …
Original Post: What recommendations to give when a medical study is not definitive (which of course will happen all the time, especially considering that new treatments should be compared to best available alternatives, which implies that most improvements will be incremental at best)

Plotcon 2016 – Speakers and topics in R

Jenny Bryan Topic: Extract plotting intent from spreadsheets in R Bio: Jennifer Bryan is an Associate Professor in the Statistics Department and the Michael Smith Laboratories at the University of British Columbia in Vancouver. She’s a biostatistician specialized in genomics and takes a special interest and delight in data analysis and statistical computing. Kent Russell Topic: Research in Finance Alive…
Original Post: Plotcon 2016 – Speakers and topics in R

Plot some variables against many others with tidyr and ggplot2

(This article was first published on blogR, and kindly contributed to R-bloggers) Want to see how some of your variables relate to many others? Here’s an example of just this: library(tidyr) library(ggplot2) mtcars %>% gather(-mpg, -hp, -cyl, key = “var”, value = “value”) %>% ggplot(aes(x = value, y = mpg, color = hp, shape = factor(cyl))) + geom_point() + facet_wrap(~…
Original Post: Plot some variables against many others with tidyr and ggplot2

Merge a list of datasets together

Last week I showed how to read a lot of datasets at once with R, and this week I’ll continue from there and show a very simple function that uses this list of read datasets and merges them all together. First we’ll use read_list() to read all the datasets at once (for more details read last week’s post): library(“readr”) library(“tibble”)…
Original Post: Merge a list of datasets together