IDentrix: Data Scientist

Seeking a passionate Data Scientist with a proven record of building data driven solutions, who is interested in data mining and modeling specialized large and connected datasets. Company: IDentrixLocation: Bethesda, MDWeb: www.identrix.comPosition: Data Scientist_Contact_:Email to recruiting@identrix.com. IDentrix (www.identrix.com), is disrupting the world of risk management and insider threat with a first of its kind continuous risk management platform. This opening…
Original Post: IDentrix: Data Scientist

Top KDnuggets tweets, Nov 23-29: The Entire #Python Language in a Single Image ; Great list of Data Science, Machine Learning, AI Resources

The Entire #Python Language in a Single Image; Cartoon: Thanksgiving, #BigData, and Turkey #DataScience; 50% of Data Scientists have under 10 GB databases, not #BigData; Machine Learning Algorithms: A Concise Technical OverviewMost popular @KDnuggets tweets for Nov 23-29 wereMost Retweeted:Great list of resources! #DataScience, #MachineLearning, #ArtificialIntelligence #AI Resources https://t.co/eYcQaNFhJG https://t.co/wnWZGhsQDOMost Favorited:Great list of resources! #DataScience, #MachineLearning, #ArtificialIntelligence #AI Resources https://t.co/eYcQaNFhJG…
Original Post: Top KDnuggets tweets, Nov 23-29: The Entire #Python Language in a Single Image ; Great list of Data Science, Machine Learning, AI Resources

An exciting new entry in the “clueless graphs from clueless rich guys” competition

An exciting new entry in the “clueless graphs from clueless rich guys” competition Posted by Andrew on 30 November 2016, 12:36 pm Jeff Lax points to this post from Matt Novak linking to a post by Matt Taibbi that shares the above graph from newspaper columnist / rich guy Thomas Friedman. I’m not one to spend precious blog space mocking bad…
Original Post: An exciting new entry in the “clueless graphs from clueless rich guys” competition

Measuring Topic Interpretability with Crowdsourcing

Topic modelling is an important statistical modelling technique to discover abstract topics in collection of documents. This article talks about a new measure for assessing the semantic properties of statistical topics and how to use it. By Fred Morstatter and Huan Liu. Machine learning algorithms can help produce models that are capable of revealing summaries of the dataset. Topic modelling…
Original Post: Measuring Topic Interpretability with Crowdsourcing

Microsoft R Open 3.3.2 now available

Microsoft R Open 3.3.2, Microsoft’s enhanced distribution of open source R, is now available for download for Windows, Mac, and Linux. This update upgrades the R language engine to version 3.3.2, adds new bundled packages and updates others, and upgrades the Intel Math Kernel Libraries. The updated R 3.3.2 engine includes some performance improvements (particularly in calculation of eigenvalues), better handling…
Original Post: Microsoft R Open 3.3.2 now available

The Data Science Delusion

Gleanings from observed technical misunderstandings between business leaders and data scientists (and among data scientists themselves) so dramatic that one could start wondering whether there is something wrong with data science as it is being practiced. By Anand Ramanathan, Computer Scientist. Prologue  Four years ago, having earned my living as a programmer/researcher for over a decade, I was co-opted into…
Original Post: The Data Science Delusion

Top 10 Amazon Books in Artificial Intelligence & Machine Learning – 2016 Edition

Given the ongoing explosion in interest for all things Data Science, Artificial Intelligence, Machine Learning, etc., we have updated our Amazon top books lists from last year. Here are the 10 most popular titles in the AI & Machine Learning category. The recent explosion of interest in data science, data mining, and related disciplines has been mirrored by an explosion…
Original Post: Top 10 Amazon Books in Artificial Intelligence & Machine Learning – 2016 Edition

Interesting epi paper using Stan

Jon Zelner writes: Just thought I’d send along this paper by Justin Lessler et al. Thought it was both clever & useful and a nice ad for using Stan for epidemiological work. Basically, what this paper is about is estimating the true prevalence and case fatality ratio of MERS-CoV [Middle East Respiratory Syndrome Coronavirus Infection] using data collected via a…
Original Post: Interesting epi paper using Stan

KDnuggets™ News 16:n42, Nov 30: Python Machine Learning Open Source Projects; Facebook Groups for Big Data & Data Science

Python Machine Learning Open Source Projects; Facebook Groups for Big Data & Data Science; Combining Different Methods to Create Advanced Time Series Prediction; Tips for Beginner Machine Learning/Data Scientists Feeling Overwhelmed; Continuous improvement for IoT through AI / Continuous learning Features |  Software |  Tutorials |  Opinions |  News |  Webcasts |  Courses |  Meetings |  Jobs |  Academic |  Tweets | Image of the week  Features   Software   Tutorials, Overviews, How-Tos   Opinions   News…
Original Post: KDnuggets™ News 16:n42, Nov 30: Python Machine Learning Open Source Projects; Facebook Groups for Big Data & Data Science

How to create a ggplot Theme – Unicorn Edition

Themes are an convenient way to give ggplot charts an individualized, sometimes stylish look. Most of the time, I rely on the ggthemes package and the Economist style. Last week colleagues asked me to change the look of my charts. We joked around and I agreed to create a unicorn ggplot theme. I want to use the challenge to detail…
Original Post: How to create a ggplot Theme – Unicorn Edition