Model Training Productionization with H2O REST API and Requests in Python

Building a production model-scoring application with H2O benefits greatly from H2O's ability to export models as MOJOs, which allows scoring large datasets in Spark without a native H2O dependency. However, the model training process is still hard to productionize, given the nature of how data scientists work vs. engineering requirements to build a…

Build a multi-label document classification model for SHARE using one-vs-rest classifiers

Jiankun Liu

A recent post talked about how we can label documents on SHARE with Natural Language Processing models. In this post I’m going to include more detail on how it was done. If you are interested in reading further, I recommend you read the previous post (link) first, which introduced the problem, the data…