R bloggers

New open data sets from Microsoft Research

FavoriteLoadingAdd to favorites

Microsoft has released a number of data sets produced by Microsoft Research and made them available for download at Microsoft Research Open Data.   The Datasets in Microsoft Research Open Data are categorized by their primary research area, such as Physics, Social Science, Environmental Science, and Information Science. Many of the data sets have not been previously available to the public, and many are large and useful for research in AI and Machine Learning techniques. Many of the datasets also include links to associated papers from Microsoft Research. For example, the 10Gb DESM Word Embeddings dataset provides the IN and the OUT word2vec embeddings for 2.7M words trained on a Bing query corpus of 600M+ queries. Other data sets of note include: A collection of 38M tweets related to the 2012 US election 3-D capture data from individuals performing a variety of hand gestures…
Original Post: New open data sets from Microsoft Research