Data science, Python, Industrial iot deployments & more… BigData News Tuesday, February 27

BigData News TLDR / Table of Contents

  • Python Overtakes R for Data Science and Machine Learning
    • This article summarizes a trend in programming languages usage, based on a number of proxy metrics. This change started to be more pronounced in early 2017: Py…
    • data science, python, Python Data Science, ,
  • Samsung expands IoT partnerships for ARTIK, teams up with PTC
    • Samsung is stepping up its industrial IoT game, with its ARTIK platform as a service.
    • industrial IoT deployments, Samsung, IoT platform, IoT deployment unit, PTC
  • Why data science is simply the new astrology
    • Many data scientists accept the model that gives the best results on the data set at hand; no attempt is made to understand why the given inputs lead to the output
    • data, data science, spurious correlations, data analysis, n’t data analysis
  • Deep learning for biology
    • A popular artificial-intelligence method provides a powerful tool for surveying and classifying biological data. But for the uninitiated, the technology poses significant difficulties.
    • deep learning, data, deep-learning algorithms, Google Accelerated Science, data sets
  • Spring Discovery Brings Machine Learning to Longevity Sector
    • Backed by General Catalyst, Sam Altman and the Longevity Fund, the startup hopes to accelerate the experimentation phase of aging research.
    • Dow Jones, Dow Jones Reprints, presentation-ready copies, ,
  • This change started to be more pronounced in early 2017: Python became the language of choice, over R, for data science and machine learning applications.
  • Search index for Python Data Science (blue) versus R Data Science (red) over the last 5 years, in US – – We used the app in question to compare search interest for R data Science versus Python Data Science, see above chart.
  • Top cities in US are: – – R Data Sciencereturns 7,533 full time jobs.
  • Top cities in US are: – – We have 83 fresh, active job ads, relevant to data science and mostly in US and London, for Python: you can check them out here.
  • A Google search for R or Python(on Data Science Central) will yield similar conclusions.

Tags: data science, python, Python Data Science, ,

  • Video: Humanizing the Internet of Things – Samsung announced a series of partnerships with the likes of PTC, in a bid to put its ARTIK Internet of Things (IoT) platform into more buildings, devices, and industrial settings.
  • Among the major highlights: – Samsung formed a partnership with PTC on its ThingWorx platform, which is used in industrial settings.ARTIK is also being integrated with Shoreline iCast2’s IoT bridge in a move that pairs up PTC, Samsung and Shoreline in industrial IoT deployments.Samsung’s ARTIK is now interoperable with Harman’s…
  • PTC’s ThingWorxs is a staple in industrial IoT deployments and ARTIK will give Samsung a foothold in the space.
  • The Samsung ARTIK and PTC have teamed up on cloud services, asset management kits, device hardware via Shoreline, and various tools.
  • Read also: PTC’s industrial IoT platform ThingWorx gets new apps, more AR support – On the partnership front, ARTIK will integrate with Harman’s gateway and applications and industrial tools.

Tags: industrial IoT deployments, Samsung, IoT platform, IoT deployment unit, PTC

  • The way all these algorithms perform is similarfed with large sets of images that contain both positive and negative cases of the condition to be detected, they calibrate parameters of a mathematical formula so that patterns that lead to positive and negative cases can be distinguished.
  • In case the patterns dont make sense, data scientists have tweaked their models in a way that they give more meaningful results.
  • Different mathematical models are adept at detecting patterns in different kinds of data, and picking the right algorithm for the data ensures that spurious pattern detection is minimized.
  • Given the difficulty of explaining the models, the average data scientist proceeds to use the algorithms as black boxes.
  • The way a large number of data scientists approach a problem is to take a data set and then apply all possible machine learning methods on it.

Tags: data, data science, spurious correlations, data analysis, n’t data analysis

  • They were interested in applying deep-learning approaches to the mountains of imaging data generated by Finkbeiners team at the Gladstone Institute of Neurological Disease in San Francisco, also in California.
  • Deep-learning algorithms take raw features from an extremely large, annotated data set, such as a collection of images or genomes, and use them to create a predictive tool based on patterns buried inside.
  • Finkbeiners team, with scientists at Google, trained a deep algorithm with two sets of cells, one artificially labelled to highlight features that scientists cant normally see, the other unlabelled.
  • Researchers are using the algorithms to classify cellular images, make genomic connections, advance drug discovery and even find links across different data types, from genomics and imaging to electronic medical records.
  • Freys academic team at the University of Toronto developed algorithms trained on genomic and transcriptomic data from healthy cells.

Tags: deep learning, data, deep-learning algorithms, Google Accelerated Science, data sets

  • Type and press ‘Enter’ or click ‘Search’ – – – Alerts – – – – – – – – This copy is for your personal, non-commercial use only.
  • To order presentation-ready copies for distribution to your colleagues, clients or customers visit

Tags: Dow Jones, Dow Jones Reprints, presentation-ready copies, ,

Top Big Data Courses

The Ultimate Hands-On Hadoop - Tame your Big Data! (31,889 students enrolled)

By Sundog Education by Frank Kane
  • Design distributed systems that manage "big data" using Hadoop and related technologies.
  • Use HDFS and MapReduce for storing and analyzing data at scale.
  • Use Pig and Spark to create scripts to process data on a Hadoop cluster in more complex ways.
  • Analyze relational data using Hive and MySQL
  • Analyze non-relational data using HBase, Cassandra, and MongoDB
  • Query data interactively with Drill, Phoenix, and Presto
  • Choose an appropriate data storage technology for your application
  • Understand how Hadoop clusters are managed by YARN, Tez, Mesos, Zookeeper, Zeppelin, Hue, and Oozie.
  • Publish data to your Hadoop cluster using Kafka, Sqoop, and Flume
  • Consume streaming data using Spark Streaming, Flink, and Storm

Learn more.

Taming Big Data with MapReduce and Hadoop - Hands On! (13,894 students enrolled)

By Sundog Education by Frank Kane
  • Understand how MapReduce can be used to analyze big data sets
  • Write your own MapReduce jobs using Python and MRJob
  • Run MapReduce jobs on Hadoop clusters using Amazon Elastic MapReduce
  • Chain MapReduce jobs together to analyze more complex problems
  • Analyze social network data using MapReduce
  • Analyze movie ratings data using MapReduce and produce movie recommendations with it.
  • Understand other Hadoop-based technologies, including Hive, Pig, and Spark
  • Understand what Hadoop is for, and how it works

Learn more.

Comments are closed, but trackbacks and pingbacks are open.