MMLSpark: Unifying Machine Learning Ecosystems at Massive Scales

10/20/2018
by   Mark Hamilton, et al.
0

We introduce Microsoft Machine Learning for Apache Spark (MMLSpark), an ecosystem of enhancements that expand the Apache Spark distributed computing library to tackle problems in Deep Learning, Micro-Service Orchestration, Gradient Boosting, Model Interpretability, and other areas of modern computation. Furthermore, we present a novel system called Spark Serving that allows users to run any Apache Spark program as a distributed, sub-millisecond latency web service backed by their existing Spark Cluster. All MMLSpark contributions have the same API to enable simple composition across frameworks and usage across batch, streaming, and RESTful web serving scenarios on static, elastic, or serverless clusters. We showcase MMLSpark by creating a method for deep object detection capable of learning without human labeled data and demonstrate its effectiveness for Snow Leopard conservation.

READ FULL TEXT
research
11/08/2018

Relation of Web Service Orchestration, Abstract Process, Web Service and Choreography

We refine the relation of Web service orchestration, abstract process, W...
research
11/27/2018

DLHub: Model and Data Serving for Science

While the Machine Learning (ML) landscape is evolving rapidly, there has...
research
02/23/2022

MLProxy: SLA-Aware Reverse Proxy for Machine Learning Inference Serving on Serverless Computing Platforms

Serving machine learning inference workloads on the cloud is still a cha...
research
02/08/2018

Deep Learning with Apache SystemML

Enterprises operate large data lakes using Hadoop and Spark frameworks t...
research
09/11/2019

Addressing Algorithmic Bottlenecks in Elastic Machine Learning with Chicle

Distributed machine learning training is one of the most common and impo...
research
05/07/2020

funcX: A Federated Function Serving Fabric for Science

Exploding data volumes and velocities, new computational methods and pla...
research
09/04/2023

Objcache: An Elastic Filesystem over External Persistent Storage for Container Clusters

Container virtualization enables emerging AI workloads such as model ser...

Please sign up or login with your details

Forgot password? Click here to reset