A Survey on Large-scale Machine Learning

08/10/2020
by   Meng Wang, et al.
0

Machine learning can provide deep insights into data, allowing machines to make high-quality predictions and having been widely used in real-world applications, such as text mining, visual classification, and recommender systems. However, most sophisticated machine learning approaches suffer from huge time costs when operating on large-scale data. This issue calls for the need of Large-scale Machine Learning (LML), which aims to learn patterns from big data with comparable performance efficiently. In this paper, we offer a systematic survey on existing LML methods to provide a blueprint for the future developments of this area. We first divide these LML methods according to the ways of improving the scalability: 1) model simplification on computational complexities, 2) optimization approximation on computational efficiency, and 3) computation parallelism on computational capabilities. Then we categorize the methods in each perspective according to their targeted scenarios and introduce representative methods in line with intrinsic strategies. Lastly, we analyze their limitations and discuss potential directions as well as open issues that are promising to address in the future.

READ FULL TEXT

page 7

page 8

page 20

research
11/18/2018

A Survey on Spark Ecosystem for Big Data Processing

With the explosive increase of big data in industry and academic fields,...
research
09/02/2019

Big Data Analytics for Large Scale Wireless Networks: Challenges and Opportunities

The wide proliferation of various wireless communication systems and wir...
research
05/15/2019

End-to-End Entity Resolution for Big Data: A Survey

One of the most important tasks for improving data quality and the relia...
research
03/20/2023

A Survey of Demonstration Learning

With the fast improvement of machine learning, reinforcement learning (R...
research
08/23/2022

Survey on Evolutionary Deep Learning: Principles, Algorithms, Applications and Open Issues

Over recent years, there has been a rapid development of deep learning (...
research
10/15/2014

Complexity Issues and Randomization Strategies in Frank-Wolfe Algorithms for Machine Learning

Frank-Wolfe algorithms for convex minimization have recently gained cons...
research
07/26/2019

Exploiting new forms of data to study the private rented sector: strengths and limitations of a database of rental listings

Reviews of official statistics for UK housing have noted that developmen...

Please sign up or login with your details

Forgot password? Click here to reset