What can Data-Centric AI Learn from Data and ML Engineering?

by   Neoklis Polyzotis, et al.

Data-centric AI is a new and exciting research topic in the AI community, but many organizations already build and maintain various "data-centric" applications whose goal is to produce high quality data. These range from traditional business data processing applications (e.g., "how much should we charge each of our customers this month?") to production ML systems such as recommendation engines. The fields of data and ML engineering have arisen in recent years to manage these applications, and both include many interesting novel tools and processes. In this paper, we discuss several lessons from data and ML engineering that could be interesting to apply in data-centric AI, based on our experience building data and ML platforms that serve thousands of applications at a range of organizations.


page 1

page 2

page 3

page 4


Data-centric Artificial Intelligence

Data-centric artificial intelligence (data-centric AI) represents an eme...

DataPerf: Benchmarks for Data-Centric AI Development

Machine learning (ML) research has generally focused on models, while th...

A conceptual model for leaving the data-centric approach in machine learning

For a long time, machine learning (ML) has been seen as the abstract pro...

Towards Data-centric Graph Machine Learning: Review and Outlook

Data-centric AI, with its primary focus on the collection, management, a...

The Tensor Data Platform: Towards an AI-centric Database System

Database engines have historically absorbed many of the innovations in d...

Framework for disruptive AI/ML Innovation

This framework enables C suite executive leaders to define a business pl...

Data-Centric Governance

Artificial intelligence (AI) governance is the body of standards and pra...

Please sign up or login with your details

Forgot password? Click here to reset