Serving Hybrid-Cloud SQL Interactive Queries at Twitter

07/09/2022
by Chunxu Tang, et al.

Demand for data analytics at Twitter has increased consistently over the past years. To meet this demand and provide a highly scalable and available query experience, Twitter relies heavily on a large-scale in-house SQL system. Recently, we evolved this system into a hybrid-cloud SQL federation system, compliant with Twitter's Partly Cloudy strategy. The hybrid-cloud SQL federation system processes queries across Twitter's data centers and the public cloud, interacting with around 10 PB of data per day. This paper presents the design of the system, which consists of query, cluster, and storage federations. We identify challenges in a modern SQL system and demonstrate how our system addresses them through several important design decisions. We also conduct qualitative examinations and summarize instructive lessons learned from developing and operating such a SQL system.


