Large-scale Mobile App Identification Using Deep Learning

10/06/2019
by   Shahbaz Rezaei, et al.
0

Many network services and tools (e.g. network monitors, malware-detection systems, routing and billing policy enforcement modules in ISPs) depend on identifying the type of traffic that passes through the network. With the widespread use of mobile devices, the vast diversity of mobile apps, and the massive adoption of encryption protocols (such as TLS), large-scale traffic classification becomes inevitable and more difficult. In this paper, we propose a deep learning model for mobile app identification. The proposed model only needs the payload of the first few packets for classification, and, hence, it is suitable even for applications that rely on early prediction, such as routing and QoS provisioning. The deep model achieves between 84 accuracy for the identification of 80 popular apps. We also perform occlusion analysis for the first time to bring insight into what data is leaked from SSL/TLS protocol that allows accurate app identification. Moreover, our traffic analysis shows that many apps generate not only app-specific traffic, but also numerous ambiguous flows. Ambiguous flows are flows generated by common functionality modules, such as advertisement and traffic analytics. Because such flows are common among many different apps, identifying the source app that generates ambiguous flows is challenging. To address this challenge, we propose a CNN+LSTM model that takes adjacent flows to learn the order and pattern of multiple flows, to better identify the app that generates them. We show that such flow association considerably improves the accuracy, particularly for ambiguous flows. Furthermore, we show that our approach is robust to mixed traffic scenarios where some unrelated flows may appear in adjacent flows. To the best of our knowledge, this is the first work that identifies the source app for ambiguous flows.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 10

page 12

page 14

research
06/09/2021

Auditing Network Traffic and Privacy Policies in Oculus VR

Virtual reality (VR) is an emerging technology that enables new applicat...
research
11/27/2019

DeviceWatch: Identifying Compromised Mobile Devices through Network Traffic Analysis and Graph Inference

In this paper, we propose to identify compromised mobile devices from a ...
research
02/06/2023

From Emulation to Mathematical: A More General Traffic Obfuscation Approach To Encounter Feature based Mobile App traffic Classification

The usage of the mobile app is unassailable in this digital era. While t...
research
12/19/2018

AnFlo: Detecting Anomalous Sensitive Information Flows in Android Apps

Smartphone apps usually have access to sensitive user data such as conta...
research
02/18/2020

Mind Your Weight(s): A Large-scale Study on Insufficient Machine Learning Model Protection in Mobile Apps

On-device machine learning (ML) is quickly gaining popularity among mobi...
research
09/03/2022

Illegal But Not Malware: An Underground Economy App Detection System Based on Usage Scenario

This paper focuses on mobile apps serving the underground economy by pro...
research
12/14/2010

Seamless Flow Migration on Smartphones without Network Support

This paper addresses the following question: Is it possible to migrate T...

Please sign up or login with your details

Forgot password? Click here to reset