This study analyzed the vulnerability levels of about 1,000 mobile apps from Google Play across 23 categories. The key findings were:
1) Medical apps had significantly fewer vulnerabilities than categories such as Communication, Entertainment, and Social, while Finance and Shopping apps were no less vulnerable than the other categories.
2) An app's vulnerability level did not affect its rating, but apps with more downloads tended to have higher vulnerability levels.
3) Contextual information like app description, metadata, and static code features could predict an app's vulnerability level with about 75% accuracy, with market data providing complementary insights to code analysis. Addressing app security is important as users may not be aware of risks when installing apps.
1. Exposed! On the Vulnerability-proneness of Google Play apps.
Andrea Di Sorbo, Sebastiano Panichella
https://spanichella.github.io/
https://www.unisannio.it/en/user/9355
ESEC/FSE - Journal First Presentation
14-18 November 2022, Singapore
2. OUTLINE
CONTEXT: user perception of risks when installing mobile apps
RESEARCH DESIGN: data collection, information extraction, and tools used
FINDINGS: answers to the three research questions
CONCLUSIONS and future research directions
5. PAST WORK
Factors affecting app success:
“Fault- and change-prone APIs can hinder the success of mobile apps”.
“High-rated apps have larger sizes, more complex code, more requirements on users, more marketing efforts, more dependence on libraries, and adopt higher quality Android APIs”.
“User reviews reporting bugs are negatively correlated with the rating, while reviews reporting feature requests are not”.
Vulnerabilities in mobile apps:
“Roughly 70% of free apps and roughly 50% of paid apps with vulnerabilities were vulnerable due to libraries”.
“For most vulnerability types, third-party code (including common libraries) represents the main carrier of app vulnerabilities”.
6. RESEARCH GOALS
‘‘Vulnerability-proneness: the number of different types of known security issues exhibited by the app’’
GOALS:
• Assess the vulnerability-proneness levels of mobile apps
• Evaluate the extent to which users can perceive vulnerability-proneness
RQs:
• RQ1: Which are the different vulnerabilities exhibited by Google market apps (belonging to different app categories)?
• RQ2: Does the vulnerability-proneness of Google market apps affect app success?
• RQ3: Is it possible to predict the level of vulnerability-proneness of an app by using the app’s contextual information?
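Under this definition, an app's vulnerability-proneness is the count of distinct known issue types it exhibits. A minimal sketch, assuming hypothetical AndroBugs-style warning labels (not actual study data):

```python
# Hypothetical AndroBugs-style warnings reported for a single app
# (illustrative labels only, not taken from the study's dataset)
warnings = [
    "SSL_Security: SSL Connection Checking",
    "SSL_Security: SSL Connection Checking",   # same issue type, reported twice
    "WebView: RCE Vulnerability Checking",
    "Implicit_Intent: Implicit Service Checking",
]

# Vulnerability-proneness = number of *distinct* known security issue types
vulnerability_proneness = len(set(warnings))
print(vulnerability_proneness)  # 3
```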
7. DATASET
• About 1,000 apps spanning
23 different Play Store’s
categories
• For each category we have
both low- and high-rated
apps
• We only considered apps
having a reliable number of
user votes
15. RQ1: Which are the different vulnerabilities exhibited by Google market apps (belonging to different app categories)?
Approach: compare the vulnerability-proneness of apps belonging to different app categories (statistical analysis + qualitative analysis)
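The statistical comparison described above (a non-parametric test plus an effect size, as reported in the next slide's table) could be sketched as follows. The two samples are made-up vulnerability-proneness counts, not study data, and in the full study the p-values are additionally adjusted for multiple comparisons:

```python
from itertools import product
from scipy.stats import mannwhitneyu

def cliffs_delta(a, b):
    """Cliff's d: P(x > y) - P(x < y) over all pairs (x, y) in a x b."""
    gt = sum(x > y for x, y in product(a, b))
    lt = sum(x < y for x, y in product(a, b))
    return (gt - lt) / (len(a) * len(b))

# Made-up per-app counts of distinct vulnerability types (illustrative only)
medical = [2, 3, 1, 4, 2, 3]
social = [6, 5, 7, 4, 6, 8]

stat, p = mannwhitneyu(medical, social, alternative="two-sided")
d = cliffs_delta(medical, social)
print(f"p-value: {p:.4f}, Cliff's d: {d:.4f}")
```

A negative d, as in the table, means the first group (Medical) tends to exhibit fewer vulnerability types.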
16. RQ1: Which are the different vulnerabilities exhibited by Google market apps (belonging to different app categories)?
Categories | adjusted p-value | Cliff's d
Medical - Communication | 0.0074 | -0.5143
Medical - Entertainment | 0.0011 | -0.5283
Medical - Food & Drink | 0.0023 | -0.6402
Medical - News & Magazines | 0.0026 | -0.5859
Medical - Social | 0.0081 | -0.5201
17. RQ1: Which are the different vulnerabilities exhibited by Google market apps (belonging to different app categories)?
• Medical apps exhibit fewer security flaws than other categories
• Finance and Shopping apps exhibit vulnerability-proneness levels similar to other categories
18. RQ1: Which are the different vulnerabilities exhibited by Google market apps (belonging to different app categories)?
Vulnerability | Communication | Entertainment | Food | Medical | News | Social
SSL Connection Checking | 100.00% | 97.17% | 100.00% | 84.85% | 100.00% | 96.72%
WebView RCE Vulnerability Checking (CVE-2013-4710) | 73.13% | 86.79% | 81.25% | 36.36% | 86.67% | 73.77%
Implicit Service Checking | 52.24% | 49.06% | 46.88% | 15.15% | 35.56% | 54.10%
App Sandbox Permission Checking | 17.91% | 27.36% | 37.50% | 24.24% | 31.11% | 27.87%
SSL Certificate Verification Checking | 16.42% | 18.87% | 21.88% | 6.06% | 40.00% | 14.75%
KeyStore Protection Checking | 13.43% | 9.43% | 31.25% | 9.09% | 35.56% | 19.67%
Runtime Command Checking | 34.33% | 13.21% | 21.88% | 12.12% | 31.11% | 9.84%
Fragment Vulnerability Checking (BID 64208, CVE-2013-6271) | 22.39% | 11.32% | 12.50% | 3.03% | 13.33% | 8.20%
AndroidManifest ContentProvider Exported Checking | 19.40% | 16.98% | 18.75% | 9.09% | 24.44% | 14.75%
SSL Implementation Checking (Verifying Host Name in Custom Classes) | 11.94% | 17.92% | 9.38% | 12.12% | 28.89% | 21.31%
SSL Implementation Checking (Verifying Host Name in Fields) | 5.97% | 8.49% | 12.50% | 9.09% | 26.67% | 11.48%
19. RQ2: Does the vulnerability-proneness of Google market apps affect app success?
App success proxies: average rating and number of downloads
Approach: compare apps with different levels of success
20. RQ2: Does the vulnerability-proneness of Google market apps affect app success?
No relation between vulnerability-proneness and app rating is observed
21. RQ2: Does the vulnerability-proneness of Google market apps affect app success?
Apps with a higher number of downloads tend to exhibit higher levels of vulnerability-proneness (statistically significant)
22. RQ2: Does the vulnerability-proneness of Google market apps affect app success?
Apps having a lower average rating tend to have a higher vulnerability-proneness density
Vulnerability-proneness density: “we divided the number of vulnerability warnings signaled by AndroBugs by the number of classes”
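Following the definition quoted above, the density metric is a simple normalization; a minimal sketch (the function name and example values are ours, not the paper's):

```python
def vulnerability_proneness_density(num_warnings: int, num_classes: int) -> float:
    """Number of AndroBugs vulnerability warnings divided by the number of classes."""
    if num_classes <= 0:
        raise ValueError("the app must contain at least one class")
    return num_warnings / num_classes

# Example: an app with 12 warnings across 300 classes
print(vulnerability_proneness_density(12, 300))  # 0.04
```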
23. RQ2: Does the vulnerability-proneness of Google market apps affect app success?
Perception vs. reality: users may not be aware of the risks they take when installing an app.
24. RQ3: Is it possible to predict the level of vulnerability-proneness of an app by using the app’s contextual information?
• App market metrics
• Textual features (app description)
• Static analysis features (number of libs, classes, interfaces, etc.)
25. RQ3: Is it possible to predict the level of vulnerability-proneness of an app by using the app’s contextual information? (low vs. high)
Considering only app market info:
Experiment 1 (app market features + text features)
Algorithm | Precision | Recall | F-Measure
J48 | 0.619 | 0.620 | 0.619
Random Forest | 0.660 | 0.660 | 0.658
Naive Bayes | 0.581 | 0.577 | 0.576
Experiment 2 (app market features)
Algorithm | Precision | Recall | F-Measure
J48 | 0.671 | 0.667 | 0.666
Random Forest | 0.730 | 0.728 | 0.728
Naive Bayes | 0.647 | 0.647 | 0.645
Considering also static analysis info:
Experiment 4 (app market features + text features + static analysis)
Algorithm | Precision | Recall | F-Measure
J48 | 0.664 | 0.665 | 0.664
Random Forest | 0.723 | 0.720 | 0.719
Naive Bayes | 0.592 | 0.590 | 0.590
Experiment 5 (app market features + static analysis)
Algorithm | Precision | Recall | F-Measure
J48 | 0.691 | 0.687 | 0.686
Random Forest | 0.760 | 0.751 | 0.751
Naive Bayes | 0.660 | 0.657 | 0.652
Random Forest outperforms the other ML algorithms
26. RQ3: Is it possible to predict the level of vulnerability-proneness of an app by using the app’s contextual information? (low vs. high)
Textual features introduce noise affecting the classification performance
27. RQ3: Is it possible to predict the level of vulnerability-proneness of an app by using the app’s contextual information? (low vs. high)
Considering only static analysis info:
Experiment 3 (static analysis features)
Algorithm | Precision | Recall | F-Measure
J48 | 0.726 | 0.712 | 0.709
Random Forest | 0.716 | 0.714 | 0.714
Naive Bayes | 0.660 | 0.652 | 0.643
Considering also app market info:
Experiment 5 (app market features + static analysis)
Algorithm | Precision | Recall | F-Measure
J48 | 0.691 | 0.687 | 0.686
Random Forest | 0.760 | 0.751 | 0.751
Naive Bayes | 0.660 | 0.657 | 0.652
App market metrics provide complementary information to that extracted from code.
28. CONCLUSIONS
• RQ1: Which are the different vulnerabilities exhibited by Google market apps (belonging to different app categories)?
• Almost all apps have known security defects
• Apps belonging to the Medical category exhibit fewer security flaws than apps in the other categories
29. CONCLUSIONS
• RQ2: Does the vulnerability-proneness of Google market apps affect app success?
• Vulnerability-proneness levels are not reflected in app ratings
• Vulnerability-proneness density levels are reflected in app ratings
• Popular apps tend to exhibit higher levels of vulnerability-proneness
30. CONCLUSIONS
• RQ3: Is it possible to predict the level of vulnerability-proneness of an app by using the app’s contextual information?
• App market information is useful to predict the vulnerability-proneness level of an app in about 3 out of 4 cases
• App market information can be used in addition to static analysis features to improve the prediction results
31. FUTURE WORK
• Survey app users
• to better understand how they deal with privacy and security concerns
• and to further validate our results
• Improve the prediction results
• Extract additional features from the app store (e.g., interactive elements, developer’s information, last update, compatible devices, etc.)
• Extract additional static analysis features (e.g., quality metrics, intents, etc.)
• Investigate feature weighting
• Investigate the possibility of predicting specific types of security defects in other domains
32. Exposed! On the Vulnerability-proneness of Google Play apps.
Andrea Di Sorbo, Sebastiano Panichella
https://spanichella.github.io/
https://www.unisannio.it/en/user/9355
ESEC/FSE - Journal First Presentation
14-18 November 2022, Singapore
https://link.springer.com/article/10.1007/s10664-021-09978-0
Thanks for your attention!
Editor's Notes
In particular, in this presentation, I will first introduce the CONTEXT of this study and the related literature.
Afterwards, I will state the goal, the research questions, and the analyses performed.
The achieved results and the answers to our research questions are then used to draw conclusions and outline future research directions.
Mobile applications are used for several everyday life activities, such as shopping, banking, social communications, and so on.
However, users share a lot of sensitive data to use these apps and recent research demonstrated that the majority of mobile applications present critical security defects.
In this study, we try to better understand whether, and to what extent, app users can perceive these security risks, and whether these risks can undermine the success of mobile apps.
Previous studies have explored some of the factors influencing app success, and mobile app vulnerabilities have been investigated from many research perspectives.
In particular, previous work observed that app success is related to the adoption of higher numbers of libraries,
but libraries also represent the main carriers of app vulnerabilities.
In our work, we define the vulnerability-proneness of an app as the number of different types of security issues that the app exhibits.
The underlying hypothesis is that a higher vulnerability-proneness may increase the probability of being attacked, as a wider attack surface is offered.
Thus, we (i) investigate the vulnerability-proneness of mobile apps belonging to different categories.
(ii) evaluate if users can perceive the risks of installing vulnerable apps
(iii) explore the extent to which the app-related information provided by the store can be used to predict the vulnerability-proneness levels of apps.
To carry out the study we extracted data from about one thousand apps spanning 23 different Play Store categories.
Note that for each category we have both low and high-rated apps.
For extracting information related to vulnerabilities, all the collected APKs were inspected with AndroBugs, a state-of-the-art vulnerability scanner.
In the next slides, I will also discuss the vulnerabilities that this tool is able to detect.
In addition, we extracted app metadata from the Google Play store using ad-hoc scripts and browser automation.
Once all the data were collected, we analyzed them with statistical tools and used them to train machine learning algorithms that predict the vulnerability-proneness levels of apps.
In the slide you can see an example of the report provided by AndroBugs, which marks each identified vulnerability with a type and a severity level.
AndroBugs is fast and accurate and can statically analyze APK files (without executing them).
It has been successfully used to find vulnerabilities in many popular Android apps, such as Facebook and Twitter.
As highlighted in the slide, AndroBugs can statically detect many different types of vulnerabilities:
such as (i) vulnerabilities that could be exploited for performing man-in-the-middle attacks,
(ii) vulnerabilities that could be exploited for code injection,
or (iii) vulnerabilities that may allow access to sensitive data.
We also extracted contextual information related to the apps and provided by the app store.
Some of these data, such as those related to the Permission, Monetization, and Richness-of-functionalities aspects, can be easily associated with security and privacy issues,
while we argue that those related to the Behavior and Success aspects need further investigation.
And now I will present the preliminary results we obtained to answer our research questions.
For each research question I will briefly discuss the analysis done and the findings.
For answering RQ1, we compared the vulnerability-proneness of apps belonging to different app categories through non-parametric statistical tests (as we deal with distributions that are not normal).
To corroborate the quantitative results, we also investigated in more depth the specific types of vulnerabilities detected in apps of different categories.
As evidenced in the slide, in terms of vulnerability-proneness, apps in the Medical category differ from apps in other categories, with statistical evidence and large effect sizes.
The good news is that Medical apps (which usually handle very sensitive information) tend to exhibit fewer security flaws than all the other considered categories.
The bad news is that we cannot say the same about, for example, Finance and Shopping apps, with which we usually share bank account details.
A confirmation of these results is given by the table in the slide, where we can see that apps belonging to the Medical category are more rarely affected by most of the recurrent vulnerability types.
To answer RQ2, as in previous work, we use two different proxy metrics for estimating app success:
average rating and number of downloads.
Thus, we compared the vulnerability-proneness of apps belonging to different rating and download groups.
As expected, no relation could be observed between app rating and app vulnerability-proneness.
A counterintuitive result is instead observed for the number of downloads: apps with a higher number of downloads tend to exhibit higher levels of vulnerability-proneness.
Normalized results can lead to different outcomes: as shown on the left side of the slide, apps having a lower average rating tend to have a higher vulnerability-proneness density.
To answer RQ3 and better understand whether the information provided by the app store can be used to predict the level of an app’s vulnerability-proneness,
we trained 3 different machine learning algorithms using different combinations of features, namely:
app market metrics (downloads, rating, and so on),
textual features extracted from app descriptions using text analysis techniques,
and simple static analysis features (such as the number of third-party libraries, the number of classes, and so on).
We then evaluated the classification performance of these algorithms in identifying apps with both low and high vulnerability-proneness levels.
The first result is that Random Forest is the best performing algorithm for this task.
The Random Forest algorithm trained with only app market information is effective in identifying low- or high-risk apps in about 3 out of 4 cases.
As expected, the best performance is achieved by using both app market and static analysis features.
Surprisingly, while textual features have been successfully used in bug prediction/classification tasks, in this context they seem to introduce noise that affects the classification results.
Thus, we can summarize the findings of this presentation as follows:
Almost all apps present known security defects,
but apps belonging to the medical category are less vulnerability-prone than apps in the other categories.
Vulnerability-proneness does not affect app ratings.
Moreover, more popular apps tend to exhibit higher levels of vulnerability-proneness.
App market data could provide useful information to predict, at early stages, the vulnerability-proneness level of an app.
Such information can be complementary to metrics related to app code.
Textual descriptions, instead, do not provide useful information for this task.
In the future we plan to explore several research directions.
In particular, we want to survey app users to better understand how they deal with privacy and security concerns.
We want to improve the prediction results by considering additional features and tuning,
and we also want to investigate the possibility of predicting specific types of security defects.