Code Between the Lines: Semantic Analysis of Android Applications

Johannes Feichtner; Stefan Gruber

doi:10.1007/978-3-030-58201-2_12

Conference Papers, Year : 2020

Johannes Feichtner (1, 2), Stefan Gruber (1)

1 : Institute of Applied Information Processing and Communications [Graz]
2 : Secure Information Technology Center

Abstract

Static and dynamic program analysis are the key concepts researchers apply to uncover security-critical implementation weaknesses in Android applications. As it is often not obvious in which context problematic statements occur, it is challenging to assess their practical impact. While some flaws may turn out to be bad practice but not undermine the overall security level, others could have a serious impact. Distinguishing them requires knowledge of the designated app purpose.

In this paper, we introduce a machine learning-based system that is capable of generating natural language text describing the purpose and core functionality of Android apps based on their actual code. We design a dense neural network that captures the semantic relationships of resource identifiers, string constants, and API calls contained in apps to derive a high-level picture of implemented program behavior. For arbitrary applications, our system can predict precise, human-readable keywords and short phrases that indicate the main use-cases apps are designed for.

We evaluate our solution on 67,040 real-world apps and find that with a precision between 69% and 84% we can identify keywords that also occur in the developer-provided description in Google Play. To avoid incomprehensible black box predictions, we apply a model-explaining algorithm and demonstrate that our technique can substantially augment inspections of Android apps by contributing contextual information.
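The abstract does not spell out the network layout or the exact evaluation procedure, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that an app is reduced to a binary bag of code tokens (resource identifiers, string constants, API calls), that a small fully connected network maps this vector to per-keyword scores, and that precision is the share of predicted keywords that also appear in the store description. The vocabulary, keyword list, weights, threshold, and all function names are hypothetical and hand-set for illustration; a real system would learn the weights from data.

```python
import math


def featurize(tokens, vocabulary):
    """Binary bag-of-tokens vector over a fixed (hypothetical) vocabulary."""
    present = set(tokens)
    return [1.0 if term in present else 0.0 for term in vocabulary]


def relu(z):
    return max(0.0, z)


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def dense(inputs, weights, biases, activation):
    """One fully connected layer: activation(W x + b), one row of W per unit."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def predict_keywords(tokens, vocabulary, layers, keywords, threshold=0.5):
    """Run the token vector through all layers and keep keywords whose
    output score reaches the threshold (multi-label prediction)."""
    x = featurize(tokens, vocabulary)
    for weights, biases, activation in layers:
        x = dense(x, weights, biases, activation)
    return [kw for kw, score in zip(keywords, x) if score >= threshold]


def keyword_precision(predicted, description):
    """Fraction of predicted keywords that occur as words in the description.
    Assumed definition; the paper may match keywords differently."""
    if not predicted:
        return 0.0
    description_words = set(description.lower().split())
    hits = sum(1 for kw in predicted if kw.lower() in description_words)
    return hits / len(predicted)


# Hypothetical toy setup: three code tokens, two output keywords.
VOCABULARY = [
    "android.location.LocationManager",  # API call
    "R.string.map_title",                # resource identifier
    "javax.crypto.Cipher",               # API call
]
KEYWORDS = ["maps", "encryption"]

# Hand-set weights (NOT trained): hidden unit 0 reacts to location/map
# tokens, hidden unit 1 to crypto tokens; each output reads one hidden unit.
LAYERS = [
    ([[1.0, 1.0, 0.0], [0.0, 0.0, 1.0]], [0.0, 0.0], relu),
    ([[4.0, 0.0], [0.0, 4.0]], [-2.0, -2.0], sigmoid),
]

app_tokens = ["android.location.LocationManager", "R.string.map_title"]
predicted = predict_keywords(app_tokens, VOCABULARY, LAYERS, KEYWORDS)
precision = keyword_precision(predicted, "Offline maps and navigation")
```

In this toy setup only the location-related tokens are present, so the network scores "maps" above the threshold and "encryption" below it. Treating each keyword as an independent sigmoid output is one common way to model multi-label prediction; whether the paper uses this exact output layer is an assumption here.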

Domains

Computer Science [cs]

Main file

497034_1_En_12_Chapter.pdf (535.27 KB)

Origin : Files produced by the author(s)

https://inria.hal.science/hal-03440841

Submitted on : Monday, November 22, 2021, 3:33:34 PM

Last modification on : Monday, November 22, 2021, 4:37:41 PM

Long-term archiving on : Wednesday, February 23, 2022, 7:59:13 PM

Dates and versions

hal-03440841 , version 1 (22-11-2021)

Licence

Attribution

Identifiers

HAL Id : hal-03440841 , version 1
DOI : 10.1007/978-3-030-58201-2_12

Cite

Johannes Feichtner, Stefan Gruber. Code Between the Lines: Semantic Analysis of Android Applications. 35th IFIP International Conference on ICT Systems Security and Privacy Protection (SEC), Sep 2020, Maribor, Slovenia. pp.171-186, ⟨10.1007/978-3-030-58201-2_12⟩. ⟨hal-03440841⟩

Collections

IFIP IFIP-AICT IFIP-TC IFIP-TC11 IFIP-SEC
