Abstract
This short empirical paper investigates a snapshot of about two million files from a continuously updated big data collection maintained by F-Secure for security intelligence purposes. By further augmenting the snapshot with open data covering about a half of a million files, the paper examines two questions: (a) what is the shape of a probability distribution characterizing the relative share of malware files to all files distributed from web-facing Internet domains, and (b) what is the distribution shaping the popularity of malware files? A bimodal distribution is proposed as an answer to the former question, while a graph theoretical definition for the popularity concept indicates a long-tailed, extreme value distribution. With these two questions - and the answers thereto, the paper contributes to the attempts to understand large-scale characteristics of malware at the grand population level - at the level of the whole Internet.
Original language | English |
---|---|
Title of host publication | Proceedings - 2016 European Intelligence and Security Informatics Conference, EISIC 2016 |
Publisher | IEEE |
Pages | 144-147 |
Number of pages | 4 |
ISBN (Electronic) | 9781509028566 |
DOIs | |
Publication status | Published - 2 Mar 2017 |
MoE publication type | A4 Conference publication |
Event | European Intelligence and Security Informatics Conference - Uppsala, Sweden Duration: 17 Aug 2016 → 19 Aug 2016 Conference number: 7 |
Conference
Conference | European Intelligence and Security Informatics Conference |
---|---|
Abbreviated title | EISIC |
Country/Territory | Sweden |
City | Uppsala |
Period | 17/08/2016 → 19/08/2016 |
Keywords
- Malware
- Security intelligence
- Web crawling