Analyzing and visualizing web server access log file

Tri Nguyen*, Thanh Dang Diep, Hoang Vinh Tran, Takuma Nakajima, Nam Thoai

*Corresponding author for this work

Research output: Contribution to conference › Paper › Scientific › peer-review


Websites have multiplied over recent decades, and the number of visitors has kept pace, producing huge volumes of data. These data are believed to contain hidden knowledge of value to activities such as e-Business, e-CRM, e-Services, e-Newspapers, e-Government, and Digital Libraries. Web usage mining is the process applied to extract that knowledge from web data efficiently. In this paper, we use web usage mining to uncover interesting patterns in a web server access log file gathered from Ho Chi Minh City University of Technology (HCMUT) in Vietnam. Moreover, we propose a novel model that constructs new attributes, namely country, province (or city), and Internet Service Provider (ISP), from the existing IP attribute and adds them to the data. The model is a form of attribute construction (or feature construction), one of the strategies of data transformation, which is itself a data pre-processing technique. Applying this mining process yields broad knowledge of user access patterns for each country, province, and ISP. Such knowledge can be leveraged to optimize system performance and enhance personalization, and it can also help in deciding reasonable caching policies for web proxies.
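The attribute-construction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' model: it assumes the log follows the Common Log Format (the abstract does not state the format), and it replaces a real geolocation/ISP database with a tiny hypothetical lookup table, `GEO_TABLE`, whose prefixes come from reserved documentation address ranges. The names `construct_attributes` and `parse_line` and the sample log lines are likewise invented for illustration.

```python
import ipaddress
import re
from collections import Counter

# Hypothetical prefix table standing in for a real geolocation/ISP database.
# The prefixes are reserved documentation ranges, and the country, province
# and ISP values are made up for this example.
GEO_TABLE = [
    (ipaddress.ip_network("192.0.2.0/24"),
     ("Vietnam", "Ho Chi Minh City", "ExampleNet")),
    (ipaddress.ip_network("198.51.100.0/24"),
     ("Japan", "Tokyo", "SampleISP")),
]

UNKNOWN = {"country": "unknown", "province": "unknown", "isp": "unknown"}

# Assumption: entries follow the Common Log Format.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+)'
)


def construct_attributes(ip):
    """Derive country, province and ISP attributes from an IP string."""
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        # The host field may hold a hostname instead of an IP address.
        return dict(UNKNOWN)
    for network, (country, province, isp) in GEO_TABLE:
        if addr in network:
            return {"country": country, "province": province, "isp": isp}
    return dict(UNKNOWN)


def parse_line(line):
    """Parse one log line and attach the constructed attributes."""
    match = LOG_PATTERN.match(line)
    if match is None:
        return None  # skip malformed entries
    record = match.groupdict()
    record.update(construct_attributes(record["ip"]))
    return record


# Made-up sample entries, not taken from the HCMUT log.
SAMPLE_LOG = [
    '192.0.2.10 - - [28/Nov/2018:10:00:00 +0700] "GET /index.html HTTP/1.1" 200 1024',
    '198.51.100.7 - - [28/Nov/2018:10:00:05 +0700] "GET /news HTTP/1.1" 200 2048',
    '192.0.2.44 - - [28/Nov/2018:10:00:09 +0700] "GET /index.html HTTP/1.1" 304 -',
    '203.0.113.5 - - [28/Nov/2018:10:00:12 +0700] "GET /about HTTP/1.1" 200 512',
]

records = [r for r in map(parse_line, SAMPLE_LOG) if r is not None]

# Simple per-country and per-ISP access counts over the enriched records.
requests_per_country = Counter(r["country"] for r in records)
requests_per_isp = Counter(r["isp"] for r in records)
```

Once each record carries the constructed attributes, per-country, per-province, or per-ISP aggregation reduces to grouping on the new fields, as the two `Counter` lines show.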
Original language: English
Number of pages: 19
Publication status: Published - 28 Nov 2018
MoE publication type: Not Eligible

