Efficient Methods on Reducing Data Redundancy in the Internet

Research output: Thesis › Doctoral Thesis › Collection of Articles

Standard

Efficient Methods on Reducing Data Redundancy in the Internet. / Saha, Sumanta.

Aalto University, 2015. 148 p.

Harvard

Saha, S 2015, 'Efficient Methods on Reducing Data Redundancy in the Internet', Doctor's degree, Aalto University.

Vancouver

Saha S. Efficient Methods on Reducing Data Redundancy in the Internet. Aalto University, 2015. 148 p. (Aalto University publication series DOCTORAL DISSERTATIONS; 153).

Author

Saha, Sumanta. / Efficient Methods on Reducing Data Redundancy in the Internet. Aalto University, 2015. 148 p.

BibTeX

@phdthesis{d597fe37804d438a9a57c1c5c806cfd4,
title = "Efficient Methods on Reducing Data Redundancy in the Internet",
abstract = "The transformation of the Internet from a client-server based paradigm to a content-based one has left many of the fundamental network designs outdated. The increase in user-generated content, instant sharing, flash popularity, etc., calls for an Internet that is ready for these trends and can handle the needs of small-scale content providers. The Internet, as of today, carries and stores a large amount of duplicate, redundant data, primarily due to a lack of duplication detection mechanisms and caching principles. This redundancy costs the network in different ways: it consumes energy from the network elements that need to process the extra data; it makes the network caches store duplicate data, thus causing the tail of the data distribution to be swapped out of the caches; and it increases the load on the content servers, as they always have to serve the less popular content. In this dissertation, we have analyzed the aforementioned phenomena and proposed several methods to reduce the redundancy of the network at a low cost. The proposals take different approaches, including data chunk level redundancy detection and elimination, rerouting-based caching mechanisms in information-centric networks, and energy-aware content distribution techniques. Using these approaches, we have demonstrated how redundancy elimination can be performed with low overhead and low processing power. We have also demonstrated that local or global cooperation methods can increase the storage efficiency of the existing caches many-fold. In addition, this work shows that collaborative content download mechanisms can remove a sizable amount of traffic from the core network while simultaneously reducing the energy consumption of client devices.",
keywords = "cache, redundancy, energy, ICN, Internet",
author = "Sumanta Saha",
year = "2015",
language = "English",
isbn = "978-952-60-6421-5",
series = "Aalto University publication series DOCTORAL DISSERTATIONS",
publisher = "Aalto University",
number = "153",
school = "Aalto University",

}

RIS

TY - THES

T1 - Efficient Methods on Reducing Data Redundancy in the Internet

AU - Saha, Sumanta

PY - 2015

Y1 - 2015

N2 - The transformation of the Internet from a client-server based paradigm to a content-based one has left many of the fundamental network designs outdated. The increase in user-generated content, instant sharing, flash popularity, etc., calls for an Internet that is ready for these trends and can handle the needs of small-scale content providers. The Internet, as of today, carries and stores a large amount of duplicate, redundant data, primarily due to a lack of duplication detection mechanisms and caching principles. This redundancy costs the network in different ways: it consumes energy from the network elements that need to process the extra data; it makes the network caches store duplicate data, thus causing the tail of the data distribution to be swapped out of the caches; and it increases the load on the content servers, as they always have to serve the less popular content. In this dissertation, we have analyzed the aforementioned phenomena and proposed several methods to reduce the redundancy of the network at a low cost. The proposals take different approaches, including data chunk level redundancy detection and elimination, rerouting-based caching mechanisms in information-centric networks, and energy-aware content distribution techniques. Using these approaches, we have demonstrated how redundancy elimination can be performed with low overhead and low processing power. We have also demonstrated that local or global cooperation methods can increase the storage efficiency of the existing caches many-fold. In addition, this work shows that collaborative content download mechanisms can remove a sizable amount of traffic from the core network while simultaneously reducing the energy consumption of client devices.

KW - cache

KW - redundancy

KW - energy

KW - ICN

KW - Internet

M3 - Doctoral Thesis

SN - 978-952-60-6421-5

T3 - Aalto University publication series DOCTORAL DISSERTATIONS

PB - Aalto University

ER -

ID: 18374884