Joint Cache Placement and Delivery Design using Reinforcement Learning for Cellular Networks

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Abstract

We consider a reinforcement learning (RL) based joint cache placement and delivery (CPD) policy for cellular networks with limited caching capacity at both Base Stations (BSs) and User Equipments (UEs). The dynamics of file preferences of users is modeled by a Markov process. User requests are based on current preferences, and on the content of the user’s cache. We assume probabilistic models for the cache placement at both the UEs and the BSs. When the network receives a request for an un-cached file, it fetches the file from the core network via a back-haul link. File delivery is based on network-level orthogonal multipoint multicasting transmissions. For this, all BSs caching a specific file transmit collaboratively in a dedicated resource. File reception depends on the state of the wireless channels. We design the CPD policy while taking into account the user Quality of Service and the back-haul load, and using an Actor-Critic RL framework with two neural networks. Simulation results are used to show the merits of the devised CPD policy.
Original languageEnglish
Title of host publicationIEEE Vehicular Technology Conference
PublisherIEEE
Number of pages6
Publication statusAccepted/In press - 10 Dec 2020
MoE publication typeA4 Article in a conference publication
EventIEEE Vehicular Technology Conference - Helsinki, Finland
Duration: 25 Apr 202128 Apr 2021
Conference number: 93

Publication series

NameIEEE Vehicular Technology Conference
ISSN (Print)1090-3038
ISSN (Electronic)2577-2465

Conference

ConferenceIEEE Vehicular Technology Conference
Abbreviated titleVTC-Spring
CountryFinland
CityHelsinki
Period25/04/202128/04/2021

Keywords

  • Wireless caching
  • reinforcement learning
  • Actor-Critic

Fingerprint Dive into the research topics of 'Joint Cache Placement and Delivery Design using Reinforcement Learning for Cellular Networks'. Together they form a unique fingerprint.

Cite this