FONPR: First Open Network Pattern Reactors. Reinforcement Learning agents for fine tuning telecom networks.

Quick Deployment

DQN agent deployment:

kubectl create -f https://raw.githubusercontent.com/DISHDevEx/fonpr/main/deployment/manifest_dqn_agent.yml

BBO agent deployment:

kubectl create -f https://raw.githubusercontent.com/DISHDevEx/fonpr/main/deployment/manifest_bbo_agent.yml

V0 agent deployment:

kubectl create -f https://raw.githubusercontent.com/DISHDevEx/fonpr/main/deployment/manifest_v0_agent.yml

Docker Repo: https://hub.docker.com/r/teamrespons/respons_agent/tags

First Open Network Pattern Reactors guide:

This readme is intended to provide background on First Open Network Pattern Reactors and their deployment.

Contents:

Agent

Advisor

Action Handler

Docker

Agent Deployment

Test Runbook

1. Agent

An Agent is responsible for implementing a policy, i.e. mapping observed system state to desired control actions. The policy can be informed by subject matter experts, or learned independently by a reinforcement learning (RL) algorithm.

All agents currently ingest data via prometheus server, and take actions against a yml file that controls the target application.

General usage:

Agent lives as a script in the agent.py file.
The Agent script is run automatically on deployment in the network cluster as a containerized application, and executes its logic at regular intervals.
The Agent utilizes an Advisor function to set up a connection with the data source, and ingest data.
The Agent executes policy logic and updates cluster (Helm) configuration files in github via the Action Handler.

V0 agent:

Modify parameters in agent_v0.py for custom deployment

The primary functionality for the V0 agent is to use hueristics in order to update limits and requests for AMF.
V0 agent allows for improved Kube-Scheduling.
Inputs: Max CPU for AMF, Avg. CPU for AMF, Max Memory for AMF, Avg. Memory for AMF.
Outputs: Update yml file limits and requests for AMF pods.

BBO agent:

Modify parameters in agent_bbo.py for custom deployment

Google Vizier Library
BBO agent treats the system as a black box. It allows for efficient search of paremeters to optimize a function. It does not understand the function.
BBO is aware of X and Y of a function mapping via system: X->system->Y.
The X are the paremters the BBO Agent can modify.
The Y is the reward the BBO agent recieves after making its actions and allowing the actions to manifest in the system.
The current algorithm underneath the BBO agent is Gaussian Process Optimization.
Inputs BBO: Profit = SLO Price - Infra Cost
Ouputs BBO: UPF Node Sizing

DQN agent:

Modify parameters in agent_dqn.py for custom deployment

Tensorflow Library
7 x 20 x 20 x 2 Fully Connected Nueral Network.
Uses replay buffer for training.
DQN
- Inputs: Action, Observation, Reward, Discount, Next Step Type, Policy Info, Current Step Type.
- Outputs: Q-Value (maximum expected reward) for taking a small sizing action or large sizing action.
The agent itself outputs a modification of UPF Node Sizing

2. Advisor

An Advisor is responsible for connecting with a data source, ingesting data, and preprocessing / filtering that data prior to handing it off to the Agent.

The Prometheus based advisor to send queries to a Prometheus server.

To target the server, the ip address and port number can be found as follows:

ip:port found at
-  AWS → management console → EKS → clusters → resources tab → service and networking tab → endpoints → filter for Prometheus → Prometheus server endpoint

3. Action Handler

An Action Handler is responsible for taking the requested cluster configuration updates (actions) and update the controlling configuration file accordingly.

The current architecture leverages GitHub for revision control and housing of the cluster configuration files. When a config file is updated, it triggers redeployment of the network cluster via Flux.

PLEASE NOTE OUR SECRET IS NOT PUBLIC, PLS MODIFY WITH YOUR OWN SECRET MANAGEMENT STRATEGY

General usage:

The ActionHandler class takes in a GitHub token, the target file path within the repository, branch name, and a dictionary of agent-requested value updates.
The current version of the value file is fetched from GitHub, updated with the new values, and then pushed back to the repository, triggering a new cluster deployment.

4. Docker

The Agent and its helper functions are containerized using Docker.

Repo:https://hub.docker.com/r/teamrespons/respons_agent/tags

To pull docker image from registry:

docker pull -t <imagename>:<version> . 

# e.g.
docker pull -t teamrespons/respons_agent:v0.0 .

To run docker image locally as a container:
```
docker run <imageid>
```

To create new images and contribute them:

To Build docker image from an updated Dockerfile

docker build -t teamrespons/respons_agent:<tagname> -f <dockerfile name> .  

# e.g. 
docker build -t teamrespons/respons_agent:v0-agent -f Dockerfile_V0 .

To run docker image locally as a container
```
docker run <imageid>
```
To push docker image to dockerhub under the response-ml
```
docker push teamrespons/respons_agent:<tagname>
```

5. Agent Deployment

Pre-Requisites:

Set up your machine with the following CLI tools:

AWS CLI

Kubectl

Helm
Set up your local AWS CLI Environment Variables for an account that has access to the EKS cluster:

export AWS_ACCESS_KEY_ID=""
export AWS_SECRET_ACCESS_KEY=""
export AWS_SESSION_TOKEN=""

Update local kubectl config file:

aws eks --region <region> update-kubeconfig --name <clustername>

Deployment:

Update deployment/respons_agent_manifest.yml

Update in the yaml file to specify which image you want deployed into the cluster.
```
  "file image: teamrespons/respons_agent:version"
```

Agent deployments

DQN agent deployment:

kubectl create -f https://raw.githubusercontent.com/DISHDevEx/fonpr/main/deployment/manifest_dqn_agent.yml

BBO agent deployment:

kubectl create -f https://raw.githubusercontent.com/DISHDevEx/fonpr/main/deployment/manifest_bbo_agent.yml

V0 agent deployment:

kubectl create -f https://raw.githubusercontent.com/DISHDevEx/fonpr/main/deployment/manifest_v0_agent.yml

Name		Name	Last commit message	Last commit date
Latest commit History 368 Commits
.github/workflows		.github/workflows
deployment		deployment
fonpr		fonpr
tests		tests
.gitignore		.gitignore
.pylintrc		.pylintrc
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
Dockerfile_BBO		Dockerfile_BBO
Dockerfile_DQN		Dockerfile_DQN
Dockerfile_SAC		Dockerfile_SAC
Dockerfile_V0		Dockerfile_V0
LICENSE.txt		LICENSE.txt
README.md		README.md
requirements.txt		requirements.txt
requirements_bbo.txt		requirements_bbo.txt
requirements_dqn.txt		requirements_dqn.txt
requirements_sac.txt		requirements_sac.txt
requirements_v0.txt		requirements_v0.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FONPR: First Open Network Pattern Reactors. Reinforcement Learning agents for fine tuning telecom networks.

Quick Deployment

First Open Network Pattern Reactors guide:

1. Agent

2. Advisor

3. Action Handler

4. Docker

5. Agent Deployment

About

Releases

Packages

Contributors 4

Languages

License

DISHDevEx/fonpr

Folders and files

Latest commit

History

Repository files navigation

FONPR: First Open Network Pattern Reactors. Reinforcement Learning agents for fine tuning telecom networks.

Quick Deployment

First Open Network Pattern Reactors guide:

1. Agent

2. Advisor

3. Action Handler

4. Docker

5. Agent Deployment

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages