framework

Framework

The Framework has been realized with the aim of creating an environment where to implement and test varios load balancing strategies.

An high-level overview of the Framework modules, inputs and output is reported in the following image. This picture represent the UML Component Diagram of the simulator. In blu are highlighted main modules, in green their output artifacts and in orange the required files from the nodes modelling work.

The simulation takes place in steps, also called simulation instants; each of them corresponds to a different situation or configuration that must be simulated in terms of traffic to be balanced.

Main components

Instance Generator

The purpose of this component is to generate the instance on which the simulation will be performed. Specifically, information regarding the peer-to-peer node network and the load that each node will have to handle for each simulation instance is generated in this phase. This script take 3 CLI arguments:

Nodes number: Number of nodes to simulate;
Edge probability: Probability to generate an edge between each pair of nodes;
Max percentage of overloaded nodes: A percentage indicating the max number of nodes to be in overload during the simulation
Seed: Seed used as starting point for pseudo-random generation process.

The artifact produced by this element are a generated json instance and an image of corresponding network topology. Both this elements can be found in output/archive/[timestamp].zip output directory.

Model

The main objective of the model component is to provide a representation of the metrics prediction models within the Framework.

The implementation of the component was done by creating two classes:

Model: this class is used to represent models within the System. It has five private attributes that allow it to uniquely identify a particular model;
ModelProxy: this class serves as an access point for all components of the System that need to obtain predictions.

In the following picture a UML class diagram of this component is reported.

Simulator

This component is concerned to execute all steps of simulation, calculating load balancing weights of each agent towards other agents of the network, using different techniques, and calculating forwarding tables. In this phase is executed simulation.py script.

It accepts as input a single parameter, namely Model type, that allows to specify which model to use among standard regression, 5% quantile regression, and 95% quantile regression.

This component is the main part of the Framework package and uses a various number of other components. In the following picture a UML sequence diagram of simulator main operations is reported.

The simulator, reading an instance file, need to retrieve the metrics predictions of the current instant. This data are provided by the Model component, which requires as input the expected load values from the nodes in the network.

Each agent, during simulation, execute different load balancing strategies, providing the ability to use the metrics prediction models. How this classes are organized is represented in the following class diagram.

Analyzer

This component analyze Simulator output files and calculate indexes used for comparison between different strategies. Like the Simulator, this module also makes use of models for its own purposes; for this reason, the type of model to be used (5% quantile regression, regression or 95% quantile regression) can be provided as input, which should reflect the choice made in the Simulator.

Main index calculated are:

Success rate: Percentage of requests managed with success by the system, applaying a specific load balancing technique;
Success rate during stress period: Percentage of requests managed with success by the system, applaying a specific load balancing technique, when the load is higher;
Mean Node Power Consumption: this index is intended to allow comparison on the average power consumption among different strategies;
Max Node Power Consumption: represents the maximum consumption value estimated on the nodes in the network;
Reject number: Number of rejected request by the system.

This script export a CSV file containing mean and median of these indexex. It also export comparison simulation graph that compares success rates of different startegies during simulation (see figure below).

Simulation controller

Using this component to run a simulation simplifies the experimental phase because it automates the sequential execution of internal modules that would otherwise have to be run manually.

Acting as an entry point for the components contained within it, the Simulation Controller is set up to be able to handle all the parameters in those modules, with some differences:

The Simulation Controller is capable of receive as input one or more values of Max percentage of overloaded nodes, so that the same instance can be evaluated on different values of load;
The Simulation Controller allows you to specify an additional parameter, the number of experiments. This will generate and simulate n scenarios, and then then aggregate the indices obtained in the various experiments into unique values that can provide much more precise indications of performance. If number of experiments is greater than one, than values for the seed will not be accepted by the Simulation Controller.

In the following picture a UML sequence diagram of Simulation Controller main operations is reported.

All artifacts produced by this process are gathered in a zip file as name the timestamp, under output directory.

Configuration Manager

This utility class (configuration/configuration_manager.py) contains all paramter, variables, strategy names, and settings of simulator. It also include the definitions of the parameters used by the Node Margin Strategy and the Power Saving Strategy.

Note: is an algorithm is added to simulator or some paramters need to change, they must be changed in this configuration class.

Implemented Strategies

During simulation are executed all load balancing strategies defined in behaviour module, and added in configuration/configuration_manager.py class.

The implemented strategies are:

Base Strategy. This strategy does not forward traffic, each agent use only its resources. This strategy is implemented in behaviour/base_strategy.py;
Equal Strategy. This strategy shares excess traffic equally among neighbors. This strategy is implemented in behaviour/equal_strategy.py;
Node Margin Strategy. This is the main distribuited load balancing algorithm. With this strategy each node balances its load according to the margin offered by its neighborhood. This strategy is implemented in behaviour/node_margin_strategy.py;
Power Saving Strategy. declination of the Node Margin Strategy that allows a limit to be placed on energy consumption. This strategy is implemented in behaviour/power_saving_strategy.py.

All of the strategies returns as output a set of weights for each function from each agent towards others

How to add other strategies

To add a new strategy it is only necessary create a new file in behaviour module, and the new class must implement the behaviour/strategy.py "interface". New class must implement run method and return a dictionary containing forwarding weights.

It is also necessary add new strategy name in configuration/configuration_manager.py file and add a case for strategy creation in factory/strategy_factory.py class.

How to execute a simulation

For execute a simulation using simulation controller as entry point, first move on simulation folder from the project root.

cd simulation

After that, type and execute the following command:

python simulation_controller.py --nodesnum [integer number] --edgeprob [float probability between 0 and 1] --overloaded [one or more values of percentage] --expnum [integer number] --modeltype [a string among regression, quantile005, quantile095] --seed [integer number]

Note: All parameters are optional.

Example:

python simulation_controller.py --nodesnum 10 --edgeprob 0.3 --overloaded 30 60 --expnum 1 --modeltype regression --seed 701

Note: If expnum is greater than 1, than seed cannot be specified:

python simulation_controller.py --nodesnum 10 --edgeprob 0.3 --overloaded 30 60 --expnum 10 --modeltype regression

It is also possible to execute single scripts:

python instance_generator.py --nodesnum 5 --edgeprob 0.3 --overloaded 40 --seed 711

python simulation.py --modeltype regression

python analyzer.py --modeltype regression

License

The source code in this repository is licensed under the GNU Affero General Public License (AGPL), version 2.0 or later. See the LICENSE file for more information.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

The complete list of The DFaaS Authors can be fond in the AUTHORS file or in the contributors page on the DFaaS GitHub repository.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Framework

Main components

Instance Generator

Model

Simulator

Analyzer

Simulation controller

Configuration Manager

Implemented Strategies

How to add other strategies

How to execute a simulation

License

Name		Name	Last commit message	Last commit date
parent directory ..
agent		agent
behaviour		behaviour
cli		cli
configuration		configuration
factory		factory
model		model
utils		utils
README.md		README.md
__init__.py		__init__.py
analyzer.py		analyzer.py
instance_generator.py		instance_generator.py
simulation.py		simulation.py
simulation_controller.py		simulation_controller.py

FilesExpand file tree

framework

Directory actions

More options

Directory actions

More options

Latest commit

History

framework

Folders and files

parent directory

README.md

Framework

Main components

Instance Generator

Model

Simulator

Analyzer

Simulation controller

Configuration Manager

Implemented Strategies

How to add other strategies

How to execute a simulation

License