Solving Bin Packing Problem on Distributed Systems

Simple Wikipedia Definition:

In the bin packing problem, items of different volumes must be packed into a finite number of bins or containers each of a fixed given volume in a way that minimizes the number of bins used. In computational complexity theory, it is a combinatorial NP-hard problem. The decision problem (deciding if items will fit into a specified number of bins) is NP-complete.

We aproached this problem on distributed system with our own algortihm. Solving bin packing problem on big input data is challange and while doing so one must consider efficiency of algorthim and running time for the program.

While solving this problem on distributed systems we created an algorithm to minimize side effects of adding new worker nodes(computers) to the system.

Requirements:

Python 3.8+ : Download.
pip Package Manager: Download.
Windows 8, 10 or Linux Based System
Spark&Hadoop : Download.

Installation

Installing Packages:

Make sure you have at least python 3.8 installed on your machine and your pip is up to date.

	$ python --version
	$ pip --version

Clone this project to somewhere on your system and create virtual environment. How to create virtual environment.
After activating virtual environment, download and install packages from requirements.txt with:

	$ pip install -r requirements.txt

Installation for Spark on Windows can be found on this link: https://aamargajbhiye.medium.com/apache-spark-setup-a-multi-node-standalone-cluster-on-windows-63d413296971

Setting up Spark and Changing Connection Strings

Our implementation for algrotihm is located at BinPackingDistributed.py. You will need to configure connection string after setting up spark master node and worker nodes.

spark = SparkSession.builder.appName("BinPackingDistributed ").master("spark://192.168.0.10:7077").getOrCreate()

Master function must take your spark url for master node.

Name		Name	Last commit message	Last commit date
Latest commit Cannot retrieve latest commit at this time. History 12 Commits
.gitignore		.gitignore
BinPackingDistributed-OldImplementation.py		BinPackingDistributed-OldImplementation.py
BinPackingDistributed.py		BinPackingDistributed.py
README.md		README.md
SingleMachine-OrTools.py		SingleMachine-OrTools.py
SingleMachineFirstFit.py		SingleMachineFirstFit.py
randomNumber.py		randomNumber.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Solving Bin Packing Problem on Distributed Systems

Requirements:

Installation

Installing Packages:

Setting up Spark and Changing Connection Strings

About

Releases

Packages

Contributors 2

Languages

utkuc/BinPackingDistributed

Folders and files

Latest commit

History

Repository files navigation

Solving Bin Packing Problem on Distributed Systems

Requirements:

Installation

Installing Packages:

Setting up Spark and Changing Connection Strings

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages