Skip to content

DecisionsDev/simulatte

SimuLatte

This projects aims to build confidence in your decision automation through simulations.

Motivations

  • Trust your decision automation on large datasets coming from your production warehouse or synthetic.
  • How to execute a large dataset (millions to billions) of automated decisions based on business rules and machine learning.
  • How to apply analytics on these large decision sets to get business insights by computing KPIs and visualizing the results.
  • How can we navigate with ease in a decision set to search for specific decisions?
  • How to compose analytical dashboards for tour decision automation?
  • How to empower champion/challenger comparisions for automated decision services?

Simulatte combines:

  • a Spark based execution invoker to read a JSON request dataset and automate at scale the business decisions, and write them with their traces,
  • an analytics post-processing for computing metrics, KPIs and visualizations through Python and Jupyter notebooks.
  • a decision automation delegated to IBM Automation Decision Services(ADS) or IBM Operational Decision Services(ODM), 2 capabilities of the Cloud Pak for Business Automation. These 2 capabilities run an inference rule engine and ADS allows to call any ML model to bring probabilities in the decision making.

You can use Simulatte in an online mode through REST calls on microservices. Additionaly you can submit the decision invoker in a Spark batch to run in an Hadoop Spark cluster for maximum performances.

Flavors

The simulation frameworks delivers a matrix of modes on the following dimensions:

  • online vs offline execution
  • remote REST vs embedded Java calls for the decision invocation,
  • basic Java single process single threaded, or Spark local, Spark cluster for full power,

How to use the project

Setup for docker use

Clone the repository: By using http protocol

git clone https://github.com/DecisionsDev/simulatte.git

or ssh protocol

git clone git@github.com:DecisionsDev/simulatte.git

Move to the project directory.

cd simulatte

Configure Decision Service invocation

SimuLatte comes by default with a standalone ODM Docker container. To run simulations with IBM ADS or another ODM setup please follow the following configuration steps:

Run

Run docker compose commands

docker compose build 

Note: If you don't configure an ADS connection then you will face the following error:

ERROR [simulatte/simulation-service:latest stage-1 3/3] COPY --from=builder /builder/simulatte-online/t  0.0s

Reason is that the ADS Java libraries are missing to build simulatte. Neverthess SimuLatte is good to manage simulations through remote invocations with ODM.

docker compose up -d

At the end you should see all containers running as follows:

[+] Running 1/1
 ⠿ grafana Pulled                           1.4s                                                                                         
[+] Running 7/7
 ⠿ Container simulatte-db        Healthy    11.4s
 ⠿ Container prometheus          Started    0.6s
 ⠿ Container notebooks           Started    0.9s
 ⠿ Container simulation-service  Started    12.0s
 ⠿ Container analytic-service    Started    12.0s
 ⠿ Container grafana             Started    1.3s
 ⠿ Container odm                 Started     0.7s

Ports

Be aware that Simulatte uses by default the following ports:

  • port 3000 for grafana
  • port 8080 for simulatte online
  • port 8888 for Jupyter notebooks
  • port 4040 for Spark jobs monitoring
  • port 9060 only for ODM image use
  • port 9443 only for ODM image use
  • port 5432 for database
  • port 9090 for Prometheus

In case these port numbers collide with existing applications you can change then in the docker-compose.yml.

The OpenAPI UI is available at http://localhost:8080/swagger-ui/index.html

To use simulatte notebooks, run the command below:

docker logs notebooks

Click on the http://127.0.0.1:8888 link printed in your console. You see the Jupyter web page.

Entering in the notebook server

On the left side navigate into the work/notebooks hierarchy. You see ADS and ODM folders.

ODM

Select the decisioning capability ODM.
Select notebook for loanvalidation.
Run all cells.

ADS

Select the decisioning capability ADS.
Select notebook for loyalty.
Run all cells.
Congratulations you have created a decision simulation, run it, to get insights about how your logic applies to the sample dataset!

Stop

To stop all simulatte containers :

docker compose stop

Troubleshooting

  • Build - No space left on device during Build You can encounter a space limit when building the images with the message like:
    ERROR: Could not install packages due to an OSError: [Errno 28] No space left on device: '/tmp/pip-uninstall-t79kgz4i'
    Resolution: Clean up the temporary folders and unnecessary resources in your Docker installation by running:
    docker system prune    

Structure

The simulatte project is composed of the following building blocks:

  • core: a core library. It contains all common material to run simulations in online and offline. It covers the default decision service invokers for ADS and ODM.
  • online: online. This part comes as Docker microservices to execute decisions through a Java based container, and perform analytic post processing to produce a simulation report.
  • offline: offline. This part gives a generic batch simulation runner capable to run standalone but on Hadoop Spark cluster too for maximum of scalability and performance with data residing in the lake.

Structure

Benchmarks

benchmarks: performances

About

Simulations for Decision Automation

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages