Deploy C2D
This chapter will present how to deploy the C2D component of the Ocean stack. As mentioned in the C2D Architecture chapter, the Compute-to-Data component uses Kubernetes to orchestrate the creation and deletion of the pods in which the C2D jobs are run.
For the ones that do not have a Kubernetes environment available, we added to this guide instructions on how to install Minikube, which is a lightweight Kubernetes implementation that creates a VM on your local machine and deploys a simple cluster containing only one node. In case you have a Kubernetes environment in place, please skip directly to step 4 of this guide.
Requirements
Communications: a functioning internet-accessible provider service
Hardware: a server capable of running compute jobs (e.g. we used a machine with 8 CPUs, 16 GB Ram, 100GB SSD, and a fast internet connection). See this guide for how to create a server;
Operating system: Ubuntu 22.04 LTS
Steps
Install Docker and Git
Install Minikube
Start Minikube
The first command is important and solves a PersistentVolumeClaims problem.
Depending on the number of available CPUs, RAM, and the required resources for running the job, consider adding options --cpu
, --memory
, and --disk-size
to avoid runtime issues.
For other options to run minikube refer to this link
Install the Kubernetes command line tool (kubectl)
Wait until all the defaults are running (1/1).
Download all required files
Create a folder, cd into it, and clone the following repositories:
Create namespaces
In this tutorial, we are going to create only one environment, called ocean-compute
.
Setup up Postgresql
For now, communication between different components is made through pgsql. This will change in the near future.
Edit operator-service/kubernetes/postgres-configmap.yaml
. Change POSTGRES_PASSWORD
to a nice long random password.
Then deploy pgsql
Congrats, pgsql is running now.
Run the IPFS host (optional)
To store the results and the logs of the C2D jobs, you can use either an AWS S3 bucket or IPFS.
In case you want to use IPFS you need to run an IPFS host, as presented below.
Update the storage class
The storage class is used by Kubernetes to create the temporary volumes on which the data used by the algorithm will be stored.
Please ensure that your class allocates volumes in the same region and zone where you are running your pods.
You need to consider the storage class available for your environment.
For Minikube, you can use the default 'standard' class.
In AWS, we created our own 'standard' class:
For more information, please visit https://kubernetes.io/docs/concepts/storage/storage-classes/
If you need to use your own classes, you will need to edit 'operator_engine/kubernetes/operator.yml'.
Setup C2D Orchestrator
C2D Orchestrator (aka operator-service) has two main functions:
First, it's the outside interface of your C2D Cluster to the world. External components(like Provider) are calling APIs exposed by this
Secondly, operator-service manages multiple environments and sends the jobs to the right environment.
Edit operator-service/kubernetes/deployment.yaml
. Change ALLOWED_ADMINS
to a nice long random password.
Let's deploy C2D Orchestrator.
Now, let's expose the service.
You can run a port forward in a new terminal (see below) or create your ingress service and setup DNS and certificates (not covered here):
Alternatively you could use another method to communicate between the C2D Environment and the provider, such as an SSH tunnel.
And now it's time to initialize the database.
If your Minikube is running on compute.example.com:
(where myAdminPass is configured in Setup C2D Orchestrator)
Congrats, you have operator-service running.
Setup your first environment
Let's create our first environment. Edit operator-service/kubernetes/deployment.yaml
.
set OPERATOR_PRIVATE_KEY. This has to be unique among multiple environments. In the future, this will be the account credited with fees.
optionally change more env variables, to customize your environment. Check the README section of the operator engine to customize your deployment. At a minimum, you should add your IPFS URLs or AWS settings, and add (or remove) notification URLs.
Finally, let's deploy it:
Optional: For production enviroments, it's safer to block access to metadata. To do so run the below command:
Congrats,your c2d environment is running.
If you want to deploy another one, just repeat the steps above, with a different namespace and different OPERATOR_PRIVATE_KEY.
Update Provider
Update your existing provider service by updating the operator_service.url
value in config.ini
, or set the appropiate ENV variable.
Restart your provider service.
Automated deployment example
If your setup is more complex, you can checkout (our automated deployment example)[https://github.com/oceanprotocol/c2d_barge/blob/main/c2d_barge_deployer/docker-entrypoint.sh]. This script is used by barge to automaticly deploy the C2D cluster, with two environments.
Last updated