AWS Terraform

The deployment consists of several Docker containers, a database for storing metadata, and persistent file storage for storing files.

This document contains instructions to deploy Datagrok using Terraform on an AWS ECS cluster with AWS RDS and AWS S3.

We addressed many typical security considerations during the development of the Terraform code. As a result, the deployed Datagrok infrastructure adheres to standard AWS security best practices.

More information about Datagrok design and components:

Basic usage

Prerequisites

Check that you have the required permissions on the AWS account to perform Terraform deployment to ECS.
Check that you have the required S3 permissions and DynamoDB permissions.
Create an IAM role for CloudFormation stack deployment with the following policies attached:
- IAMFullAccess
- PowerUserAccess

You can name it something like CloudFormationExecutionRole. Its ARN will be used as the iam_role_arn variable in the Terraform module.

Deploy Datagrok components

You can use our example as a template and modify it to your needs. Example: link.
Make sure to change the module source to source = "git@github.com:datagrok-ai/tf-module-datagrok-core.git//aws?ref=main" or any other version you want to use.
Set up AWS credentials to run Terraform code
Apply the Terraform code
```
terraform init --upgrade
terraform apply
```
Once the Datagrok container starts, the server will initialize the database. You can monitor the status by checking the running task logs in CloudWatch.

Advanced usage

The Terraform code is highly configurable. Feel free to adapt the code and variables to meet your needs and requirements. Terraform modules documentation:

Datagrok Core README

Basic usage​

Prerequisites​

Deploy Datagrok components​

Advanced usage​

Basic usage

Prerequisites

Deploy Datagrok components

Advanced usage