Repurpose PSE terraform config to initialize cloud resources #2

zschira · 2024-02-27T17:24:59Z

This PR takes sets up cloud infrastructure using terraform. We will be using similar infrastructure to that used for the PSE project, so most of this was repurposed from there. Specifically, we will have storage bucket with raw 10-K filings, and postgres instance containing filing metadata. This will be used during development so we can quickly search through filings, manually inspect them, and do experimental development with minimal hassle. Eventually this infrastructure will likely be taken down, as all of the dev work for mozilla should be migrated into PUDL, and we will access filings using our normal archiver workflow.

zschira · 2024-02-28T15:03:04Z

terraform/main.tf

+
+variable "project_id" {
+  type    = string
+  default = "catalyst-cooperative-pudl"


Is there anything weird about using the PUDL project from a different repo? Should we just create a mozilla project?

If we're billing infrastructure costs to Mozilla, we should probably make a separate mozilla project - that will make the bookkeeping much easier.

If not, I think it's fine to use the same GCP project - though if we do that, I'd rather this terraform configuration live in the pudl repo. I think one tfstate per GCP project is easier to think about.

jdangerx

The terraform stuff is a little funky. I personally think we should make a new Mozilla project in GCP, and set stuff up for that.

But if we're going to end up reusing a bunch of this infrastructure for PUDL one day, it might make sense to bring it into the main project + terraform state.

jdangerx · 2024-02-28T16:09:12Z

terraform/main.tf

+
+variable "project_id" {
+  type    = string
+  default = "catalyst-cooperative-pudl"


If we're billing infrastructure costs to Mozilla, we should probably make a separate mozilla project - that will make the bookkeeping much easier.

If not, I think it's fine to use the same GCP project - though if we do that, I'd rather this terraform configuration live in the pudl repo. I think one tfstate per GCP project is easier to think about.

terraform/main.tf

jdangerx · 2024-02-28T16:12:56Z

terraform/main.tf

+    "[email protected]",
+    "[email protected]",
+    "[email protected]",
+    "[email protected]"


Might want to add @bendnorman too.

jdangerx · 2024-02-28T16:16:51Z

terraform/main.tf

+}
+
+resource "google_storage_bucket" "tfstate" {
+  name          = "${random_id.bucket_prefix.hex}-tfstate"


So if we create a new Mozilla project, we'll need to actually follow this guide which basically says:

use a block such as this one to create a bucket to hold tfstate

only then do you configure the tfstate to be remote like in lines 1-4 - though point the gcs backend at the new bucket

If you want to stay in the PUDL project because the billing isn't separate, you can avoid making this new bucket and just use what we have already set up (the f344... bucket).

terraform/main.tf

jdangerx · 2024-02-28T16:20:29Z

terraform/main.tf

+  type     = "CLOUD_IAM_USER"
+}
+
+resource "google_secret_manager_secret" "postgres_pass" {


Note that you'll have to create a new "secret version" manually to actually hold a secret password value.

zschira · 2024-02-28T17:53:06Z

Ok so I've switched to using a dedicated mozilla project. We do have some funding for cloud resources, so we should be billing separately. It seems like that solves many of these problems.

I think I need someone else to create the project though, as I don't have permission

jdangerx

This seems like it'll mostly work, except for the whole "remote tfstate has to have a bucket, but the bucket has to be created by terraform first" thing. 🐣 I'm happy to pair on the operations of that if that makes life easier for you!

jdangerx · 2024-03-01T14:34:41Z

terraform/main.tf

@@ -0,0 +1,140 @@
+terraform {
+  backend "gcs" {
+    bucket = "f3441e415e6e5e7d-bucket-tfstate"


This is the right shape of things, but you'll run into issues with f34... bucket not existing in the new project! You'll have to:

take away this gcs backend

create a new state bucket with a new random prefix

re-configure this gcs backend with the new state bucket

It's fine to do that all locally before committing the changes, which should look exactly like this but with a different bucket name.

jdangerx · 2024-03-01T14:35:57Z

terraform/main.tf

+  display_name = "Mozilla dev"
+}
+
+resource "random_id" "bucket_prefix" {


super-nit: might be a little easier to read later if this bucket prefix is defined right next to the only place it's used - the tfstate bucket.

jdangerx · 2024-03-01T14:38:24Z

Oh and I think either @zaneselvans or @bendnorman have permissions to create a new project.

bendnorman · 2024-03-07T20:30:48Z

terraform/main.tf

+
+resource "google_project_iam_binding" "catalyst_people_editors" {
+  project = var.project_id
+  role    = "roles/editor"


Is this making everyone an editor on the entire project?

zschira added 3 commits February 27, 2024 12:19

Repurpose PSE terraform config to initialize cloud resources

97edb19

Remove references to docker

82a1196

Fix tox

53fde4c

zschira requested a review from jdangerx February 27, 2024 17:43

zschira commented Feb 28, 2024

View reviewed changes

Add Ben to people

33a7cd3

jdangerx requested changes Feb 28, 2024

View reviewed changes

Use dedicated mozilla project

2b157c9

jdangerx requested changes Mar 1, 2024

View reviewed changes

bendnorman reviewed Mar 7, 2024

View reviewed changes

This was referenced Mar 7, 2024

Make SEC 10K metadata into a structured database catalyst-cooperative/pudl#3431

Closed

Archive SEC Ex. 21 Documents catalyst-cooperative/pudl#3308

Closed

Update bucket id after state initialization

0a25c2f

jdangerx approved these changes Mar 8, 2024

View reviewed changes

zschira merged commit b48b769 into main Mar 8, 2024
11 checks passed

zschira deleted the init_infra branch March 8, 2024 19:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repurpose PSE terraform config to initialize cloud resources #2

Repurpose PSE terraform config to initialize cloud resources #2

zschira commented Feb 27, 2024

zschira Feb 28, 2024

jdangerx Feb 28, 2024

jdangerx left a comment

jdangerx Feb 28, 2024

jdangerx Feb 28, 2024

jdangerx Feb 28, 2024

jdangerx Feb 28, 2024

zschira commented Feb 28, 2024

jdangerx left a comment

jdangerx Mar 1, 2024

jdangerx Mar 1, 2024

jdangerx commented Mar 1, 2024

bendnorman Mar 7, 2024

Repurpose PSE terraform config to initialize cloud resources #2

Repurpose PSE terraform config to initialize cloud resources #2

Conversation

zschira commented Feb 27, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jdangerx left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

zschira commented Feb 28, 2024

jdangerx left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jdangerx commented Mar 1, 2024

Choose a reason for hiding this comment