Update to utilize new multi-tenancy approach #118

jordanpadams · 2023-03-30T21:58:53Z

💡 Description

handshake auth to only access data owned by the node
update documentation as needed

sjoshi-jpl · 2023-12-18T19:05:36Z

Modify Harvest to:

Get Cognito JWT tokens (for write access to OpenSearch)
Get signed URLs (call API Gateway path for GET /signed-url)
For write access to OpenSearch with Signed-URLs (directly calling OpenSearch with signed-url) for read-only access to OpenSearch (call API Gateway path for GET /registry)

al-niessner · 2024-01-02T17:08:34Z

@jordanpadams @sjoshi-jpl

Is there a cognito tutorial for us somewhere? Just the readme of some repo maybe?

al-niessner · 2024-01-08T19:30:37Z

@alexdunnjpl @jordanpadams @tloubrieu-jpl

Looking for opinions. Currently, harvest splits up server information. In the harvest config, provide registry URL, index, and auth file. In auth file provide trust self signed and username/password. Trusting self signed is a server problem like the URL not auth information. Also, harvest uses a sax parser to read the config file making it difficult to understand what is legal and illegal in the file.

There are several theories of approach here:

do least change
minimize change while maximize consistency
do it the right way

least change

So, least change would mean shoving clientID and urlIsCognito into auth file. Already had some discontent at last breakout over that. Not a huge fan of it myself because now you have repeated information for Nick and Nora who just care about their username/password and all the rest is meaningless. The auth file would look like:

trust.self-signed = true
clientID = a123bajkhfdkjalkjalj212lsdlkjsgjkl
urlIsCognito = false
# TODO Warning: Use the default username and password only for testing purposes in local setup
user = nickandnora
password = asta

minimal change but consistent

Here, group the server information in harvest config and make auth just username/password:

user = nickandnora
password = asta

However this pushes three new items into the harvest config. Since it is a sax parser it will unduly complicate the code and provide no explanation of what is possible. In turn, all the examples would have to be expanded and expounded to cover the new options. It would looks something like:

<registry url="https://es:9200" 
               index="registry"
               clientID=akjdsajfdkjaklfoiwi"
               trust.self_signed="false"
               urlIsCognito="true"
               auth="auth.cfg" />

right way

First, write a schema for harvest config. It would have docs to explain the options. It would allow for xmllint to determine if the config is valid and tell the user where the error is. Since the user base is PDS, they should be familiar enough with XML and schema to use it effectively.

Second, use JAXB and get out of the XML parsing business. Is much more secure with respect to injection and it also validates the XML as reading.

Lastly, auth file would be username/password only. The line would look more like:

<registry url="https://es:9200" 
               index="registry"
               clientID=akjdsajfdkjaklfoiwi"
               trust.self_signed="false"
               style="direct" --> schema would limit to direct or cognito for now
               auth="auth.cfg" />

Obviously I prefer the last. When I had to write my own harvest from scratch I had to read java code and was really annoyed that there was no schema. It will take a little bit of time (days) to do schema and stuff but will pay dividends over time.

Thoughts?

alexdunnjpl · 2024-01-08T19:34:04Z

Firm default-vote for "right way".

jordanpadams · 2024-01-08T21:21:05Z

@al-niessner +1 to "right way". We have gotten complaints about not having a schema file in the past for this. They have some schema generators online you can use to get a first pass at what that could be as well.

tloubrieu-jpl · 2024-01-09T21:30:59Z

@al-niessner is looking at the xsd schema that requires update for the cognito configuration in the harvest configuration file.

tloubrieu-jpl · 2024-01-11T22:28:32Z

The schema is now updated and is backward compatible. @tloubrieu-jpl can review the new schema available in the draft PR.

al-niessner · 2024-01-25T17:47:12Z

#146 was only start to solving this ticket.

jordanpadams · 2024-03-21T21:09:31Z

@al-niessner status: Got a response from Cognito team. Waiting for @sjoshi-jpl to make the happenings happen.

tloubrieu-jpl · 2024-06-10T15:56:11Z

Task for @tloubrieu-jpl : write an updated documentation for Gary.

Task plan for @gxtchen :

build harvest and registry-manager with registry-common from branches:
- harvest issue_118.1
- registry-manager issue_66
- registry-common issue_36
test plan:
- find out tests which need to be done, from the documentation https://nasa-pds.github.io/registry/admin/tasks.html, user tasks and admin tasks), make it a test plan in test rail for non regression
- otherwise nominal tests:
  - create registry (registry-mgr)
  - update data dictionaries (registry-mgr)
  - load data (harvest), multiple job configurations, load bundle, directory...
  - update archive status (registry-mgr)
test venues:
- AWS MCP Dev account
- Docker compose local deployment.

tloubrieu-jpl · 2024-06-10T15:59:23Z

@gxtchen as a note, after we met I also thought, we also need to think of how to automate the tests that you are going to write for harvest and registry-manager. That can be done in the docker compose set-up.

jordanpadams added B14.0 task i&t.skip labels Mar 30, 2023

jordanpadams assigned alexdunnjpl Mar 30, 2023

jordanpadams mentioned this issue Mar 30, 2023

Registry Multi-tenancy Design and implementation with Cognito NASA-PDS/registry#155

Closed

jordanpadams mentioned this issue Apr 12, 2023

Implement Registry Multi-tenancy with Cognito in the loop NASA-PDS/registry#185

Closed

jordanpadams assigned sjoshi-jpl Apr 13, 2023

jordanpadams added B14.1 and removed B14.0 labels Aug 17, 2023

jordanpadams mentioned this issue Dec 18, 2023

Modify Harvest - OpenSearch Serverless Authentication NASA-PDS/registry#257

Closed

jordanpadams assigned al-niessner and unassigned alexdunnjpl Dec 19, 2023

jordanpadams added the sprint-backlog label Dec 19, 2023

al-niessner mentioned this issue Jan 11, 2024

Refactor harvest to operate with new multi-tenant, serverless OpenSearch architecture #146

Merged

jordanpadams closed this as completed in #146 Jan 24, 2024

al-niessner reopened this Jan 25, 2024

al-niessner mentioned this issue Apr 19, 2024

Update Harvest to support multi-tenancy and OpenSearch Serverless #152

Merged

jordanpadams assigned sjoshi-jpl and unassigned sjoshi-jpl Apr 23, 2024

al-niessner closed this as completed in #152 Jul 5, 2024

jordanpadams removed the sprint-backlog label Aug 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update to utilize new multi-tenancy approach #118

Update to utilize new multi-tenancy approach #118

jordanpadams commented Mar 30, 2023

sjoshi-jpl commented Dec 18, 2023

al-niessner commented Jan 2, 2024

al-niessner commented Jan 8, 2024

alexdunnjpl commented Jan 8, 2024

jordanpadams commented Jan 8, 2024

tloubrieu-jpl commented Jan 9, 2024

tloubrieu-jpl commented Jan 11, 2024

al-niessner commented Jan 25, 2024

jordanpadams commented Mar 21, 2024

tloubrieu-jpl commented Jun 10, 2024

tloubrieu-jpl commented Jun 10, 2024

Update to utilize new multi-tenancy approach #118

Update to utilize new multi-tenancy approach #118

Comments

jordanpadams commented Mar 30, 2023

💡 Description

sjoshi-jpl commented Dec 18, 2023

al-niessner commented Jan 2, 2024

al-niessner commented Jan 8, 2024

least change

minimal change but consistent

right way

alexdunnjpl commented Jan 8, 2024

jordanpadams commented Jan 8, 2024

tloubrieu-jpl commented Jan 9, 2024

tloubrieu-jpl commented Jan 11, 2024

al-niessner commented Jan 25, 2024

jordanpadams commented Mar 21, 2024

tloubrieu-jpl commented Jun 10, 2024

tloubrieu-jpl commented Jun 10, 2024