
#539 Spark schema generation works with record relations #556

Open · cwoods-cpointe wants to merge 1 commit into dev from 539-spark-schema-relation-records

Conversation

@cwoods-cpointe (Contributor):

Updated Spark Schema MDA generation to account for relations between records. Updated the relation MDA to include the new column and required fields.

Added record relation unit tests.

Spark Schema to/from POJO conversions work with records that have relations.

Updated documentation to account for the new relation fields.

Implemented validation for relations, except for the one-to-many (1-M) multiplicity.

@cwoods-cpointe cwoods-cpointe force-pushed the 539-spark-schema-relation-records branch from 04f5983 to d35a20f Compare January 31, 2025 19:43
@@ -247,13 +247,22 @@ namespacing (e.g., package in Java, namespace in XSD).
| `relations/relation/documentation`
| No
| None
-| A description of the field.
+| A description of the relation.
@ewilkins-csi (Contributor), Jan 31, 2025:

Q: Do we define what we mean by "relation" anywhere?

@cwoods-cpointe (Contributor, PR author):

It is referenced in the root record docs here.

/**
* Class for common Spark-related attributes.
*/
public class SparkRelationAttributes {
Contributor:

Q: Why not add these methods to BaseRecordRelationDecorator? Specifically, it seems reasonable to include the defaulting logic of column and required/nullable so that we use the same defaults everywhere.
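
A minimal sketch of what that consolidation might look like (the wrapped element, the accessor names, and the default rules themselves are assumptions for illustration, not the PR's actual implementation):

// Sketch: centralize the defaulting on the decorator so every generator
// resolves column/required the same way. The default rules below are assumed.
public class BaseRecordRelationDecorator {

    private final RelationElement wrapped;

    public BaseRecordRelationDecorator(RelationElement wrapped) {
        this.wrapped = wrapped;
    }

    /** Assumed default: fall back to the relation's name when no column is set. */
    public String getColumn() {
        return wrapped.getColumn() != null ? wrapped.getColumn() : wrapped.getName();
    }

    /** Assumed default: relations are nullable/optional unless explicitly required. */
    public boolean isRequired() {
        return Boolean.TRUE.equals(wrapped.getRequired());
    }
}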

Contributor:

Related: it may be worth adding some unit tests asserting how the defaulting logic works, like we do with other metamodel options, to help remind us that it's a breaking change if we update it.
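
A sketch of such a test, assuming JUnit 5 and the constructor/setter names from the previous sketch (all illustrative):

import static org.junit.jupiter.api.Assertions.assertEquals;
import static org.junit.jupiter.api.Assertions.assertFalse;

import org.junit.jupiter.api.Test;

public class RelationDefaultsTest {

    /**
     * Pins the assumed defaults so that changing them fails a test,
     * making any breaking change visible and deliberate.
     */
    @Test
    public void unsetColumnAndRequiredFallBackToDefaults() {
        RelationElement relation = new RelationElement();
        relation.setName("mayor");

        BaseRecordRelationDecorator decorated = new BaseRecordRelationDecorator(relation);

        assertEquals("mayor", decorated.getColumn());
        assertFalse(decorated.isRequired());
    }
}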

@cwoods-cpointe (Contributor, PR author):

The column name is really just used for the Spark save logic, but I agree it makes sense to pull these up to a higher level in case they need to be used elsewhere.

| multiplicity | record |
| 1-1 | Mayor |
| 1-M | Street |
| M-1 | State |
Contributor:

Texarkana? 🙃

#set($hasOneToManyRelation = 0)
#foreach($relation in $record.relations)
#if($relation.isOneToManyRelation())
#set($hasOneToManyRelation = 1)
Contributor:

A: This might be a little more maintainable to move into the Java object or context variables. It's pretty straightforward logic, but in general I find Velocity template logic pretty hard to parse/update, so I prefer doing as much as possible in Java.
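
A sketch of what that hoisting could look like on the record's Java model object (getRelations() is assumed from $record.relations in the template; isOneToManyRelation() already appears there):

/** Sketch: compute the flag in Java so the template only has to test it. */
public boolean hasOneToManyRelation() {
    return getRelations().stream().anyMatch(Relation::isOneToManyRelation);
}

The template condition would then collapse to #if($record.hasOneToManyRelation()).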

@cwoods-cpointe cwoods-cpointe force-pushed the 539-spark-schema-relation-records branch 2 times, most recently from f649b77 to 2334a2a Compare January 31, 2025 20:11
| `relations/relation/column`
| No
| None
| The name of the storage field for data persistence.
@carter-cundiff (Contributor), Jan 31, 2025:

Q: If each relation refers to another record type, should we be able to use the column attribute from the field within the relation record?

Contributor:

A relation is basically just a field on the record with a type that corresponds to another record, so there is no specific field in the related record that ties back to the relation definition in this record. Hope that makes sense.
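
For illustration, in hand-written POJO terms (hypothetical classes; Mayor comes from the documentation table above):

// A relation is just a field whose type is another record.
class City {
    String name;   // an ordinary field with a simple/dictionary type
    Mayor mayor;   // a 1-1 relation: a field typed as another record
}

class Mayor {
    String name;
}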

@cwoods-cpointe cwoods-cpointe force-pushed the 539-spark-schema-relation-records branch from 2334a2a to 34571a6 Compare January 31, 2025 20:56
@cwoods-cpointe cwoods-cpointe force-pushed the 539-spark-schema-relation-records branch from 34571a6 to a51526d Compare January 31, 2025 21:05
Update Spark Schema MDA generation to account for relations between
records. Updated the relation MDA to include column and required fields.

Added record relation unit tests.

Spark Schema casting and to/from POJO conversions work with records that have relations.

Updated documentation to account for the new relation fields.

Implemented validation for relations except one-to-many.
@cwoods-cpointe cwoods-cpointe force-pushed the 539-spark-schema-relation-records branch from 2edccb2 to edd99bd Compare January 31, 2025 23:06
@@ -34,6 +34,11 @@ public class RelationElement extends NamespacedMetamodelElement implements Relat
@JsonInclude(Include.NON_NULL)
protected Multiplicity multiplicity;

@JsonInclude(Include.NON_NULL)
protected Boolean required;
@csun-cpointe (Contributor), Jan 31, 2025:

Q: Should we default this to false? Since Boolean allows a null value, the isRequired() function will return null by default if it is not set.
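
A minimal sketch of one way to address that, assuming the nullable Boolean is kept so @JsonInclude(Include.NON_NULL) can still omit an unset value during serialization:

/** Raw value for (de)serialization; may be null when not set in the metamodel. */
public Boolean getRequired() {
    return required;
}

/** Null-safe accessor for generators; assumed default is "not required". */
public boolean isRequired() {
    return Boolean.TRUE.equals(required);
}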
