V3 casl 663 twkb srid fix #380

Amaneusz · 2024-11-12T11:36:34Z

No description provided.

Signed-off-by: Jakub Amanowicz <[email protected]>

Amaneusz · 2024-11-12T11:47:41Z

here-naksha-lib-psql/src/commonTest/kotlin/naksha/psql/SridTest.kt

+
+class SridTest : PgTestBase() {
+
+    data class SridTestConfig(val encodingName: String, val flags: Flags, val collectionId: String)


This might look odd at first glance but there's a reason to have test configs as separate being.
Currently in Naksha, when we want to write features with specific encoding we need to specify this as a collection property (via NakshaCollection.defaultFlags). Then, whenever we write features to collection, this field is read and proper encoding is enforced on geometry, tags etc.

In the tests below, we are validating correct encoding of geometry (more specifically, we check whether naksha_geometry function, when run against persisted feature, returns geometry with SRID set to 4326 which is our default and only supported SRID).

To test all encodings we have, we need separate collection for each encoding - we need features to be properly encoded before persisted so we need proper collection.flags to be set (see 1st paragraph above).

Why not use single collection and modify it with defferent flags for each test?

Because collections and their properties are cached, so if we had shared collection for all test cases, the application wouldn't fetch collection from DB, rather it will use what it has in cache (with previous encoding) and we would end up with encoding not matching our expectation

To avoid the above, the collection cache would have to be cleared, and since it is not public (rightly so), the only way to do that is dropping the collection - this already makes the whole process more complicated and slowe than simply having N collections for N encodings

Amaneusz · 2024-11-12T11:59:29Z

here-naksha-lib-psql/src/jvmMain/resources/naksha.sql

  elsif (encoding = 2) then
    RETURN ST_GeomFromWKB(geo);
  elsif (encoding = 4) then
    RETURN ST_GeomFromEWKB(geo);
  elsif (encoding = 6) then
-    RETURN ST_GeomFromGeoJSON(geo::text);
+    RETURN ST_GeomFromGeoJSON(convert_from(geo, 'UTF8'));


We had a bug here, casting bytea::text won't work as expected, I was getting SQL Error [XX000]: ERROR: unexpected character (at offset 0). The correct way to get text represntation of JSON previously encoded as byte array is to use convert_from with UTF8 encoding specified (this is also the encoding we use in lib-psql).

Some snippets to prove it step by step:

Hex representation of simple Line String:

// Kotlin: LineStringCoord( PointCoord(longitude = 25.0, latitude = 25.0), PointCoord(longitude = 25.0, latitude = 26.0), ) // bytes (hex): '\x7b2274797065223a224c696e65537472696e67222c22636f6f7264696e61746573223a5b5b32352e302c32352e305d2c5b32352e302c32362e305d5d7d'

This is what happened before - confirming that we passed something invalid to ST_GeomFromGeoJSON:

> SELECT ST_GeomFromGeoJSON((bytea '\x7b2274797065223a224c696e65537472696e67222c22636f6f7264696e61746573223a5b5b32352e302c32352e305d2c5b32352e302c32362e305d5d7d')::text) SQL Error [XX000]: ERROR: unexpected character (at offset 0)

Checking the cast - we can see that we did not get json, rather raw bytes as text

SELECT (bytea '\x7b2274797065223a224c696e65537472696e67222c22636f6f7264696e61746573223a5b5b32352e302c32352e305d2c5b32352e302c32362e305d5d7d')::text \x7b2274797065223a224c696e65537472696e67222c22636f6f7264696e61746573223a5b5b32352e302c32352e305d2c5b32352e302c32362e305d5d7d

Passing these byte representation is not suitable for ST_GeomFromGeoJSON - we get the same error as above

SELECT ST_GeomFromGeoJSON('\x7b2274797065223a224c696e65537472696e67222c22636f6f7264696e61746573223a5b5b32352e302c32352e305d2c5b32352e302c32362e305d5d7d') SQL Error [XX000]: ERROR: unexpected character (at offset 0)

Switching from cast to convert_from, we get proper JSON

SELECT convert_from(bytea '\x7b2274797065223a224c696e65537472696e67222c22636f6f7264696e61746573223a5b5b32352e302c32352e305d2c5b32352e302c32362e305d5d7d', 'UTF8') {"type":"LineString","coordinates":[[25.0,25.0],[25.0,26.0]]}

Passing proper JSON to ST_GeomFromGeoJSON we get what wanted - geometry instance

SELECT ST_GeomFromGeoJSON(convert_from(bytea '\x7b2274797065223a224c696e65537472696e67222c22636f6f7264696e61746573223a5b5b32352e302c32352e305d2c5b32352e302c32362e305d5d7d', 'UTF8')) LINESTRING (25 25, 25 26)

Signed-off-by: Jakub Amanowicz <[email protected]>

github-actions · 2024-11-12T12:19:57Z

Code Coverage

Overall Project	31.1%	🍏

There is no coverage information present for the Files changed

CASL-663 srid fix & tests

126b50b

Signed-off-by: Jakub Amanowicz <[email protected]>

Amaneusz commented Nov 12, 2024

View reviewed changes

CASL-663 fix GEO_JSON decoding in naksha.sql

d43739a

Signed-off-by: Jakub Amanowicz <[email protected]>

Amaneusz force-pushed the v3_CASL-663_twkb_srid_fix branch from cfcbada to d43739a Compare November 12, 2024 12:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

V3 casl 663 twkb srid fix #380

V3 casl 663 twkb srid fix #380

Amaneusz commented Nov 12, 2024

Amaneusz Nov 12, 2024 •

edited

Loading

Amaneusz Nov 12, 2024

github-actions bot commented Nov 12, 2024


		class SridTest : PgTestBase() {

		data class SridTestConfig(val encodingName: String, val flags: Flags, val collectionId: String)

V3 casl 663 twkb srid fix #380

Are you sure you want to change the base?

V3 casl 663 twkb srid fix #380

Conversation

Amaneusz commented Nov 12, 2024

Amaneusz Nov 12, 2024 • edited Loading

Choose a reason for hiding this comment

Amaneusz Nov 12, 2024

Choose a reason for hiding this comment

github-actions bot commented Nov 12, 2024

Code Coverage

Amaneusz Nov 12, 2024 •

edited

Loading