The example queries below src/sql/examples/cc-index/ were developed using Athena engine v1 or v2. There might be issues when engine v3 (based on Trino instead of PrestoDB) is used. E.g., (num_pages/total_pages_host) >= .05 in one of the queries needs to be updated to (1.0*num_pages/total_pages_host) >= .05 so that the division is done in floating point rather than on integers (int/float cast).
Ideally, we should run the queries against each of the engines and compare the results to verify that everything works as expected.
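The effect of the cast, and the idea of comparing result sets, can be sketched locally. This is only an illustration, not a test against Athena: it uses Python's built-in sqlite3 module as a stand-in because SQLite also truncates integer division, and the table name `stats`, the host names, and the page counts are made up for the example. Which Athena engine version truncates in which situation is not something this sketch can show.

```python
import sqlite3

# Hypothetical sample data: (host, num_pages, total_pages_host).
ROWS = [
    ("a.example", 1, 100),   # ratio 0.01, below the threshold
    ("b.example", 3, 20),    # ratio 0.15, above the threshold
    ("c.example", 10, 20),   # ratio 0.50, above the threshold
]


def run(conn, predicate):
    """Return the hosts selected by the given WHERE predicate, sorted."""
    sql = f"SELECT host FROM stats WHERE {predicate} ORDER BY host"
    return [row[0] for row in conn.execute(sql)]


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE stats (host TEXT, num_pages INTEGER, total_pages_host INTEGER)"
)
conn.executemany("INSERT INTO stats VALUES (?, ?, ?)", ROWS)

# Integer division: every ratio below 1 truncates to 0, so nothing matches.
original = run(conn, "(num_pages/total_pages_host) >= .05")

# Multiplying by 1.0 first forces floating-point division.
patched = run(conn, "(1.0*num_pages/total_pages_host) >= .05")

print("original:", original)
print("patched: ", patched)
print("results differ:", original != patched)
```

The same pattern (run each query variant, collect the rows, compare) is one possible way to check the example queries across engine versions, with the SQLite connection replaced by whatever client is used to query Athena.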
- WARC storage metrics by MIME type: extract common/frequent file suffixes from URL
- site discovery by language: implicit cast to floating point number (address #23)
- select robots.txt records for a given list of domains