TitlePattern geocoder #162

Merged
merged 14 commits into from
Nov 8, 2024

Conversation

danvk
Owner

@danvk danvk commented Nov 8, 2024

This looks for image titles that match the pattern:

Borough: Street1 - Street2

These are then run through either the extended-grid geocoder or Google Maps. This results in changes in two situations:

  1. The match is on the alt_title field, which wasn't previously used by extended-grid, and wasn't consistently used by milstein.
  2. There's been a data update in the past 12 years (Geocoding updates #150), in which case this will pick up the new location.
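
As a rough illustration of the title matching described above, here is a minimal Python sketch. It is not the code from this PR: the regular expression, the `Intersection` type, and the `parse_title` / `find_intersection` helpers are hypothetical, and it assumes each record is a dict whose `title` and `alt_title` values are plain strings.

```python
import re
from typing import NamedTuple, Optional

# Hypothetical pattern for "Borough: Street1 - Street2".
# The borough is any text before the first colon; the two streets
# are separated by a hyphen with whitespace on both sides.
TITLE_RE = re.compile(
    r"^(?P<borough>[^:]+):\s*(?P<street1>.+?)\s+-\s+(?P<street2>.+)$"
)


class Intersection(NamedTuple):
    borough: str
    street1: str
    street2: str


def parse_title(title: str) -> Optional[Intersection]:
    """Return the parsed intersection, or None if the title does not match."""
    m = TITLE_RE.match(title.strip())
    if m is None:
        return None
    return Intersection(
        m["borough"].strip(), m["street1"].strip(), m["street2"].strip()
    )


def find_intersection(record: dict) -> Optional[Intersection]:
    """Try the title first, then fall back to alt_title (assumed field names)."""
    for field in ("title", "alt_title"):
        value = record.get(field)
        if value:
            hit = parse_title(value)
            if hit is not None:
                return hit
    return None
```

A parsed `Intersection` would then be handed to whichever geocoder is in use (extended-grid or Google Maps); that step is omitted here because its interface is not shown in this description.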

This results in 265 new items on the site and 162 location changes. I reviewed 20 of these: 85% were wins and 15% were neutral (moving across an intersection).

On its own, this geocodes 23,454 / 41,463 = 56.6% of the items. This takes a lot of pressure off the downstream coders and reduces reliance on the mysterious address field from the 2013 CSV file, which has unclear provenance.

@danvk danvk marked this pull request as ready for review November 8, 2024 21:51
@danvk danvk merged commit 7d70122 into master Nov 8, 2024
4 checks passed
@danvk danvk deleted the title-pattern branch November 8, 2024 21:56
This was referenced Nov 10, 2024