[0.2.dev7] MergedChoiceTable check for duplicate column names #54

smmaurer · 2018-12-11T23:07:41Z

This PR adds a check to the MergedChoiceTable constructor to make sure there aren't any column names that overlap between the observations and alternatives tables.

It's ok for the chosen_alternatives column of the observations table to have the same name as the index of the alternatives table, though.

Discussion

This aligns ChoiceModels with the strategy for overlapping column names discussed in UrbanSim Templates issue #67.

Other changes

All the existing unit tests pass, and I wrote some new ones to test permutations of overlapping column names.

Versioning

updates the version number to 0.2.dev7

coveralls · 2018-12-11T23:14:20Z

Coverage increased (+0.3%) to 75.189% when pulling d98e956 on column-name-safety into 5915217 on master.

mxndrwgrdnr

LGTM.

Eh2406 · 2018-12-12T18:12:19Z

choicemodels/tools/mergedchoicetable.py

+        # Check for duplicate column names
+        obs_cols = list(observations.columns) + list(observations.index.names)
+        alt_cols = list(alternatives.columns) + list(alternatives.index.names)
+        dupes = [c for c in obs_cols if c in alt_cols]


Why is this using lists (O(a*o)) instead of sets (O(min(a,o)))? like:

obs_cols = set(observations.columns) + set(observations.index.names) alt_cols = set(alternatives.columns) + set(alternatives.index.names) dupes = obs_cols & in alt_cols

@Eh2406 Thanks, that's definitely better! Unfortunately i just merged this PR, but i'll update this in the next one

smmaurer added 2 commits December 11, 2018 14:57

Check for duplicate column names

3d86071

Updating version number

d98e956

smmaurer requested a review from mxndrwgrdnr December 11, 2018 23:07

mxndrwgrdnr approved these changes Dec 12, 2018

View reviewed changes

smmaurer merged commit 769800d into master Dec 12, 2018

smmaurer deleted the column-name-safety branch December 12, 2018 17:54

Eh2406 reviewed Dec 12, 2018

View reviewed changes

smmaurer mentioned this pull request Dec 12, 2018

Strategy for dealing with column names that overlap between tables UDST/urbansim_templates#67

Open

This was referenced Jan 21, 2019

[0.1.1.dev0] Allow join keys as data filters in MNL simulation UDST/urbansim_templates#85

Merged

[0.2.dev9] Small fixes and doc improvements #57

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[0.2.dev7] MergedChoiceTable check for duplicate column names #54

[0.2.dev7] MergedChoiceTable check for duplicate column names #54

smmaurer commented Dec 11, 2018

coveralls commented Dec 11, 2018

mxndrwgrdnr left a comment

Eh2406 Dec 12, 2018

smmaurer Dec 12, 2018

smmaurer Dec 12, 2018

[0.2.dev7] MergedChoiceTable check for duplicate column names #54

[0.2.dev7] MergedChoiceTable check for duplicate column names #54

Conversation

smmaurer commented Dec 11, 2018

Discussion

Other changes

Versioning

coveralls commented Dec 11, 2018

mxndrwgrdnr left a comment

Choose a reason for hiding this comment

Eh2406 Dec 12, 2018

Choose a reason for hiding this comment

smmaurer Dec 12, 2018

Choose a reason for hiding this comment

smmaurer Dec 12, 2018

Choose a reason for hiding this comment