Implement bulk generation module #17

AusIV · 2016-06-20T15:38:07Z

I needed to generate several million names for a sample dataset,
and looking up names out of the file every time was very time
consuming.

I left the original logic untouched, as it's the best approach for
a quick, one-off lookup. I've added a separate module, 'names.bulk'
which offers the same interface, but caches the entire file in memory
and picks names with a binary search instead of a scan.

Should resolve: #3

I needed to generate several million names for a sample dataset, and looking up names out of the file every time was very time consuming. I left the original logic untouched, as it's the best approach for a quick, one-off lookup. I've added a separate module, 'names.bulk' which offers the same interface, but caches the entire file in memory and picks names with a binary search instead of a scan.

coveralls · 2016-06-20T15:39:46Z

Coverage decreased (-1.3%) to 98.75% when pulling b0eb4b4 on AusIV:master into c485a43 on treyhunner:master.

coveralls · 2016-06-20T15:50:07Z

Coverage remained the same at 100.0% when pulling 79934f8 on AusIV:master into c485a43 on treyhunner:master.

coveralls · 2016-06-20T15:54:30Z

Coverage remained the same at 100.0% when pulling 495e650 on AusIV:master into c485a43 on treyhunner:master.

AusIV · 2016-06-20T15:58:01Z

I'm a bit perplexed. The Travis-CI build seems to be failing on Python 3.2 on something that has nothing to do with my changes. It looks like the coverage package is failing on Python 3.2

coveralls · 2016-06-21T15:27:50Z

Coverage remained the same at 100.0% when pulling 808e1b0 on AusIV:master into c485a43 on treyhunner:master.

dblackdblack · 2017-08-01T19:52:59Z

@treyhunner this looks like a great PR

altendky · 2019-08-08T19:08:48Z

So I failed to bother to look at PRs before writing my own ([WIP]) in #23 which is 'similar' to this. Here at least a bisect is used but I think the random.choices() with weights specified might be better? I don't know how they actually implement it but maybe something to consider.

alexisszabo · 2023-09-19T16:32:42Z

@treyhunner Any chance we could approve this PR? This looks like a big performance boost on apps that call the library repeatedly. Anything I could do to help if QA needed?

Increase code coverage

79934f8

Fix integer division in python 3

495e650

Use stdlib bisect instead of custom implementation

808e1b0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement bulk generation module #17

Implement bulk generation module #17

AusIV commented Jun 20, 2016

coveralls commented Jun 20, 2016 •

edited

Loading

coveralls commented Jun 20, 2016 •

edited

Loading

coveralls commented Jun 20, 2016 •

edited

Loading

AusIV commented Jun 20, 2016

coveralls commented Jun 21, 2016 •

edited

Loading

dblackdblack commented Aug 1, 2017

altendky commented Aug 8, 2019

alexisszabo commented Sep 19, 2023

Implement bulk generation module #17

Are you sure you want to change the base?

Implement bulk generation module #17

Conversation

AusIV commented Jun 20, 2016

coveralls commented Jun 20, 2016 • edited Loading

coveralls commented Jun 20, 2016 • edited Loading

coveralls commented Jun 20, 2016 • edited Loading

AusIV commented Jun 20, 2016

coveralls commented Jun 21, 2016 • edited Loading

dblackdblack commented Aug 1, 2017

altendky commented Aug 8, 2019

alexisszabo commented Sep 19, 2023

coveralls commented Jun 20, 2016 •

edited

Loading

coveralls commented Jun 20, 2016 •

edited

Loading

coveralls commented Jun 20, 2016 •

edited

Loading

coveralls commented Jun 21, 2016 •

edited

Loading