Support for pandas `_query_iterator` #145

TomAugspurger · 2015-03-18T20:38:47Z

It would be nice if this worked:

gen = pd.read_sql_query(complex_sql, engine, chunksize=10)
odo.into('data.hdf5::/test', gen)

Right now I don't think odo is aware that gen, a _query_iterator is just an iterator of DataFrames. You get back that _query_iterator from pd.read_sql_table or pd.read_sql_query when you specify a chunksize.

I used a workaround Matthew suggested on the mailing list

def f():
    gen = pd.read_sql_query(complex_sql, engine, chunksize=10)
    for df in gen:
        yield df

odo.into('data.hdf5::/test', odo.chunks(pd.DataFrame)(f))

For testing, this doesn't even need to be a complex query, you can just use pd.read_sql_table with a chunksize.

The text was updated successfully, but these errors were encountered:

cpcloud added this to the 0.3.2 milestone Mar 21, 2015

cpcloud added the enhancement label Mar 21, 2015

cpcloud modified the milestones: 0.3.2, 0.3.3 Apr 17, 2015

cpcloud modified the milestones: 0.3.3, 0.3.4 Jul 1, 2015

cpcloud modified the milestones: 0.3.4, 0.4.0 Sep 15, 2015

cpcloud modified the milestones: 0.4.0, 0.3.5 Oct 5, 2015

cpcloud modified the milestones: 0.4.0, 0.4.1 Dec 4, 2015

kwmsmith modified the milestones: 0.4.1, 0.4.2, 0.5.0 Feb 2, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for pandas `_query_iterator` #145

Support for pandas `_query_iterator` #145

TomAugspurger commented Mar 18, 2015

Support for pandas _query_iterator #145

Support for pandas _query_iterator #145

Comments

TomAugspurger commented Mar 18, 2015

Support for pandas `_query_iterator` #145

Support for pandas `_query_iterator` #145