[pytorch] Add encoders for X #1219

ebezzi · 2024-07-01T20:26:05Z

Description

It would be useful to add custom encoders not only for obs, but also for the X matrix. This would allow e.g. to let the user decide whether they want a sparse or a dense without relying on a flag (currently return_sparse_X), provide a different formats for sparse matrices, etc.

The text was updated successfully, but these errors were encountered:

pablo-gar · 2024-07-22T14:25:09Z

We should have the encoder be flexible to allow on-the-fly tokenization of data. Form @AlejandroTL we heard the following:

Only thing I am missing [in the PyTorch loaders] a priori is something to handle different tokenization strategies during the dataloading. For instance, retrieving the counts, retrieving the indices of the genes given some previously stored dictionary, or binning the data on the fly, or creating a ranked list, etc.

ebezzi added the user request label Jul 1, 2024

ebezzi mentioned this issue Jul 1, 2024

[python] Support custom obs encoders #1191

Merged

pablo-gar added the pytorch label Jul 22, 2024

pablo-gar added the P0 Priority 0 - Critical, fix ASAP! label Aug 5, 2024

cathystoli added Priority backlog items tileDB work and removed Priority backlog items labels Aug 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pytorch] Add encoders for X #1219

[pytorch] Add encoders for X #1219

ebezzi commented Jul 1, 2024

pablo-gar commented Jul 22, 2024 •

edited

Loading

[pytorch] Add encoders for X #1219

[pytorch] Add encoders for X #1219

Comments

ebezzi commented Jul 1, 2024

Description

pablo-gar commented Jul 22, 2024 • edited Loading

pablo-gar commented Jul 22, 2024 •

edited

Loading