Use buffering (`bufio`?) for reading image tarball on `export` #1339

abitrolly · 2022-04-09T17:00:47Z

In #1274, when the image tarball is read from stdin, it is first saved to a file, because the export algorithm uses random file access, as described in #1274 (comment).

To make the export more efficient, the algorithm could cache image bytes in memory until they are no longer needed. For example, once manifest.json is parsed, keep the parsed structure in memory and discard the cached bytes that contained it. For well-aligned images this should save both time and memory. For badly aligned images the performance should be the same as with the current temp file, because on Linux the temp file is still written to memory when the temp directory is a tmpfs.

bufio (https://pkg.go.dev/bufio) could potentially help, but it looks like a library for string scanning. I am not sure it can handle several GB of in-memory cache efficiently.

And here is a good article with code that makes random tar access visible in debug mode: https://blog.gopheracademy.com/advent-2017/seekable-http/

github-actions · 2022-07-09T01:30:31Z

This issue is stale because it has been open for 90 days with no
activity. It will automatically close after 30 more days of
inactivity. Keep fresh with the 'lifecycle/frozen' label.

abitrolly · 2022-08-14T09:40:38Z

Not sure I can reopen this, but I gave it a try without bufio in #1429.

TarBuffered scans the stream (`io.Reader`) once for a filename and saves unused sections in memory for later access. This should speed up parsing a bit, because right now the tarball is scanned several times, and it should save time and resources when parsing well-formed images from the network. See google#1339.

github-actions bot added the lifecycle/stale label Jul 9, 2022

github-actions bot closed this as completed Aug 8, 2022

abitrolly mentioned this issue Aug 14, 2022

tarball: Streaming image parser PoC #1429

Closed
