-
Notifications
You must be signed in to change notification settings - Fork 123
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Retries on failed downloaded media #201
Comments
Can you verify with curl or wget that simply retrying these downloads actually helps? It may be that you need cookies to access certain resources. Also, what specific error codes are you seeing that cannot be reproduced in a browser? API requests are in fact retried up to 10 times, but |
Aside Here the curl output within the same machine, same connection:
Seems fine. I checked the file directly on web browser and it's normal. |
I have a feeling this has to do with headers (including user agent). It seems unlikely that Tumblr's servers would be resetting the connection because of e.g. too many requests. |
Try to throttle network your network while download this:
|
Additional log:
That's a safe-for-work tumblr blog. Most of error images that shown are exits (not 404), but couldn't downloaded. Missing image can be download manually. After it got downloaded, I have to Please fix this problem. |
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status - Fix "TypeError: 'bytes' object does not support item assignment" Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status - Fix "TypeError: 'bytes' object does not support item assignment" Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status - Fix "TypeError: 'bytes' object does not support item assignment" Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status - Fix "TypeError: 'bytes' object does not support item assignment" Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status - Fix "TypeError: 'bytes' object does not support item assignment" - Fix some incorrectly sorted imports Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status - Fix "TypeError: 'bytes' object does not support item assignment" - Fix some incorrectly sorted imports Fixes bbolli#201
Uses urllib3 and a temporary file to make downloading media smoother. Also supports various modes of timestamping, inspired by wget. Included revisions: - Fix urllib3 status code handling - Cleanup fsync logic - Specific type: ignore comments - Cleanup file_isfat and merge into util.py - note_scraper: Smarter handling of cookies and error messages - Remove enospc handler - Use os.supports_dir_fd - No more log_queue - Global, abstract no_internet - Support "canceled" WaitOnMainThread status - Fix "TypeError: 'bytes' object does not support item assignment" - Fix some incorrectly sorted imports Fixes bbolli#201
Once or twice,
tumblr_backup.py
failed to download media (images and videos, I haven't got any error on audio so far). It isn't ERROR 404, but 500, 104, etc. The media itself is exist, I checked it on web browser.I had to download them manually one-by-one, move to
media/
folder, then rename it to related post page and archive. I have dogrep
and edit the html manually. It lot's of works.Suggestion
Would it nice if there is retries option for media? Eg, if error, retry to download 3 or 5 times.
That's it. Sorry my English isn't good.
Edit:
Fixed typos and grammars.
The text was updated successfully, but these errors were encountered: