Calculate `ProgressBar` ETA from most recent updates rather than for whole period #4120

davep · 2024-02-05T11:48:46Z

There are two main changes in this PR:

- The method used to calculate the ETA now looks at the most recent updates to the percentage and works off how long they took. Underlying this is a class that helps estimate a time to completion based on a timed window, capped at a maximum number of samples to constrain memory usage, and with a final fallback position should the window become near-exhausted.
- Added a reset_eta parameter to ProgressBar.update so that the dev can signal that the ProgressBar is being reused in some way, so that the ETA calculation will always start fresh (fixes #4096, ProgressBar ETA goes adrift if a progress bar is "reused").

The PR also includes a fix for what seemed to be a bug in how ETAStatus stopped refreshing itself: before now it looks like it never did (there seems to have been some confusion, with the widget using its own timer for the refresh but trying to stop the updates via auto_refresh).

Fixes Textualize#4096

Rather than calculating just from when progress started, use a recent set of samples instead; this means that the progress bar's ETA should better reflect what's happening around now, rather than what's happened overall. This should have almost no impact on constant-time progress, but should better reflect (for example) the state of affairs when dealing with accelerating or decelerating progress bars.
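To make the difference concrete, here is a minimal sketch (not the code from this PR) that compares an estimate taken over the whole period with one taken over only the most recent samples. The `Sample` fields, the `estimate` function and the data are all invented for illustration.

```python
from typing import List, NamedTuple, Optional


class Sample(NamedTuple):
    """A progress reading; the field names are assumptions for this sketch."""

    moment: float  # When the reading was taken, in seconds.
    value: float  # How far along we were at that moment.


def estimate(samples: List[Sample], total: float) -> Optional[float]:
    """Estimate the seconds remaining from the first and last samples given."""
    if len(samples) < 2:
        return None
    first, last = samples[0], samples[-1]
    elapsed = last.moment - first.moment
    covered = last.value - first.value
    if elapsed <= 0 or covered <= 0:
        return None
    return (total - last.value) / (covered / elapsed)


# Invented data for a decelerating job: quick at first, then slowing down.
samples = [
    Sample(0.0, 0.0),
    Sample(1.0, 40.0),
    Sample(2.0, 60.0),
    Sample(3.0, 70.0),
    Sample(4.0, 75.0),
]

whole_period = estimate(samples, total=100.0)  # Average speed since the start.
recent_only = estimate(samples[-2:], total=100.0)  # Speed over the last second.

print(f"whole period: {whole_period:.2f}s, recent only: {recent_only:.2f}s")
```

With this data the whole-period estimate is about 1.3 seconds, while the estimate from the last two samples is 5 seconds, which is closer to what a job that has slowed down is likely to need.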

I had it in my head it was a reference to the ETA widget; it isn't; it's a flag.

It looks like the ETAStatus widget has never properly stopped its update, because it was setting up a timer to do the refresh, but assigning None to auto_refresh, which it wasn't actually using.

There's enough of a difference in the calculation between Windows and other operating systems that it's a problem which doesn't relate to this actual test.

This should hopefully solve the Windows vs the World discrepancy during snapshot testing.

This reverts commit f593480.

I keep wanting to use distance_covered as if it's the total distance covered; overall; period. It isn't. It's the distance covered by the window. So here I rename it so I can't ever get confused about that again.

Swapping out the previous simple deque of times for the TimeToCompletion class. Still working off the percentage; mainly because this should work just as well as any other approach (the percentage is just a modified version of the actual position and total after all), but also because it's easier to plug it in this way and test. From here on in I think I can make the tests work again because I should be able to reach into the ETAStatus widget and plug a different class into _samples that returns a constant desired value for the estimated time to completion.
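The testing idea described here, swapping in an object that always reports the same estimate, might look something like the sketch below. Both classes are hypothetical stand-ins: `StatusLabel` is not the real ETAStatus widget, and the `estimated_time_to_complete` property name is an assumption rather than the interface of TimeToCompletion.

```python
class ConstantEstimate:
    """Test double: always reports the same estimated time to completion."""

    def __init__(self, seconds: float) -> None:
        self._seconds = seconds

    def record(self, value: float) -> None:
        """Accept a sample and ignore it; the estimate never changes."""

    @property
    def estimated_time_to_complete(self) -> float:
        return self._seconds


class StatusLabel:
    """Hypothetical stand-in for a widget that renders an ETA from _samples."""

    def __init__(self, samples: object) -> None:
        self._samples = samples

    def render(self) -> str:
        seconds = round(self._samples.estimated_time_to_complete)
        minutes, seconds = divmod(seconds, 60)
        hours, minutes = divmod(minutes, 60)
        return f"{hours:02}:{minutes:02}:{seconds:02}"


label = StatusLabel(samples=None)
# Reach in and replace the estimator so the rendered text is deterministic.
label._samples = ConstantEstimate(3723)
print(label.render())  # 01:02:03
```

Because the rendered text no longer depends on the clock, a snapshot of it should not differ between platforms.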

This may change again, especially if I add the sliding window of time.

In anticipation of adding a time window too.

If we've hit a point where all the samples we have to hand are too old, given a time window to look at, fall back to everything we have to hand to make a best-efforts estimate. This feels better than just shrugging.

One edge case that was bugging me was how the progress bar ETC calculation would collapse if we ran out of samples in the time window (or, more correctly, whittled it down to just one sample). Rich's progress bar goes back to showing --:--:-- in this situation; which is a fair estimation in that it's a shrug because there's nothing useful to work off now. The problem here though is that people have often complained that it has "stopped" or is "broken" in some way (it isn't, it's starved). So, what I was wanting to do was to keep something going, that was ballpark correct, looked active, and would become increasingly pessimistic. The thinking here is that if someone sees a big number slowly getting bigger, they'd blame their underlying process and not the design of the ProgressBar itself. Here then is an approach to that situation where we keep a reference to the most-recently-dropped sample and, if we've only got the one sample left, we bring the older one back into consideration. This gives a nice "big number creeps up I think your download has stalled my friend" show.
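A rough sketch of that fallback follows, assuming a time window measured back from the newest sample and an estimate that stretches elapsed time up to "now". `WindowedEstimator` and its methods are hypothetical and are not the TimeToCompletion class from this PR.

```python
from typing import List, NamedTuple, Optional


class Sample(NamedTuple):
    moment: float
    value: float


class WindowedEstimator:
    """Hypothetical estimator that remembers the most-recently-dropped sample."""

    def __init__(self, destination: float, time_window_size: float) -> None:
        self._destination = destination
        self._time_window_size = time_window_size
        self._samples: List[Sample] = []
        self._last_dropped: Optional[Sample] = None

    def record(self, value: float, moment: float) -> None:
        self._samples.append(Sample(moment, value))
        # Drop anything older than the window, measured from the newest
        # sample, keeping hold of the last sample to fall out of it.
        oldest_time = moment - self._time_window_size
        while len(self._samples) > 1 and self._samples[0].moment < oldest_time:
            self._last_dropped = self._samples.pop(0)

    def estimate(self, now: float) -> Optional[float]:
        samples = self._samples
        if len(samples) == 1 and self._last_dropped is not None:
            # Starved window: bring the dropped sample back into consideration.
            samples = [self._last_dropped, *samples]
        if len(samples) < 2:
            return None
        first, last = samples[0], samples[-1]
        elapsed = now - first.moment
        covered = last.value - first.value
        if elapsed <= 0 or covered <= 0:
            return None
        return (self._destination - last.value) / (covered / elapsed)


estimator = WindowedEstimator(destination=100.0, time_window_size=5.0)
for moment, value in [(0.0, 0.0), (1.0, 10.0), (2.0, 20.0), (20.0, 21.0)]:
    estimator.record(value, moment)

# Only one sample survives the window, so the fallback kicks in, and the
# estimate grows as time passes without any new samples arriving.
early = estimator.estimate(now=20.0)
later = estimator.estimate(now=30.0)
```

In this sketch the estimate keeps growing while nothing new is recorded, which is the "increasingly pessimistic" behaviour described above rather than a return to --:--:--.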

darrenburns

Asked a few questions.

Also, not enough walruses.


darrenburns · 2024-02-12T10:49:58Z

tests/test_etc.py

+def test_no_go_past_end() -> None:
+    """It should not be possible to go past the destination value."""
+    with pytest.raises(ValueError):
+        TimeToCompletion(1).record(2)


Personally I feel clamping would be safer here compared to raising an exception. If a TimeToCompletion(100) receives a value of 100.0001, do we want things to blow up?

I could be convinced by that, although I wonder if given the lower-level nature of this code, it should be up to the caller to clamp its values?
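For illustration, the "caller clamps" option could look like the sketch below. `StrictRecorder` is a hypothetical stand-in for a low-level class that raises on overshoot; it is not TimeToCompletion itself, and `clamp` is a helper written for this example.

```python
from typing import List


def clamp(value: float, minimum: float, maximum: float) -> float:
    """Restrict value to the inclusive range minimum..maximum."""
    return max(minimum, min(value, maximum))


class StrictRecorder:
    """Hypothetical low-level recorder that rejects out-of-range values."""

    def __init__(self, destination: float) -> None:
        self.destination = destination
        self.values: List[float] = []

    def record(self, value: float) -> None:
        if value > self.destination:
            raise ValueError(f"{value} is past the destination {self.destination}")
        self.values.append(value)


recorder = StrictRecorder(100)
# The caller clamps first, so a slight overshoot is recorded as the destination.
recorder.record(clamp(100.0001, 0, recorder.destination))
```

This keeps the strict check in the low-level code while the higher-level caller decides whether a small overshoot is an error or just noise.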


willmcgugan

I think there may be too much emphasis on keeping samples.

We only need to discard old samples when we calculate the speed.


willmcgugan · 2024-02-12T13:06:44Z

src/textual/_time_to_completion.py

+        """
+        if samples := self._samples:
+            # Trim off any "too old" samples.
+            oldest_time = samples[-1].moment - self._time_window_size


Should the pruning really occur from the last sample, or should it occur from the current time?

It makes sense to me that it works off the last-known sample rather than off "now", as what we're making here is a heuristic for figuring out a likely time to completion, and going off "last best" makes more sense to me than going off nothing.
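The two choices of reference point can be compared with a small sketch. The `within_window` helper and the sample data are invented for this example; only the `moment` field name comes from the diff above.

```python
from typing import List, NamedTuple


class Sample(NamedTuple):
    moment: float
    value: float


def within_window(
    samples: List[Sample], reference_time: float, time_window_size: float
) -> List[Sample]:
    """Keep the samples no older than the window, measured from reference_time."""
    oldest_time = reference_time - time_window_size
    return [sample for sample in samples if sample.moment >= oldest_time]


samples = [Sample(0.0, 0.0), Sample(1.0, 10.0), Sample(2.0, 20.0)]
now = 60.0  # Long after the last update was recorded.

from_last_sample = within_window(samples, samples[-1].moment, 5.0)
from_now = within_window(samples, now, 5.0)

print(len(from_last_sample), len(from_now))  # 3 0
```

Pruning from the last sample keeps something to estimate from after a long quiet spell, whereas pruning from the current time leaves nothing at all.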

willmcgugan · 2024-02-12T13:08:33Z

src/textual/_time_to_completion.py

+            Self.
+        """
+        self._samples.append(sample)
+        return self._prune()


Do we need to call prune on every sample?

Could we defer pruning to when we need the samples? i.e. if there are 1000 samples between refreshes, could we prune just once?

This could go either way; there could be 1,000 "refreshes" between samples too. It feels to me like, on balance, code that is showing an estimated time is more likely to get new estimates than it is to record new samples.
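The deferred-pruning alternative raised in the question might be sketched as follows. `LazilyPrunedSamples` is hypothetical, and the `maxlen` cap is an assumption added so that memory stays bounded if nothing reads the samples for a while.

```python
from collections import deque
from typing import Deque, List, Tuple


class LazilyPrunedSamples:
    """Hypothetical sample store that only prunes when the samples are read."""

    def __init__(self, time_window_size: float, max_samples: int) -> None:
        self._time_window_size = time_window_size
        # maxlen caps memory even if nothing reads the samples for a while.
        self._samples: Deque[Tuple[float, float]] = deque(maxlen=max_samples)
        self.prune_count = 0

    def record(self, moment: float, value: float) -> None:
        """Record a sample; deliberately does no pruning."""
        self._samples.append((moment, value))

    def samples(self) -> List[Tuple[float, float]]:
        """Prune relative to the newest sample, then return what is left."""
        self._prune()
        return list(self._samples)

    def _prune(self) -> None:
        self.prune_count += 1
        samples = self._samples
        if not samples:
            return
        oldest_time = samples[-1][0] - self._time_window_size
        while len(samples) > 1 and samples[0][0] < oldest_time:
            samples.popleft()


store = LazilyPrunedSamples(time_window_size=100, max_samples=5_000)
for tick in range(1_000):
    store.record(moment=tick, value=tick)
window = store.samples()  # The only prune happens here.
```

The trade-off is the one described in the reply: this is cheaper when samples arrive more often than estimates are requested, and more expensive the other way round.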

willmcgugan · 2024-02-12T13:16:08Z

src/textual/_time_to_completion.py

+            return self._elapsed
+
+    @property
+    def _speed_now(self) -> float:


I don't follow the need for a "speed" and a "speed now".

It's possible I could prune this back now, but at least for coding/testing purposes it's been a lot easier to reason about "what's the estimate for the data in the window, as of the last recording point" vs "what's the estimate as of now?"
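The body of `_speed_now` is not shown in this thread, so the following is only a guess at the distinction being drawn: one speed measured up to the last recorded sample, and one stretched out to the current time. Function names and data are invented.

```python
from typing import List, NamedTuple


class Sample(NamedTuple):
    moment: float
    value: float


def speed(samples: List[Sample]) -> float:
    """Speed across the window, as of the last recorded sample."""
    first, last = samples[0], samples[-1]
    return (last.value - first.value) / (last.moment - first.moment)


def speed_now(samples: List[Sample], now: float) -> float:
    """Speed across the window, with elapsed time measured up to now."""
    first, last = samples[0], samples[-1]
    return (last.value - first.value) / (now - first.moment)


samples = [Sample(0.0, 0.0), Sample(10.0, 50.0)]
remaining = 100.0 - samples[-1].value

as_of_last_sample = remaining / speed(samples)
as_of_now = remaining / speed_now(samples, now=20.0)

print(as_of_last_sample, as_of_now)  # 10.0 20.0
```

The first figure is stable and easy to assert on in tests; the second keeps changing as the clock moves on, even when no new sample has been recorded.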

davep · 2024-03-11T08:39:14Z

Seems we're going to go with a totally different PR, so closing this unreviewed.

davep added 4 commits February 5, 2024 09:48

Add a reset method to the ETAStatus widget

4162071

Fixes Textualize#4096

Provide an interface for requesting an ETA reset

d655d82

Fixes Textualize#4096

Fix how we check if we're showing the ETA

652804d

I had it in my head it was a reference to the ETA widget; it isn't; it's a flag.

davep linked an issue Feb 5, 2024 that may be closed by this pull request

Revised progress bar ETA #4054

Closed

davep self-assigned this Feb 5, 2024

davep added the bug and enhancement labels Feb 5, 2024

davep added 22 commits February 5, 2024 11:57

Actually stop the ETA refresh timer

bd1ec0d

It looks like the ETAStatus widget has never properly stopped its update, because it was setting up a timer to do the refresh, but assigning None to auto_refresh, which it wasn't actually using.

Don't show the progress ETA in the tooltip snapshot test

91e80ea

There's enough of a difference in the calculation between Windows and other operating systems that it's a problem which doesn't relate to this actual test.

Adapt some progress examples to the new ETA approach

f593480

This should hopefully solve the Windows vs the World discrepancy during snapshot testing.

Revert "Adapt some progress examples to the new ETA approach"

f783284

This reverts commit f593480.

Add a class for estimating a time to completion

b030501

Have ETC speeds fall back to elapsed

5347989

Be very clear for the reader how distance remaining is calculated

d21b0b4

Rename distance_covered to distance_covered_in_window

36fb880

I keep wanting to use distance_covered as if it's the total distance covered; overall; period. It isn't. It's the distance covered by the window. So here I rename it so I can't ever get confused about that again.

Tweak the window size of the ETC calculation

175c779

Fix trying to get a sample that doesn't yet exist

f82bddf

Update the doc screenshot apps for the new ETAStatus

d3875ad

Revert to looking at the full range of time (for now anyway)

759fe7b

This may change again, especially if I add the sliding window of time.

Add some initial unit tests for TimeToCompletion

706862e

Rename window_size to sample_window_size

1388889

In anticipation of adding a time window too.

Remove the "as of now" tests; they make zero sense in testing

12a0b03

Move to a ETC calculation that also takes a window of time into account

91376a1

Add Rich repr support to TimeToCompletion

4cdb82d

Swap to using a window of time and max samples

a1d56b8

Add some missing return sections to docstrings

f266bd3

Fall back to all available samples if they're too old

7a0a1a5

If we've hit a point where all the samples we have to hand are too old, given a time window to look at, fall back to everything we have to hand to make a best-efforts estimate. This feels better than just shrugging.

Rework samples window to be time-first, samples-constrained

57d0cae

davep added 13 commits February 8, 2024 14:27

Update ProgressBar snapshot tests

a5f4a61

Make a single sample the whole window to help calculation

401db02

Clear out the last-dropped sample when clearing samples

8532e57

Add a test for the too-narrow window fallback estimation

818ade5

Fix a typo

188f0d9

Tidy up some docstrings

ba5680f

Expand the docstring a wee bit

a6717aa

Apply the no TLA rule

da1f6ab

Reduce property lookups in the recording method

40cf23a

Add a Raises section to the record docstring

e8e5a25

Merge branch 'main' into progress-progress

40012f2

Docstring tidying

8e94605

davep changed the title ~~WiP: Calculate ProgressBar ETA from most recent updates rather than for whole period~~ Calculate ProgressBar ETA from most recent updates rather than for whole period Feb 12, 2024

davep marked this pull request as ready for review February 12, 2024 09:55

davep requested review from willmcgugan, rodrigogiraoserrao and darrenburns February 12, 2024 09:55

darrenburns reviewed Feb 12, 2024


davep added 3 commits February 12, 2024 12:01

Apply the no-TLA rule to the tests for ETC

6284fd0

Make it explicit that the backward value test is about the value

4db035c

Don't allow time to go backwards when recording a sample

856d8c0

willmcgugan requested changes Feb 12, 2024


davep added 2 commits February 12, 2024 13:36

Swap the Sample class to being a NamedTuple

ceda0f9

Merge branch 'main' into progress-progress

5eedaf4

davep closed this Mar 11, 2024

davep deleted the progress-progress branch March 11, 2024 08:39

davep removed a link to an issue Mar 11, 2024

Revised progress bar ETA #4054

Closed
