Should non-root elements be clipped when capturing them? #73

vmpstr · 2021-11-16T02:52:45Z

Related to #72: for non-root elements, should we be able to clip them too, since they can be massive as well?

khushalsagar · 2021-11-25T00:35:10Z

This could be a good performance optimization but also something we can punt to v2.

vmpstr · 2021-11-25T01:36:42Z

Presumably we have to clip to max texture size for old content, right? I think figuring out how and when to clip is within scope for v1

khushalsagar · 2021-11-25T02:01:40Z

The fact that we have to clip to max texture size is an implementation shortcoming that could be fixed. We could cache the content in a tiled image instead of a single texture, right? That's why I was seeing this as a perf optimization.
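To make the tiled-image idea concrete, here is a minimal sketch of splitting oversized content into tiles that each fit within a texture limit. It is an illustration only, not how any browser implements this: the `Tile` type, the `tile_rects` helper, and the 4096 limit in the usage note are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tile:
    x: int
    y: int
    width: int
    height: int


def tile_rects(content_width, content_height, max_texture_size):
    """Split a content area into row-major tiles no larger than max_texture_size per side."""
    tiles = []
    for y in range(0, content_height, max_texture_size):
        for x in range(0, content_width, max_texture_size):
            tiles.append(
                Tile(
                    x,
                    y,
                    min(max_texture_size, content_width - x),
                    min(max_texture_size, content_height - y),
                )
            )
    return tiles
```

For example, `tile_rects(10000, 3000, 4096)` covers a 10000x3000 element with three tiles in a single row, the last one narrower than the limit.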

vmpstr · 2021-11-25T02:43:57Z

I think elements exceeding max texture size may be pretty frequent on the web (don't quote me on that). So, is the suggestion to implement a tiled image solution then?

Basically, my concern is that v1 needs to deal with elements that are larger than max texture size, but whether it is enforced clipping, or tiled textures, I don't have a strong opinion

khushalsagar · 2021-11-25T16:40:13Z

It's not going to be feasible to do tiled image in v1 for sure. So the fallback will have to be that we clip to max texture size. If that implies that we're not spec compliant (since we are implicitly clipping the image) then yes, we have to deal with this for v1.

My proposal would be to clip at max texture size in v1. Later we can add a performance hint that allows clipping further, or think about a tiled image if we see legit use-cases which need content larger than max texture size.

What's your preferred option?

vmpstr · 2021-11-25T17:53:32Z

My preferred option is that we clip non-root elements to the viewport in the same way we clip root elements to the viewport. We should also have an ability to add a padding to this clip, for both root and non-root content, in v1.

I don't see a need to have different behaviors for root and non-root elements here. I think the only reason we have them is that we know that roots will be large and we will have to clip them to max texture size. All I'm proposing is that non-roots can be equally large almost as often.

I think that would make for a good API that also naturally precludes elements flying from/to "really far away" which may be outside of the painted region.

If the stepping stone to get there is always clip root, never clip non-root, then that's fine. The implementation for clipping non-root elements has to exist in v1 anyway, since clipping to max texture size needs to be done at some offset from element's origin, so I propose we clip centered around the viewport.

khushalsagar · 2021-11-25T18:11:52Z

Ok. So what you're proposing is that an element's snapshot bounds is its visible rect in the viewport. I'm assuming that includes the ink overflow from self effects and will also include padding from overflow-clip-margin. What happens if there is a transform on it (like rotation or skew), so its visible rect is not axis aligned with the viewport?

khushalsagar · 2021-11-25T18:30:57Z

Btw, this is also related to the sizing of boxes where this content goes (using a replaced element). But let's figure out the desired paint bounds for a shared element first. I filed #99 to discuss box sizing after this.

vmpstr · 2021-11-25T18:39:31Z

It's a good question about what to do if there's a transform. What do we do if there's a transform on body like this: https://www.google.com/search?q=askew ? Or is body here a content of the root?

In the case of transform, the general idea is that we want to capture the content clipped in such a way that it has the minimal clip which still paints all of the viewport visible contents. I don't know how to describe this operation, but I think given the transform it should be possible to find such a rect.

As for what it includes, it includes the same things it would've included otherwise. It's just clipped afterwards.

Is the visible rect in Chromium specified relative to the layer itself or to its transform parent?

khushalsagar · 2021-11-25T19:02:52Z

body is a content of the root indeed. We don't have this issue for the root snapshot if it's going to be the output of the root stacking context since there can't be any transform on it.

I agree with the motivation for the general idea (to spend less memory on snapshots) but trying to think through how to implement/define this makes it sound hard. The problem is that an element's visible quad (let's call it quad because of the non axis aligned transforms) is dependent on multiple properties that come from its hierarchy. There are transforms and clips coming from the ancestor chain (gonna exclude effects like clip-path since that's even more complicated).

So the inputs we need for sure are the element's paint rect (which is expanded from the overflow outside its bounding box) and the viewport rect. And the output we want is a clip rect in the paint rect space (or content space in chromium terminology)? What other inputs do we need to define this?

"Is the visible rect in Chromium specified relative to the layer itself or to its transform parent?" : Which visible rect? Can you link to the code?

khushalsagar · 2021-11-25T19:30:13Z

Actually, there may be a way to lean on Element.getBoundingClientRect() to define this. Though I don't think that rect includes overflow.

vmpstr · 2021-11-25T19:51:18Z

For the Chromium part, I think that we paint +/- some amount from the viewport for each of the composited layers. So we have an ability to detect the viewport rects in layers post transform (otherwise we'd have to paint the whole thing under any non-translate transform, right?).

getBoundingClientRect is definitely a part of this solution, we should know the visible quads that comprise the element, so the union of those into an axis aligned rect should give us the answer here?

I don't mean to get hung up on details, since that's always hard to do in an issue like this one. But what's the alternative? We just let the max texture size clip happen somewhere? That is obviously worse to me, so we need to come up with some sort of an answer here.

khushalsagar · 2021-11-25T20:22:47Z

The reason I'm prodding on the details here is that while doing a clip in viewport space would be ideal, it's harder to define. I'm happy to consider it if we can sort out the details and maybe they are not as complicated as I think. Would appreciate your help in narrowing them down.

The alternate proposal is to size the image based on element() with the exception that the size is expanded to include ink overflow from self effects. Then this image is clipped to max texture size. The case this could miss is if you have a massive element and the part of it in the visible viewport is what gets clipped when we clamp to max texture size.

vmpstr · 2021-11-25T20:50:45Z

I'd start by looking at the interest rect code in painting and see if there are any useful insights. I'm pretty sure we have the ability to take a rect in element's local space and map it to an axis aligned rect in viewport space. So the inverse, taking a viewport rect and mapping it down to element space should also be possible

(by we, I mean Chromium here)

khushalsagar · 2021-11-25T21:31:38Z

Ok. So let's summarize the 2 options with as much detail as we can :

Option 1
The image is sized based on element() with its decorated bounding box expanded to include ink overflow from effects and overflow-clip-margin.

Option 2
The decorated bounding box mentioned above is mapped to an axis aligned rect in viewport space. This can be the smallest rectangle that completely includes this element's visible quad. Let's call this its painted rect. Then the painted rect is clipped by the viewport and the intersection is mapped back to the decorated bounding box space. This clipped rect in decorated bounding box space is what's cached. Is that a fair summary?

vmpstr · 2021-11-25T21:37:33Z

For the second part of the second option, you assumed that a rect in viewport space stays a rect in element space but other than that, something similar to that yeah.

I don't know if it's easier to just map the viewport bounds rect into element space and do clipping there. It depends on what space the developer will provide the padding in

khushalsagar · 2021-11-25T22:13:06Z

"you assumed that a rect in viewport space stays a rect in element space" : You mean when the painted rect clipped by viewport is being mapped back to element space (or decorated bounding box space)? We'll have to make this a rect somehow...

That was a good question : "depends on what space the developer will provide the padding in". I'm not sure what's the right answer. Seems like viewport space is better, if the idea is for the developer to say : "this is how much I want to move this element when it animates". But when we do preserve the hierarchy, it probably makes sense to do all of this in the ancestor shared element's space instead of viewport space?

I also realized that this will affect both the sizing and transform on the transition elements which render the snapshot. Right now we're saying that the transform maps the element's border-box to viewport (or ancestor) space. But now it will have to be the transform that maps this clipped captured rect to the corresponding quad...?

I also want to summarize the downsides of option 1 for why this can't be punted to v2 :

We'll use unnecessary memory for massive elements.
If a massive element's visible rect is outside the area clipped by max texture size then it won't be rendered correctly.

Is there anything else? I'm asking so that if we have to make a call to punt it we understand what use-cases will be broken.

hvanops · 2021-11-26T18:38:57Z

@khushalsagar (sorry to break this discussion up with some meta but...) When you share this with folks on Monday and we discuss this on Tuesday, could you make sure to describe both why it should be in v1 and also the options? I think those are both crucial pieces to this convo that others will need for context.

ianvollick · 2021-11-26T21:39:42Z

Hopefully layout/paint experts can correct me if this is wrong, but I think that in this case we can do the following:

1. start with the viewport rect in viewport space (possibly inflated)
2. get the transformation matrix to go from viewport to element space
3. project the inflated rect into element space (may indeed be a quad and not a rect)
4. take the axis-aligned bounding box of this quad
5. intersect this with whatever we have in element space

In blink, at least, I think TransformationMatrix::MapRect handles some of this.

(edit: grammar and s/rect/quad/)
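The steps above can be sketched roughly as follows. This assumes a plain 2D affine transform written in CSS `matrix()` order; the `Rect`, `Affine`, and `visible_capture_rect` names are made up for illustration and are not Blink APIs. Real transforms may be 3D or non-invertible, and this sketch does not handle those cases.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Rect:
    left: float
    top: float
    right: float
    bottom: float


@dataclass(frozen=True)
class Affine:
    """2D affine map: x' = a*x + c*y + e, y' = b*x + d*y + f."""
    a: float
    b: float
    c: float
    d: float
    e: float
    f: float

    def apply(self, x, y):
        return (self.a * x + self.c * y + self.e, self.b * x + self.d * y + self.f)

    def inverse(self):
        det = self.a * self.d - self.b * self.c
        if det == 0:
            raise ValueError("transform is not invertible")
        a, b = self.d / det, -self.b / det
        c, d = -self.c / det, self.a / det
        return Affine(a, b, c, d, -(a * self.e + c * self.f), -(b * self.e + d * self.f))


def visible_capture_rect(element_rect, element_to_viewport, viewport, inflate=0.0) -> Optional[Rect]:
    # Step 1: start with the (possibly inflated) viewport rect in viewport space.
    v = Rect(viewport.left - inflate, viewport.top - inflate,
             viewport.right + inflate, viewport.bottom + inflate)
    # Step 2: get the viewport -> element space transform.
    to_element = element_to_viewport.inverse()
    # Step 3: project the four corners; the result is a quad in general.
    quad = [to_element.apply(x, y) for x, y in
            ((v.left, v.top), (v.right, v.top), (v.right, v.bottom), (v.left, v.bottom))]
    # Step 4: take the axis-aligned bounding box of the quad.
    xs = [p[0] for p in quad]
    ys = [p[1] for p in quad]
    # Step 5: intersect with the element's own rect in element space.
    left, top = max(element_rect.left, min(xs)), max(element_rect.top, min(ys))
    right, bottom = min(element_rect.right, max(xs)), min(element_rect.bottom, max(ys))
    if left >= right or top >= bottom:
        return None  # nothing of the element falls inside the inflated viewport
    return Rect(left, top, right, bottom)
```

With a pure translation, for example a 10000x500 element shifted 3000px left and 100px down relative to an 800x600 viewport, the result is the slice of the element from x=3000 to x=3800. With a rotation the bounding box in step 4 can be larger than the truly visible quad, so this over-captures rather than under-captures.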

jakearchibald · 2021-11-29T14:06:56Z

@vmpstr

"My preferred option is that we clip non-root elements to the viewport in the same way we clip root elements to the viewport. We should also have an ability to add a padding to this clip, for both root and non-root content, in v1."

Hm, I think that'd create weird cases with headers that are slightly scrolled out of view. It creates a bit of a footgun as the developer might not test for that.

Would it be sufficient to define two clipping modes:

max-clipped - The element is captured from 0,0 but clamped to some maximum. Could we define this as something like 120% of viewport width by 120% of viewport height? This is the default for non-root elements.
viewport-clipped - The area the element intersects with the viewport is captured. This is the default for root.
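As a rough illustration of how the two modes differ, the sketch below computes the captured rect in the element's local coordinates for an untransformed element. The function names mirror the mode names above but are otherwise hypothetical, as is the `max_percent` parameter; the 120% figure is the tentative value floated above, not a decided constant.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Rect:
    left: int
    top: int
    right: int
    bottom: int


def max_clipped(element_w, element_h, viewport_w, viewport_h, max_percent=120) -> Rect:
    """Capture from the element's 0,0, clamped to max_percent of the viewport size."""
    return Rect(
        0,
        0,
        min(element_w, viewport_w * max_percent // 100),
        min(element_h, viewport_h * max_percent // 100),
    )


def viewport_clipped(element_x, element_y, element_w, element_h,
                     viewport_w, viewport_h) -> Optional[Rect]:
    """Capture only the part of the element that intersects the viewport.

    element_x / element_y give the element's top-left corner in viewport
    coordinates; the returned rect is in the element's local coordinates.
    """
    left = max(element_x, 0)
    top = max(element_y, 0)
    right = min(element_x + element_w, viewport_w)
    bottom = min(element_y + element_h, viewport_h)
    if left >= right or top >= bottom:
        return None  # element is entirely outside the viewport
    return Rect(left - element_x, top - element_y, right - element_x, bottom - element_y)
```

For an 800x100 header scrolled 40px out of an 800x600 viewport, `viewport_clipped(0, -40, 800, 100, 800, 600)` keeps only local rows 40 to 100, while `max_clipped(800, 100, 800, 600)` keeps the whole header. For an 800x5000 element scrolled 3000px up, `max_clipped` keeps rows 0 to 720 and so misses the rows that are actually on screen.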

"I don't see a need to have different behaviors for root and non-root elements here. I think the only reason we have them is that we know that roots will be large and we will have to clip them to max texture size. All I'm proposing is that non-roots can be equally large almost as often."

Agreed. There's crossover with #85 here. Maybe we're overcomplicating things by treating the root as special.

vmpstr · 2021-11-29T16:06:52Z

I like the two modes approach, it keeps the API simple and allows the developer to not use too much memory if they are careful even on large (non-root) elements.

A fine-grained control of precise padding amount can then be a possible v2 feature.

ianvollick · 2021-11-29T17:05:56Z

Stepping back, this seems analogous to checkerboarding and feels like a UA implementation detail.

That said, I do agree that it makes sense to capture reasonably-sized elements even if they're outside the viewport. For items that are smaller than the maximum size, clearly you can capture the whole element. For items larger than the texture size, taking the-part-closest-to-the-viewport seems like a reasonable heuristic. This is more complicated than what I've suggested above (and I'm not sure it's important to dig into the details here), but seems possible and could work identically for both root and non-root elements.

khushalsagar · 2021-11-29T17:35:10Z

For the 2 modes approach, the max-clipped sounds similar to clamping to max texture size except we're using viewport bounds instead of max texture size. Sounds reasonable but it wouldn't address the concern where the visible portion of the element is outside this rect though. And I thought that was the functional issue with clamping to max texture size (other than the memory concerns).

The viewport-clipped mode helps ensure we always capture what's visible and allows padding (maybe a UA default but exposing fine-grained control if that becomes useful).

I'm trying to understand how a developer would decide which mode to use. If the element is small enough use max-clipped. If it's big and the visible portion won't get captured with max-clipped then use viewport-clipped?

khushalsagar · 2021-11-29T17:38:30Z

@ianvollick are you suggesting we still have the 2 modes but instead of asking developers to choose, the UA should do it internally? If the whole element fits within maximum size (based on viewport bounds or max texture size) then capture the whole element. If not use viewport-clipped.

hvanops · 2021-11-29T18:15:03Z

"A fine-grained control of precise padding amount can then be a possible v2 feature."

If we have these two modes in v1, what is the plan to achieve just one mode in v2? If devs are choosing the mode in v1, will they have to undo anything for v2? Just looking to reduce the amount of work required overall

ianvollick · 2021-11-29T18:17:08Z

I just spoke with Khushal and one point I should clarify is that the approach I'm suggesting seems like it will naturally produce the behavior of the two modes Jake described, but with one difference: if the element is very large and we use the close-to-the-viewport heuristic, the rectangle we capture won't necessarily be positioned at the origin in element space -- rather, it will be the max-sized rectangle closest to the viewport (determining this rectangle isn't as simple as I described above, but I don't think the details are important here).

I.e., we won't need two modes with this approach.

khushalsagar · 2021-11-29T18:24:44Z

+1 to Ian's point. I'll summarize the idea to ensure we can all be on the same page :

The element's captured painted content has to be capped to some bounds. That can be based on viewport bounds or max texture size. I hope this can be a UA defined heuristic instead of explicitly spelling out which rect.
The rectangle within the element's space which is captured will use the close-to-the-viewport heuristic instead of being the element's origin. This ensures that the element's visible portion is captured (the point I had missed above).

@vmpstr @jakearchibald @hvanops does this sound reasonable to you?

khushalsagar · 2021-11-29T18:35:11Z

Not to miss the point about unnecessary memory use earlier.

"The element's captured painted content has to be capped to some bounds" : It makes sense for this heuristic to start with something that captures more than the developer might have intended (if we use max texture size or a padded viewport rect). And we can add the option for the developer to hint to capture less as a memory optimization later.

jakearchibald · 2021-11-30T11:27:42Z

Yeah, I think a heuristic approach is fine. We can always change from auto later.

A couple of things that aren't clear to me:

If an element is 300x300, but partially out of the viewport, can it be captured in its entirety?

If an element is 20x20, but 20,000 pixels north of the viewport, can it be captured?

I'm hoping we can have a ruleset like this:

If the whole element can be reasonably captured in its entirety, return its capture.
Otherwise, return a partial capture that includes the in-viewport area, but expanded by some amount.

We need to think about how this will impact the size of the captured elements. I guess this gets less complicated using @flackr's model of detaching the texture size from the replaced element #99, although that comes with complications of its own.

khushalsagar · 2021-11-30T17:23:49Z

"If an element is 300x300, but partially out of the viewport, can it be captured in its entirety?" : Absolutely.

"If an element is 20x20, but 20,000 pixels north of the viewport, can it be captured?" : Yes to this too. We already have heuristics which check how close an element is to the viewport to decide whether to paint/raster it. This case likely gets optimized out today but we can force it to paint/raster if it's a shared element.

For the ruleset you mentioned, I think that's what is being proposed :

If the whole element fits within our cap size (I'm leaning on max texture size) then it's captured in entirety. This is irrespective of where the element is with respect to the viewport.
If the element doesn't fit, then it's partially captured. And we use a heuristic to capture the rect which is closest to the viewport. So if the element is big and it's not in the viewport, you still get a partial capture.
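One possible reading of these two rules, sketched under simplifying assumptions: the element is axis-aligned, the viewport has already been mapped into element space, and "closest to the viewport" is taken to mean a cap-sized window centered on the viewport and then clamped to the element's bounds, applied per axis. The thread leaves the exact heuristic to the UA, so `axis_window`, `capture_rect`, and the cap of 4096 used in the usage note are illustrative assumptions rather than the agreed behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    left: float
    top: float
    right: float
    bottom: float


def axis_window(size, cap, view_start, view_end):
    """Pick the [start, end) span to capture along one axis of the element."""
    if size <= cap:
        return (0, size)  # rule 1: the element fits, capture all of it
    # Rule 2: center a cap-sized window on the viewport, then clamp it so it
    # stays inside the element. A far-away viewport clamps to the nearest edge.
    center = (view_start + view_end) / 2
    start = min(max(center - cap / 2, 0), size - cap)
    return (start, start + cap)


def capture_rect(element_w, element_h, viewport_in_element, cap):
    """Return the captured rect in element space for an element of the given size."""
    left, right = axis_window(element_w, cap, viewport_in_element.left, viewport_in_element.right)
    top, bottom = axis_window(element_h, cap, viewport_in_element.top, viewport_in_element.bottom)
    return Rect(left, top, right, bottom)
```

With a cap of 4096, a 20x20 element is captured whole no matter where the viewport is, and an 800x20000 element with the viewport at rows 10000 to 10600 yields rows 8252 to 12348, which contains the visible portion with room on either side.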

This should make sure all functional use-cases are addressed. There is still room for perf optimizations where a developer can hint that only in-viewport area should be captured. But I'm hoping we can add that later.

You're absolutely right that we'll need to think through how this gets reflected in the box sizes. I actually think it gets complicated if we use shared element box size instead of texture size for pseudo elements. But we can discuss the details for it once the painted content size is finalized.

jakearchibald · 2021-12-01T10:08:43Z

Yep, we're on the same page here. Sounds good!

khushalsagar · 2021-12-02T16:52:21Z

The conclusion for this was to always capture the full element content and allow the UA to limit it to a capped size (max texture size) if needed. The region captured if the content needs to be clipped to a capped size is the area closest to the viewport.

vmpstr added the open question label Nov 16, 2021

jakearchibald mentioned this issue Nov 29, 2021

Should we be able to expand root capture area #72

Closed

khushalsagar closed this as completed Dec 2, 2021

khushalsagar mentioned this issue Dec 8, 2021

Proposal: shared element transitions w3c/csswg-drafts#6464

Closed
