[pull] main from gpuweb:main #11

pull · 2023-08-22T13:40:21Z

See Commits and Changes for more details.

This PR adds execution tests for the bitcast built-in from and to f16 types. Issue: #1248, #1609

It's not guaranteed that the -0.0 value will survive once it is stored in a Scalar object. Issue: #2901
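As a small illustration of why the sign of zero is easy to lose in JavaScript, here is a sketch. The helper `isNegativeZero` is hypothetical and is not taken from the CTS; it only demonstrates general language behavior, not how the Scalar object stores values.

```typescript
// Hypothetical helper (not from the CTS): `===` cannot tell -0 from +0,
// so Object.is is used instead.
function isNegativeZero(x: number): boolean {
  return Object.is(x, -0);
}

const negZero: number = -0.0;

console.log(negZero === 0); // -0 and +0 compare equal
console.log(isNegativeZero(negZero)); // the sign is still there...
console.log(String(negZero)); // ...but string conversion drops it
console.log(isNegativeZero(negZero + 0)); // and -0 + +0 yields +0
```

Any step that formats, serializes, or re-computes the value can therefore turn -0.0 into +0.0 without raising an error.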

* Prevent details toggle if user is selecting text
* select all on click

This refactors the existing code to have a clearer separation from the non-AF test running code, and sets up for implementing vector support. Issue #1626

This reverts commit 75c5460.

* Remove orientation from several texture copy tests

The orientation argument for these tests was copied from tests which use an ImageBitmap source. Because createImageBitmap has an imageOrientation option, it made sense to test against all variants of it. However, other tests, such as those that consume ImageData directly, don't have any native mechanism for Y-flipping the source. The data could be generated Y-flipped, but that doesn't increase coverage of any implementation features. Despite not flipping the source orientation, however, the test still expected the data to be flipped. By removing this argument and no longer testing for flipped source data, these tests all begin passing with no loss of platform coverage.

Bug: dawn:2017

* Lint fix

This is run by `npm test`, and should catch cache file collision issues which have caused reverts in the past.

wgsl: Add execution tests for AF negation (#2909)

This refactors the existing code to have a clearer separation from the non-AF test running code, and sets up for implementing vector support. Issue #1626

This PR fixes the issues with cache collisions that caused the initial revert.

This may be a temporary change if copying compressed textures gets removed from core and moved to a feature.

This PR adds execution tests for the f16 built-in clamp. Issue: #1248

This PR adds execution tests for the f16 built-ins sign and step. Issue: #1248, #2583, #2529

…2922) Matrix addition will be covered in future PRs. Issue #1626 Co-authored-by: jzm-intel <[email protected]>

This PR makes the frexp function in util/math.ts handle f16 and f64 as well as f32, and adds an f16 execution test for the built-in frexp. Issue: #1248, #2587

…float32-filterable formats. (#2915)

* Adds magFilter op tests
* Adds initial minFilter tests
* Adds remaining sampling tests
* Fix formatting issue
* Redo min/magFilter tests for mirror, and adds more doc.

And fix state tracking so that subcases are actually marked skipped in the logger

Issue #1626

Add missing entry to `listing_meta.json`

@builtin(vertex_index) and @builtin(instance_index) each take an attribute in compat mode

Issue #1626

Fixes #2938

Issue #1626 Co-authored-by: jzm-intel <[email protected]>

Issue #1626

Fixes #3076

Fixes #2582

Issue #1297

* depthCompare is not required for depth attachments if not used
* Refactor for success
* Refactor success definition

These were landed on the wrong branch, and are causing tests to fail to build, because some of the code that they depend on is implemented in a PR that hasn't landed yet.

Rewrites how test cases are generated for atan2, so that when running in const-eval, unbounded results are not generated, since those would cause compilation errors. Fixes #3088

This is only defined for f32, so doesn't really need to be defined in the common super class. This allows for removing the various stub references to it, that will never be implemented.

This was accidentally removed in 30c129e

* Slim down on typed array allocations in conversion.ts
* Add comments explaining requirements of aliased working data

…n. (#3096)

…odified (#3097) This should hopefully categorically prevent bugs like the one fixed in #3096
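One way to get this kind of protection is sketched below. It is a minimal illustration, not the CTS's actual mechanism: the `Params` shape and the `freezeParams` helper are hypothetical names introduced here.

```typescript
// Hypothetical parameter shape, used only for illustration.
interface Params {
  size: number;
  format: string;
}

// Returns a frozen copy: Readonly<Params> rejects writes at compile time,
// and Object.freeze rejects them at run time.
function freezeParams(p: Params): Readonly<Params> {
  return Object.freeze({ ...p });
}

const sharedParams = freezeParams({ size: 4, format: 'rgba8unorm' });

// A test that needs a different value works on its own copy instead of
// mutating the shared object.
const localParams: Params = { ...sharedParams, size: 8 };

console.log(sharedParams.size, localParams.size);
```

The point of the design is that a later run of the same test sees the original parameters, whatever an earlier run did with its copy.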

This removes the need to create a bunch of temporary JSON objects, reducing the amount of garbage collection we need to do. This change also changes the DataCache from being unbounded to a 4-element LRU cache, capping the amount of memory used.
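For readers unfamiliar with the idea, a small bounded LRU cache can be sketched as follows. The class `LruCache` is a hypothetical illustration and is not the DataCache implementation; it relies on the fact that a JavaScript `Map` iterates in insertion order.

```typescript
// Hypothetical sketch of a bounded least-recently-used cache.
class LruCache<K, V> {
  private readonly entries = new Map<K, V>();
  private readonly capacity: number;

  constructor(capacity: number) {
    this.capacity = capacity;
  }

  get(key: K): V | undefined {
    if (!this.entries.has(key)) return undefined;
    const value = this.entries.get(key) as V;
    // Re-insert so the key becomes the most recently used.
    this.entries.delete(key);
    this.entries.set(key, value);
    return value;
  }

  set(key: K, value: V): void {
    this.entries.delete(key);
    this.entries.set(key, value);
    if (this.entries.size > this.capacity) {
      // The first key in iteration order is the least recently used.
      const oldest = this.entries.keys().next().value as K;
      this.entries.delete(oldest);
    }
  }

  has(key: K): boolean {
    return this.entries.has(key);
  }

  get size(): number {
    return this.entries.size;
  }
}
```

With a capacity of 4, inserting a fifth entry evicts whichever entry was touched least recently, so memory use stays bounded regardless of how many distinct keys are requested.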

Unlike doxygen, TSDoc doesn't support @p to link to a parameter. Use code backticks instead.

* Use DataView instead of a bunch of separate typed arrays.
* Avoid small allocations where it's trivial to do so.

Speeds up deserialization by ~10% based on profiling in Chrome.
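The general technique can be sketched like this. The record layout (a 4-byte tag followed by an 8-byte float) and the function names are assumptions made for the example, not the cache's real format.

```typescript
// Hypothetical record layout: u32 tag at byte 0, f64 value at byte 4,
// both little endian.
const RECORD_BYTES = 12;

function encodeRecord(tag: number, value: number): ArrayBuffer {
  const buffer = new ArrayBuffer(RECORD_BYTES);
  const view = new DataView(buffer);
  view.setUint32(0, tag, true);
  // Offset 4 is not 8-byte aligned. A Float64Array view at this offset
  // would throw a RangeError, but DataView has no alignment requirement.
  view.setFloat64(4, value, true);
  return buffer;
}

function decodeRecord(buffer: ArrayBuffer): { tag: number; value: number } {
  // One DataView reads every field type; no per-type typed arrays needed.
  const view = new DataView(buffer);
  return { tag: view.getUint32(0, true), value: view.getFloat64(4, true) };
}

const decoded = decodeRecord(encodeRecord(0xdeadbeef, 1.5));
console.log(decoded.tag.toString(16), decoded.value);
```

Passing `true` as the last argument selects little-endian byte order explicitly, which keeps the encoding independent of the host platform.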

Reduces the number of cases by using sparse instead of full ranges, since there is going to be a cartesian product of input values when generating cases. Optimizes two quantization functions that had not been updated to re-use their TypedArray. Creation and then immediate destruction of TypedArrays is a type of hotspot we have encountered in other areas of the code base.

…ents tests (#3104) This patch uses the largest value of maxInterStageShaderVariables supported on the current adapter in the tests about maxInterStageShaderComponents when creating devices, so that when the value we use as maxInterStageShaderComponents is larger than the default one, we won't be limited by the default value of maxInterStageShaderVariables. This patch also removes the assertion that the value of maxInterStageShaderVariables must be larger than a quarter of maxInterStageShaderComponents, as on many backends the largest value of maxInterStageShaderComponents is equal to 4x maxInterStageShaderVariables, so in "overLimit" tests the value of maxInterStageShaderComponents can be greater than 4x device.limits.maxInterStageShaderVariables.

To match all the other data types

* dev_server: serve on localhost only by default
* Limit characters in route for /out/*/listing.js

Instead of passing the input through a F16Array, use the library provided function hfround. hfround is a fast look up table based rounding function for f16. Benchmarking locally this provides a ~20% improvement to fma interval calculations, which are particularly sensitive to quantization cost. Overall I was seeing more on the order of ~10% improvement.

Instead of passing the input through a F32Array, use the builtin Math.fround. This leads to a ~5% improvement benchmarking locally. This is less than the equivalent f16 change, because F32Array is provided by the runtime, whereas F16Array is being polyfilled, so is probably more efficient to begin with.
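The equivalence behind this change can be sketched as follows. The two function names are hypothetical, chosen for the example; only `Float32Array` and `Math.fround` are standard.

```typescript
// Reusable one-element scratch array, so the typed-array path does not
// allocate on every call.
const f32Scratch = new Float32Array(1);

// Hypothetical name: rounds to f32 by storing into and reading from a
// Float32Array.
function quantizeViaTypedArray(x: number): number {
  f32Scratch[0] = x;
  return f32Scratch[0];
}

// Hypothetical name: rounds to f32 with the builtin.
function quantizeViaFround(x: number): number {
  return Math.fround(x);
}

for (const x of [0.1, 1 / 3, 1e40, -0, NaN]) {
  // Object.is is used so that NaN and -0 are compared exactly.
  console.log(x, Object.is(quantizeViaTypedArray(x), quantizeViaFround(x)));
}
```

Both paths round to the nearest f32 value, so the builtin can replace the round trip through the array without changing results.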

buffer() was offsetting the array instead of truncating the returned array.
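The difference between offsetting and truncating a view can be shown with a sketch. This is an assumed reconstruction of the class of bug, not the actual buffer() code; the variable names are hypothetical.

```typescript
// Hypothetical over-allocated backing store with only 3 bytes written.
const backingStore = new Uint8Array(8);
backingStore.set([1, 2, 3]);
const bytesWritten = 3;

// Offsetting: the second argument is a byte offset, so this view STARTS
// at byte 3 and covers the unused zero bytes.
const offsetView = new Uint8Array(backingStore.buffer, bytesWritten);

// Truncating: offset 0 and an explicit length keep only the written bytes.
const truncatedView = new Uint8Array(backingStore.buffer, 0, bytesWritten);

console.log(Array.from(offsetView));
console.log(Array.from(truncatedView));
```

The two constructor calls differ by a single argument, which makes this an easy mistake to miss in review.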

Another small bump (~5%) to be gained through using a builtin instead of trampolining through a TypedArray.
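A sketch of the two approaches is given below, with hypothetical function names. It also shows where they differ: a typed-array store wraps out-of-range values modulo 2^32, while `Math.trunc` does not, so any range handling has to be done separately. How the CTS handles that range is not shown here.

```typescript
const i32Scratch = new Int32Array(1);

// Hypothetical name: truncates by storing into an Int32Array.
function truncViaTypedArray(x: number): number {
  i32Scratch[0] = x;
  return i32Scratch[0];
}

// Hypothetical name: truncates with the builtin.
function truncViaBuiltin(x: number): number {
  return Math.trunc(x);
}

// In range, both drop the fractional part, rounding toward zero.
console.log(truncViaTypedArray(3.7), truncViaBuiltin(3.7));
console.log(truncViaTypedArray(-3.7), truncViaBuiltin(-3.7));

// Out of range, the typed array wraps around but Math.trunc does not.
console.log(truncViaTypedArray(2 ** 31 + 0.5), truncViaBuiltin(2 ** 31 + 0.5));
```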

wgsl: f16 built-in execution test for bitcast (#2897)

3acbf58

This PR adds execution tests for the bitcast built-in from and to f16 types. Issue: #1248, #1609

pull bot added the ⤵️ pull label Aug 22, 2023

dneto0 and others added 28 commits August 22, 2023 16:37

Remove TODO about f32(-0.0) printing as '-0.0f' (#2903)

4fc7483

It's not guaranteed that the -0.0 value will survive once it is stored in a Scalar object. Issue: #2901

Prevent details toggle if user is selecting text (#2906)

2ee990a

* Prevent details toggle if user is selecting text
* select all on click

Fix handling of batches in case filtering (#2908)

5dfa3b8

wgsl: Add execution tests for AF negation (#2909)

75c5460

This refactors the existing code to have a clearer separation from the non-AF test running code, and sets up for implementing vector support. Issue #1626

Revert "wgsl: Add execution tests for AF negation (#2909)" (#2912)

a0dcafc

This reverts commit 75c5460.

Don't color tests which have not been run as skipped (#2910)

7f4eced

Add generate-cache step to grunt pre

4c8d2f6

This is run by `npm test`, and should catch cache file collision issues which have caused reverts in the past.

Update fp_primer.md

d2b2bad

Compat: Skip if copyTextureToTexture not supported (#2923)

4f3574d

This may be a temporary change if copying compressed textures gets removed from core and moved to a feature.

wgsl: f16 built-in execution test for clamp (#2918)

8ca48c8

This PR adds execution tests for the f16 built-in clamp. Issue: #1248

wgsl: f16 built-in execution test for sign and step (#2911)

ef82e7b

This PR adds execution tests for the f16 built-ins sign and step. Issue: #1248, #2583, #2529

wgsl: Implement scalar/vector AbstractFloat addition execution tests (#…

23cdd20

…2922) Matrix addition will be covered in future PRs. Issue #1626 Co-authored-by: jzm-intel <[email protected]>

wgsl: f16 built-in execution test for frexp (#2925)

7f1e5af

This PR makes the frexp function in util/math.ts handle f16 and f64 as well as f32, and adds an f16 execution test for the built-in frexp. Issue: #1248, #2587

Implements filtering tests for mag/min/mipmapFilters with additional …

0126724

…float32-filterable formats. (#2915)

* Adds magFilter op tests
* Adds initial minFilter tests
* Adds remaining sampling tests
* Fix formatting issue
* Redo min/magFilter tests for mirror, and adds more doc.

Allow unused variables starting with underscore

a8f254a

websocket-logger tool

da1527d

Check that if the test is skipped, all subcases are skipped

f42aeb4

And fix state tracking so that subcases are actually marked skipped in the logger

Tools for generating timing metadata and auto-chunking WPT

691e6b4

Add generated metadata for webgpu:*

76e56df

wgsl: Add f16 negation execution tests (#2927)

05e32a6

Issue #1626

Fix presubmits (npm test)

90edae1

Add missing entry to `listing_meta.json`

Compat: Test vertex_index, instance_index limits (#2940)

f0044b9

@builtin(vertex_index) and @builtin(instance_index) each take an attribute in compat mode

wgsl: Add AbstractFloat matrix addition execution tests (#2926)

0b49ea7

Issue #1626

Add documentation for adding timing metadata (#2942)

7536133

Fixes #2938

wgsl: Add non-matrix AbstractFloat subtraction execution tests (#2928)

18468be

Issue #1626 Co-authored-by: jzm-intel <[email protected]>

wgsl: Add AbstractFloat matrix subtraction execution tests (#2929)

fd0cf88

Issue #1626

zoddicus and others added 29 commits October 23, 2023 14:32

wgsl: Add AF select execution tests (#3077)

3148e15

Fixes #3076

wgsl: Add AbstractFloat sign execution tests (#3081)

2499ea9

Fixes #2582

Fix a bad slice operation in image_copy stencil tests

b3c2508

Run grunt fix

3fe36f2

wgsl: Add AbstractFloat floor execution tests (#3084)

d491499

Issue #1297

depthCompare is not required for depth attachments if not used (#3069)

73bcf42

* depthCompare is not required for depth attachments if not used
* Refactor for success
* Refactor success definition

Fix a minor issue introduced in previous stencil test fix (#3086)

7991cc7

Remove pipeline statistics query feature (#3085)

8e7a995

wgsl: Revert changes to round execution tests (#3090)

b929ebb

These were landed on the wrong branch, and are causing tests to fail to build, because some of the code that they depend on is implemented in a PR that hasn't landed yet.

wgsl: Filter atan2 tests based on if const-eval or not (#3089)

2405593

Rewrites how test cases are generated for atan2, so that when running in const-eval, unbounded results are not generated, since those would cause compilation errors. Fixes #3088

wgsl: Cleanup cruft related to quantizeToF16 (#3082)

199c8f1

This is only defined for f32, so doesn't really need to be defined in the common super class. This allows for removing the various stub references to it, that will never be implemented.

Add back generate-cache grunt command (#3091)

1281ee1

This was accidentally removed in 30c129e

Slim down on typed array allocations in conversion.ts (#3092)

42e6b6d

* Slim down on typed array allocations in conversion.ts
* Add comments explaining requirements of aliased working data

Remove duplicate definitions of reinterpret* (#3095)

2be0e90

Fixes flaky test because parameters were changed when the test was ra…

e5f120e

…n. (#3096)

Make test params readonly so they can't be accidentally permanently m…

f3196f8

…odified (#3097) This should hopefully categorically prevent bugs like the one fixed in #3096

tsdoc: Remove @p with backticks

250e583

Unlike doxygen, TSDoc doesn't support @p to link to a parameter. Use code backticks instead.

More cache deserialization micro-optimisations

ba9e5d6

* Use DataView instead of a bunch of separate typed arrays.
* Avoid small allocations where it's trivial to do so.

Speeds up deserialization by ~10% based on profiling in Chrome.

BinaryStream: Use little endian for f64

2ef3f32

To match all the other data types

Add missing tests for adapter capability guarantees (#3107)

3dbe4ce

Test that DOMExceptions from WebGPU always have stacks (#3105)

ef5d229

dev_server: Serve on localhost by default (#3115)

ccee5a9

* dev_server: serve on localhost only by default
* Limit characters in route for /out/*/listing.js

Fix cache files being padded with trailing 0's

2f3b68c

buffer() was offsetting the array instead of truncating the returned array.

wgsl: Convert quantizeToI32/U32 to use Math.trunc (#3120)

ab09ed4

Another small bump (~5%) to be gained through using a builtin instead of trampolining through a TypedArray.

ErichDonGubler merged commit ab09ed4 into mozilla:main Nov 1, 2023
