You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
importdpctl.tensorasdpta=dpt.ones((8192, 8192), dtype='i4', device='cpu')
b=dpt.ones((8192+2, 8192+2), dtype='i4', device='cpu')
%timeitb[2:, 2:]+=a#209 ms ± 36.8 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)importnumpya_np=numpy.ones((8192, 8192), dtype='i4')
b_np=numpy.ones((8192+2, 8192+2), dtype='i4')
%timeitb_np[2:, 2:]+=a_np#75.7 ms ± 1.31 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
The text was updated successfully, but these errors were encountered:
npolina4
changed the title
Performance: in-place dpctl.tensor.add with strides performance
Performance: in-place dpctl.tensor.add with strides
Jul 12, 2023
The text was updated successfully, but these errors were encountered: