Tags: MJxiaomings/pytorch
[ROCm] fix ROCm 5.5 nightly build after hipblas change
Update on "fix bf16 constant accuracy"

This PR sorts out the data type for `constant`. The constant should be promoted to float (pytorch#105440), so several changes are needed:
- Data type propagation should propagate a constant node to `float` dtype if its original dtype is `bfloat16`.
- We do not need to insert `to_dtype` after the `constant` node; directly initializing an `fp32` constant is faster:
```
vectorized<bfloat16> tmp(value);
vectorized<float> tmp1 = cvt_bf16_fp32(tmp);
->
vectorized<float> tmp(value);
```
- Move `constant` out of the list of operations that can support bf16 without converting to fp32.

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov

[ghstack-poisoned]
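The promotion rule above can be sketched in plain Python. This is a hypothetical illustration (the names `PROMOTE` and `propagate_constant_dtype` are invented for this sketch and are not the actual inductor API): a constant node whose original dtype is `bfloat16` is materialized as `float32`, so no `to_dtype` node is needed after it.

```python
# Hypothetical sketch of the dtype-propagation rule for constant nodes:
# bfloat16 constants are promoted to float32; other dtypes pass through.
PROMOTE = {"bfloat16": "float32"}

def propagate_constant_dtype(dtype: str) -> str:
    """Return the dtype a constant node should be materialized with."""
    return PROMOTE.get(dtype, dtype)

print(propagate_constant_dtype("bfloat16"))  # float32
print(propagate_constant_dtype("float64"))   # float64
```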
Update on "Move ASAN to clang12 and Ubuntu-22.04 (Jammy)"

Modify `install_conda` to remove libstdc++ from libstdcxx-ng so the one from the OS is used.

Modify `install_torchvision` to work around a weird glibc bug where malloc interposers (such as ASAN) cause a hang in the internationalization library; see https://sourceware.org/bugzilla/show_bug.cgi?id=27653 and https://gcc.gnu.org/bugzilla/show_bug.cgi?id=90589

Extracted from pytorch#105260

[ghstack-poisoned]
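The libstdc++ removal described above amounts to deleting conda's bundled copy so the dynamic loader falls back to the system one. A minimal sketch, using a throwaway fake prefix so it is safe to run anywhere (the real change would target `$CONDA_PREFIX/lib`; the exact commands in `install_conda` may differ):

```shell
# Build a fake conda prefix to demonstrate the deletion safely.
PREFIX=$(mktemp -d)
mkdir -p "$PREFIX/lib"
touch "$PREFIX/lib/libstdc++.so" "$PREFIX/lib/libstdc++.so.6" "$PREFIX/lib/libz.so"

# Remove the bundled libstdc++ so the OS-provided library is used instead.
rm -f "$PREFIX"/lib/libstdc++.so*

# Only the unrelated library remains.
ls "$PREFIX/lib"
```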
[dynamo] Support dict.get with no default specified
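For context, this is the builtin behavior dynamo now traces: `dict.get` called with only a key returns `None` when the key is absent, instead of raising `KeyError` like subscripting does. A plain-Python illustration:

```python
# dict.get with no default returns None for a missing key,
# unlike d["b"], which would raise KeyError.
d = {"a": 1}
print(d.get("a"))  # 1
print(d.get("b"))  # None
```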
Update on "adding mixed_dtype_mm to torch._inductor"

Summary: if `torch._inductor.config.use_mixed_mm` is set, we can convert `torch.mm(a, b.to(some_dtype))` into a Triton kernel where the cast of b is fused into the matmul rather than needing to instantiate the cast b tensor. This is necessary for weight-only quantization, where we don't want to load memory with a tensor 4x the size of our quantized one.

Test Plan:
python test/inductor/test_pattern_matcher.py -k "mixed_mm"
python test/inductor/test_torchinductor.py -k "mixed_mm"

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov

[ghstack-poisoned]
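The fusion idea can be shown in pure Python (no torch, and not the actual Triton kernel — `mm_unfused` and `mm_fused` are invented names for this sketch). The unfused path materializes a full upcast copy of `b` before the matmul; the fused path casts each element on the fly inside the inner loop, so no full-size copy of `b` ever exists. Both produce the same result:

```python
def mm(a, b):
    """Naive matmul over lists of lists."""
    n, k, m = len(a), len(a[0]), len(b[0])
    return [[sum(a[i][t] * b[t][j] for t in range(k)) for j in range(m)]
            for i in range(n)]

def mm_unfused(a, b_int8):
    # Materializes a float copy of b first: 4x the memory of the int8 weight.
    b_f32 = [[float(x) for x in row] for row in b_int8]
    return mm(a, b_f32)

def mm_fused(a, b_int8):
    # Cast happens per element inside the matmul; no full-size copy of b.
    n, k, m = len(a), len(a[0]), len(b_int8[0])
    return [[sum(a[i][t] * float(b_int8[t][j]) for t in range(k))
             for j in range(m)] for i in range(n)]
```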
AOTInductor compile in prod env (pytorch#106442)

Summary: Pull Request resolved: pytorch#106442. This diff updates the Inductor internal compile workflow.

Reviewed By: houseroad, wushirong
Differential Revision: D47958727
fbshipit-source-id: be9660319c2400cb8c2d247fa6255cab14c4546c
reword error message, import TEST_WITH_TORCHDYNAMO