8000 Release v0.3.0: Add QAT support to more models (#29) · entn-at/transformer-deploy · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

v0.3.0

@pommedeterresautee pommedeterresautee tagged this 28 Dec 22:28
* first version of QDQ monkey patching

* add Albert, Electra and Distilbert QAT support

* add QDQDeberta V1

* fix distilbert

* add ast patch
add quant onnx export

* simplify quantization process

* fix qdq deberta

* quantization refactoring

* add documentation
add quantization tests
add deberta v2

* add quant of layernorm
refactor ast modif
add tests

* add operator name in quantizer name
update notebook

* update notebook

* update notebook
Assets 2
Loading
0