Decoupled PerChannel/PerTensor quantization #1025

Giuseppe5 · 2024-09-12T09:51:52Z

To have a decouple per channel pre-scale, and per tensor post scale:

class Int8WeightNormL2PerChannelPerTensorFixedPoint(Int8WeightNormL2PerChannelFixedPoint):
    scaling_per_output_type = ScalingPerOutputType.TENSOR

It currently works for all quantizers that inherit from WeightNormPerChannelFloatDecoupled

It can be extended to other decoupled quantization quantizers if needed

nickfraser

Single question about test organisation (specifically, fixtures), otherwise LGTM!

nickfraser · 2024-10-07T13:37:17Z

tests/brevitas/export/quant_module_fixture.py

This is a code style question, any reason why we don't use the standard pytest way of sharing fixtures across multiple files (i.e., conftest.py)?

We have a lot of duplication for various reasons and in general tests would need to be restructured a bit.
All this to say that I wouldn't be sure at his point of the cleanest way to do it myself, but if you have a suggestion, all ears.

Giuseppe5 added 2 commits September 12, 2024 10:50

Decoupled PerChannel/PerTensor quantization

2609941

fix

028c352

Giuseppe5 marked this pull request as ready for review September 13, 2024 13:10

Giuseppe5 requested a review from i-colbert September 13, 2024 13:10

Giuseppe5 added 2 commits September 13, 2024 16:10

wrong inheritance

48b0e3b

fix permute dims

377471e

Giuseppe5 requested review from i-colbert and removed request for i-colbert September 13, 2024 16:27

i-colbert and others added 4 commits September 23, 2024 07:59

Feat (tests): adding A2Q per-tensor/per-channel tests

25dd388

fix some errors

eb6e108

Correct zero_point_impl inj

7e1fd5e

precommit

6ee6547

Giuseppe5 requested review from i-colbert and removed request for i-colbert October 1, 2024 09:35

Fix

68db384

Giuseppe5 requested review from i-colbert and removed request for i-colbert October 1, 2024 10:23

i-colbert approved these changes Oct 1, 2024

View reviewed changes

Giuseppe5 added the next release PRs which should be merged for the next release label Oct 2, 2024

i-colbert added 2 commits October 2, 2024 09:19

Feat (tests): adding export tests

25a3dba

Pre-commit

0377ee7

nickfraser approved these changes Oct 7, 2024

View reviewed changes

Giuseppe5 merged commit 9048ecb into Xilinx:dev Oct 8, 2024
23 checks passed

Giuseppe5 deleted the a2q_per_tensor branch October 8, 2024 13:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decoupled PerChannel/PerTensor quantization #1025

Decoupled PerChannel/PerTensor quantization #1025

Giuseppe5 commented Sep 12, 2024 •

edited

Loading

nickfraser left a comment

nickfraser Oct 7, 2024

Giuseppe5 Oct 7, 2024

Decoupled PerChannel/PerTensor quantization #1025

Decoupled PerChannel/PerTensor quantization #1025

Conversation

Giuseppe5 commented Sep 12, 2024 • edited Loading

nickfraser left a comment

Choose a reason for hiding this comment

nickfraser Oct 7, 2024

Choose a reason for hiding this comment

Giuseppe5 Oct 7, 2024

Choose a reason for hiding this comment

Giuseppe5 commented Sep 12, 2024 •

edited

Loading