QAT, latency stays the same #230

Open

maiiabocharova opened this issue May 2, 2022 · 1 comment

@maiiabocharova
Trained a model according to the documentation and added these config lines for QAT:

    cfg.QUANTIZATION.QAT.BATCH_SIZE_FACTOR = 1.0
    cfg.QUANTIZATION.BACKEND = "fbgemm"
    cfg.QUANTIZATION.QAT.FAKE_QUANT_METHOD = "default"
    cfg.QUANTIZATION.QAT.START_ITER = 1200
    cfg.QUANTIZATION.QAT.ENABLE_OBSERVER_ITER = 1200
    cfg.QUANTIZATION.QAT.ENABLE_LEARNABLE_OBSERVER_ITER = 1300
    cfg.QUANTIZATION.QAT.DISABLE_OBSERVER_ITER = 1200 + 300
    cfg.QUANTIZATION.QAT.FREEZE_BN_ITER = 12000 + 200

Trained for 1500 iterations.

The model is OK and the predictions are correct, but the model size and execution time stayed exactly the same. Can you please advise on how to fix this?

@wat3rBro
Contributor

wat3rBro commented Jun 3, 2022

Are you running the exported TorchScript model? The speedup and model size reduction apply to running the TorchScript model on device.
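
For reference, a minimal sketch of what that export step looks like in plain PyTorch (not the d2go exporter specifically; the function name below is just illustrative). During QAT the weights stay fp32 with fake-quant/observer modules attached, so the checkpoint size and eager-mode latency do not change; the int8 savings only show up after converting the model and running the scripted artifact:

    import torch
    import torch.ao.quantization as tq

    def export_int8_torchscript(qat_model: torch.nn.Module, path: str) -> None:
        # QAT leaves fp32 weights plus fake-quant modules in place,
        # which is why size/latency look unchanged in eager mode.
        qat_model.eval()
        # Replace fake-quant modules with real int8 quantized ops
        # (fbgemm backend on server/x86).
        int8_model = tq.convert(qat_model)
        # Script and save; this TorchScript artifact is where the size
        # and latency reduction actually appears.
        torch.jit.save(torch.jit.script(int8_model), path)

Benchmarking should then be done on the loaded TorchScript model (`torch.jit.load(path)`), not on the in-memory QAT model.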
