Fix conversion of TensorData, TensorsData to json #22166

xadupre · 2024-09-20T16:35:23Z

Description

Fix write_calibration_table to support TensorData, TensorsData

chilo-ms · 2024-10-03T17:12:18Z

Thanks for making TensorsData and TensorData serializable.

write_calibration_table writes three calibration files in different formats: json, txt, and flatbuffers.
The TRT EP consumes the flatbuffers file and expects each tensor entry to contain only the name and max(abs(values[0]), abs(values[1])). It should look something like this:

resnetv17_stage4_conv9_fwd 0.381193
resnetv17_stage2_relu4_fwd 2.53578352
....

However, it currently serializes much more information for each tensor:

flatten_473 {'lowest': array([0.], dtype=float32), 'highest': array([10.190684], dtype=float32), 'CLS': 'TensorData'}
resnetv17_batchnorm0_fwd {'lowest': array([-5.5807953], dtype=float32), 'highest': array([5.954951], dtype=float32), 'CLS': 'TensorData'}
....

Could you add functionality that extracts just the tensor name and the absolute value required by the TRT EP?
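The reduction requested above can be sketched as follows. This is only an illustration under stated assumptions: `to_trt_lines` is a hypothetical helper (not part of onnxruntime), and the ranges are passed as plain `(lowest, highest)` float pairs rather than `TensorData` objects. The sample values are taken from the output quoted in this comment.

```python
from typing import Dict, List, Tuple


def to_trt_lines(ranges: Dict[str, Tuple[float, float]]) -> List[str]:
    """Hypothetical helper: one 'name value' line per tensor.

    The value is max(abs(lowest), abs(highest)), which is what the
    comment above says the TRT EP expects.
    """
    lines = []
    for name in sorted(ranges):
        lowest, highest = ranges[name]
        lines.append(f"{name} {max(abs(lowest), abs(highest))}")
    return lines


# Ranges copied from the serialized output quoted above.
ranges = {
    "flatten_473": (0.0, 10.190684),
    "resnetv17_batchnorm0_fwd": (-5.5807953, 5.954951),
}
for line in to_trt_lines(ranges):
    print(line)
```

Each printed line keeps only the tensor name and a single float, matching the shape of the `resnetv17_stage4_conv9_fwd 0.381193` example.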


xadupre · 2024-10-04T13:35:11Z

I just made a change to restore the previous format. Let me know if that's ok with you.

chilo-ms · 2024-10-04T17:06:34Z

Thanks!
The TRT EP actually reads calibration.flatbuffers, not the plain txt file, so could you please add the code to that block as below?

    zero = np.array(0)
    for key in sorted(calibration_cache.keys()):
        values = calibration_cache[key]
        d_values = values.to_dict()
        floats = [
            float(d_values.get("highest", zero).item()),
            float(d_values.get("lowest", zero).item()),
        ]
        value = str(max(floats))  # str(max(abs(values[0]), abs(values[1])))

        flat_key = builder.CreateString(key)
        flat_value = builder.CreateString(value)
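To make the two reductions mentioned in the snippet concrete, here is a small sketch comparing `max(floats)` with the `max(abs(...), abs(...))` expression kept in the trailing comment. It is an assumption-laden illustration: `signed_max` and `abs_max` are hypothetical names, and plain floats stand in for the NumPy arrays (and their `.item()` calls) used in the snippet.

```python
from typing import Dict


def signed_max(d_values: Dict[str, float]) -> float:
    """Mirrors max(floats) from the snippet, using plain floats."""
    return max(d_values.get("highest", 0.0), d_values.get("lowest", 0.0))


def abs_max(d_values: Dict[str, float]) -> float:
    """Mirrors the commented expression max(abs(values[0]), abs(values[1]))."""
    return max(abs(d_values.get("highest", 0.0)), abs(d_values.get("lowest", 0.0)))


# Hypothetical range whose lower bound has the larger magnitude.
example = {"lowest": -5.0, "highest": 3.0}
print(signed_max(example), abs_max(example))
```

The two functions agree whenever `highest` is at least as large as the magnitude of `lowest`, as in both ranges quoted earlier in this thread; they differ only when the lower bound dominates in magnitude.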

xadupre · 2024-10-04T17:20:54Z

> builder.CreateString(key)

I don't know block very well. Do you know the file I should modify?

chilo-ms · 2024-10-04T17:57:19Z

> > builder.CreateString(key)
>
> I don't know block very well. Do you know the file I should modify?

It's here https://github.com/xadupre/onnxruntime/blob/qdq_json/onnxruntime/python/tools/quantization/quant_utils.py#L698

xadupre · 2024-10-04T18:04:49Z

> > > builder.CreateString(key)
> >
> > I don't know block very well. Do you know the file I should modify?
>
> It's here https://github.com/xadupre/onnxruntime/blob/qdq_json/onnxruntime/python/tools/quantization/quant_utils.py#L698

Sorry, I did not see it; it was just above. I just pushed the changes.

onnxruntime/python/tools/quantization/quant_utils.py


xadupre added 4 commits September 20, 2024 18:32

Fix convertion of TensorData to json

48130bd

simplifies

9ac0911

add unit test

c6c9c33

lint

d98a7a5

xadupre added 2 commits October 4, 2024 13:08

Merge branch 'main' of https://github.com/microsoft/onnxruntime into …

f01cf5e

…qdq_json

restore original format expected by TRT

25e2636

fix flatbuffers

152f10f

chilo-ms reviewed Oct 4, 2024


onnxruntime/python/tools/quantization/quant_utils.py (outdated, resolved)

fix wrong format

6863b6b

chilo-ms approved these changes Oct 4, 2024


jywu-msft merged commit 407c1ab into microsoft:main Oct 7, 2024
86 checks passed

xadupre deleted the qdq_json branch November 7, 2024 10:35

ishwar-raut1 pushed a commit to ishwar-raut1/onnxruntime that referenced this pull request Nov 19, 2024

Fix conversion of TensorData, TensorsData to json (microsoft#22166)

031a0b5

