i#7113 decode cache: Add analyzer library for decode_cache_t #7114

abhinav92003 · 2024-12-09T21:54:14Z

Adds a new drmemtrace_decode_cache library that caches information about decoded instructions using decode_cache_t. Analysis tools that need to decode the instr encodings in the trace can use it to avoid the overhead of redundant decodings, which can get expensive.

The library allows the tools to specify what information they need to cache. Also, it uses instr_noalloc_t when possible to reduce heap usage and allocation/deallocation overhead.
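As a rough illustration of the idea (not the library's actual interface), the sketch below shows a cache templated on a tool-defined info type, where decoding only happens on a cache miss. All names here (toy_instr_record_t, opcode_only_info_t, toy_decode_cache_t, get_or_decode) are hypothetical, and the "decode" step is faked by reading the first encoding byte.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a trace instruction record: a PC plus raw encoding bytes.
struct toy_instr_record_t {
    uint64_t pc;
    std::vector<uint8_t> encoding;
};

// Tool-defined payload: the tool stores only what it needs.
// Assumption: "decoding" is faked as taking the first encoding byte as the opcode.
struct opcode_only_info_t {
    int opcode = -1;
    void set_decode_info(const toy_instr_record_t &rec)
    {
        opcode = rec.encoding.empty() ? -1 : rec.encoding[0];
    }
};

// Minimal cache keyed by PC; DecodeInfo must provide set_decode_info().
template <typename DecodeInfo> class toy_decode_cache_t {
public:
    // Returns the cached info for rec.pc, decoding only if it is not cached yet.
    const DecodeInfo &get_or_decode(const toy_instr_record_t &rec)
    {
        auto it = cache_.find(rec.pc);
        if (it == cache_.end()) {
            // Construct and fill the info object only on a miss.
            it = cache_.emplace(rec.pc, DecodeInfo()).first;
            it->second.set_decode_info(rec);
            ++decode_count_;
        }
        return it->second;
    }
    // Drops all cached entries to release memory.
    void clear_cache() { cache_.clear(); }
    std::size_t size() const { return cache_.size(); }
    std::size_t decode_count() const { return decode_count_; }

private:
    std::unordered_map<uint64_t, DecodeInfo> cache_;
    std::size_t decode_count_ = 0;
};
```

A tool would call get_or_decode() for each instruction record it sees; repeated PCs hit the map instead of decoding again.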

If the trace does not include embedded encodings, or if the user wants to get encodings from the app binaries using module_mapper_t instead, they can provide the module file path to the init API on the decode_cache_t object. Only a single initialized module_mapper_t exists at any time; it is shared between all decode_cache_t objects (even ones of different template types) by tracking the count of active objects that use the module mapper.
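One way to share a single resource across different template instantiations is to keep it in a non-template base class and count its users. The sketch below is an assumption about how such sharing could look, not the code in this PR; toy_module_mapper_t, toy_cache_base_t and toy_cache_t are hypothetical names, and throwing on a mismatched path is just one possible way to report that error.

```cpp
#include <cassert>
#include <memory>
#include <mutex>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for an expensive-to-build module mapper.
struct toy_module_mapper_t {
    explicit toy_module_mapper_t(const std::string &path)
        : module_file_path(path)
    {
    }
    std::string module_file_path;
};

// Non-template base: its static members are shared by every
// toy_cache_t<T> instantiation, whatever T is.
class toy_cache_base_t {
public:
    virtual ~toy_cache_base_t() { release_mapper(); }
    static int active_users()
    {
        std::lock_guard<std::mutex> guard(mutex_);
        return user_count_;
    }
    static bool mapper_alive()
    {
        std::lock_guard<std::mutex> guard(mutex_);
        return mapper_ != nullptr;
    }

protected:
    void acquire_mapper(const std::string &path)
    {
        std::lock_guard<std::mutex> guard(mutex_);
        if (uses_mapper_)
            return; // This object already holds a reference.
        if (mapper_ == nullptr)
            mapper_.reset(new toy_module_mapper_t(path));
        else if (mapper_->module_file_path != path)
            throw std::runtime_error("different module_file_path");
        ++user_count_;
        uses_mapper_ = true;
    }

private:
    void release_mapper()
    {
        std::lock_guard<std::mutex> guard(mutex_);
        if (!uses_mapper_)
            return;
        uses_mapper_ = false;
        // The last user destroys the shared mapper proactively.
        if (--user_count_ == 0)
            mapper_.reset();
    }
    bool uses_mapper_ = false;
    static std::mutex mutex_;
    static std::unique_ptr<toy_module_mapper_t> mapper_;
    static int user_count_;
};
std::mutex toy_cache_base_t::mutex_;
std::unique_ptr<toy_module_mapper_t> toy_cache_base_t::mapper_;
int toy_cache_base_t::user_count_ = 0;

template <typename DecodeInfo> class toy_cache_t : public toy_cache_base_t {
public:
    void init(const std::string &module_file_path)
    {
        acquire_mapper(module_file_path);
    }
};
```

With this layout, toy_cache_t<int> and toy_cache_t<double> objects initialized with the same path share one mapper, which is destroyed when the last of them goes away.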

decode_cache_t provides the clear_cache() API, which can be called in parallel_shard_exit() to keep memory consumption in check. It frees cached decoding info that may not be needed for result computation in the later print_results(), which has to wait until all shards are done.

Refactors the invariant checker and opcode mix tools to use this library.

Modifies add_encodings_to_memrefs to support a mode where encodings are not set in the generated test memref but only the instr addr and size fields are set.

Makes the opcode cache in opcode_mix_t per-shard instead of per-worker. Decodings must not be cached per-worker as that may cause stale encodings for non-first shards processed by the worker. This means the worker init and worker exit APIs can be removed now from opcode_mix_t.
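The staleness problem can be sketched with a toy model, assuming two shards that happen to have different code at the same address and are processed one after another by the same worker. The types and functions below (toy_instr_t, lookup, run_worker) are hypothetical and only model the caching policy, not opcode_mix_t itself.

```cpp
#include <cassert>
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical instruction record: a PC and the first byte of its encoding.
struct toy_instr_t {
    uint64_t pc;
    uint8_t first_byte;
};

using toy_cache_t = std::unordered_map<uint64_t, uint8_t>;

// Returns the cached byte for instr.pc, caching instr.first_byte on a miss.
uint8_t
lookup(toy_cache_t &cache, const toy_instr_t &instr)
{
    auto it = cache.find(instr.pc);
    if (it == cache.end())
        it = cache.emplace(instr.pc, instr.first_byte).first;
    return it->second;
}

// Processes the shards sequentially, as a single worker would, and returns
// the byte the tool believes it saw for each instruction.
std::vector<uint8_t>
run_worker(const std::vector<std::vector<toy_instr_t>> &shards, bool per_shard_cache)
{
    std::vector<uint8_t> seen;
    toy_cache_t worker_cache; // Lives across all shards of this worker.
    for (const auto &shard : shards) {
        toy_cache_t shard_cache; // Fresh for every shard.
        toy_cache_t &cache = per_shard_cache ? shard_cache : worker_cache;
        for (const auto &instr : shard)
            seen.push_back(lookup(cache, instr));
    }
    return seen;
}
```

In this model, a per-worker cache returns the first shard's byte for the second shard's instruction at the same PC, while a per-shard cache returns the correct byte for each shard.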

Adds decode_cache_test and opcode_mix_test unit tests that verify the operation of decode_cache_t.

Issue: #7113

Adds a new library to cache information about decoded instructions. This can be used by analysis tools that need to decode the instr encodings in the trace. The library allows the tools to specify what information they need to cache. Refactors the invariant checker tool to use this library. Issue: #7113

clients/drcachesim/tools/instr_decode_cache.cpp

clients/drcachesim/tests/instr_decode_cache_test.cpp

clients/drcachesim/tools/instr_decode_cache.h

clients/drcachesim/tools/invariant_checker.cpp

clients/drcachesim/tools/invariant_checker.h

clients/drcachesim/tools/instr_decode_cache.h

abhinav92003 · 2024-12-11T02:37:40Z

Decided to try out an alternate way to support module-mapper decoding in instr_decode_cache_t that came out of an offline discussion. It's okay to hold off on the re-review until then (I cannot undo the re-review request).

derekbruening

Blank review to reset the requested review state.

…upport to instr_decode_cache_t

api/docs/release.dox

clients/drcachesim/docs/drcachesim.dox.in

clients/drcachesim/tests/decode_cache_test.cpp

…rs (#7193)

Skips the unnecessary munmap before the subsequent mmap to the same region in elf_loader_map_phdrs.

To mmap all the loadable segments of a file, elf_loader_map_phdrs first gets a large anonymous map. Then for each loadable segment, it munmaps a portion of the anonymous map and mmaps the segment to it. There is potential for a race with other threads that may mmap some memory between the munmap and mmap, which will then get stolen from that other thread because our mmap uses MAP_FIXED. This manifests as crashes when the other thread munmaps that region eventually, and the module mapper cannot access the mapped segment suddenly.

This PR mitigates such a race by simply skipping the munmap call, since the following mmap call uses MAP_FIXED anyway which causes the overlapping address range in the initial map to atomically get unmapped. Note that MAP_FIXED documents that the only safe way to use it is with a range that was previously reserved using another mapping, otherwise it may end up forcibly removing someone else's existing mappings.

This race manifested in #7114 during module_mapper_t initialization which loads all app binaries in the process address space. #7114 is moving this init into the analyzer worker threads whereas previously it was done before launching the workers, which hid the race. There were ~20/1000 analyzer run crashes upon testing #7114 on a small internal test trace, which are fixed with this.

There are some other cases where the unmap call must be made, like when the initial address range was obtained from the vmm to honor the loaded library's preferred address (in this case there's no real race due to how os_unmap_file is implemented; see comment in code), or when d_r_(un)map_file is used (in some non-analyzer cases) which needs to perform other book-keeping besides the actual mmap/munmap (left as future TODO).

Most uses of the elf_loader_map_phdrs mapping code (private library load, early injection) are during DR initialization where there's no race with other threads.

Issue: #7192
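The reserve-then-overlay pattern described above can be sketched with plain POSIX calls. This is a simplified illustration under stated assumptions, not the elf_loader_map_phdrs code: map_segment_over_reservation is a hypothetical helper that reserves an anonymous PROT_NONE range and then maps one page of a file over its second page with MAP_FIXED, without an intervening munmap.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>
#include <sys/mman.h>
#include <unistd.h>

// Hypothetical helper. Returns the base of the reserved range on success, or
// nullptr on failure. On success, the page at base + page_size is backed by
// the first page of the file referred to by fd.
char *
map_segment_over_reservation(int fd, std::size_t page_size, std::size_t num_pages)
{
    // Step 1: reserve the whole range so no other thread's mmap can land in it.
    void *base = mmap(nullptr, page_size * num_pages, PROT_NONE,
                      MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (base == MAP_FAILED)
        return nullptr;
    // Step 2: map the file directly over part of the reservation. MAP_FIXED
    // replaces the overlapping part of the old mapping as part of this call,
    // so there is no window in which the range is unmapped.
    char *target = static_cast<char *>(base) + page_size;
    void *seg = mmap(target, page_size, PROT_READ, MAP_PRIVATE | MAP_FIXED, fd, 0);
    if (seg == MAP_FAILED) {
        munmap(base, page_size * num_pages);
        return nullptr;
    }
    return static_cast<char *>(base);
}
```

The caller owns the whole range and releases it with a single munmap(base, page_size * num_pages) when done.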

derekbruening · 2025-01-23T18:32:38Z

@abhinav92003 abhinav92003 requested a review from derekbruening 22 minutes ago

Could you point at which piece needs a new review: the New Changes button showed just 2 tiny tweaks (+virtual, and copyright change).

abhinav92003 · 2025-01-23T18:36:20Z

Since your last review: https://github.com/DynamoRIO/dynamorio/pull/7114/files/268e5a7d20c98433b97a971b3b1b69eaa394ae5f..31f41466c896760c7b995be302b1a93499512f03

derekbruening · 2025-01-23T19:05:54Z

That seems to have unrelated changes from merging: e.g. the very first patch in release.dox.

abhinav92003 · 2025-01-23T19:31:40Z

That view does, yes, but it's only that file that overlaps a bit. Alternatively, you could look at the individual commits between the two hashes mentioned above.

api/docs/release.dox

clients/drcachesim/tools/common/decode_cache.h

abhinav92003 added 7 commits December 9, 2024 16:53

Docx improvement, and handle regdeps branch_target case.

abebffc

Use instr_noalloc_t where possible.

18f7028

Remove redundant test.

4487168

move impl to cpp

41595eb

Move impl to cpp

d2e94c7

Cleanup and aarch64 mov fix.

f0f8a74

abhinav92003 changed the title ~~i#7113: Add library to cache information about decoded instructions~~ i#7113: Add analyzer library to cache instr decode info Dec 10, 2024

Fix windows bug

a1b1d63

abhinav92003 requested a review from derekbruening December 10, 2024 03:05

derekbruening reviewed Dec 10, 2024

abhinav92003 added 2 commits December 10, 2024 16:43

Reviewer suggested changes

db8a3ad

Cleanup

1e810b5

abhinav92003 requested a review from derekbruening December 11, 2024 02:13

derekbruening reviewed Dec 11, 2024

abhinav92003 mentioned this pull request Dec 13, 2024

i#7113 decode cache: move module read into raw2trace_shared #7124

Merged

abhinav92003 added 12 commits December 14, 2024 00:15

Merge branch 'master' into i7113-decode-cache-lib

1fc4c04

Merge branch 'master' into i7113-decode-cache-lib

bf76f70

Add instr_decode_cache_t support to opcode_mix; add module_mapper_t s…

45e062f

…upport to instr_decode_cache_t

Drop instr_ from instr_decode_cache

5e28112

Handle missing use_module_mapper case

0a33a51

Fix clang-format

29d10a3

Make add_decode_info simpler and fix build error

0e2df67

Cleanup

716a0ea

Proactive destruction of module mapper

fefe38b

Remove stale file

2f0a708

Move impl to cpp

84a2039

Fix when we use module mapper in opcode mix

141e3c5

abhinav92003 changed the title ~~i#7113: Add analyzer library to cache instr decode info~~ i#7113 decode cache: Add analyzer library for decode_cache_t Dec 16, 2024

Avoid DecodeInfo object construction if key exists.

894c675

derekbruening mentioned this pull request Jan 6, 2025

tool.scheduler.unit_tests broken on ARM/AArch32 #7173

Open

abhinav92003 added 4 commits January 6, 2025 16:25

Reviewer suggested edit

23dca98

Also include decode_pc in set_decode_info calls

7ebe54f

Cleanup and assert fix.

5dbc0fc

Fix doc xref

268e5a7

derekbruening approved these changes Jan 7, 2025

abhinav92003 added 3 commits January 7, 2025 13:13

Separate out make_decode_cache

88d0482

Move some logic out of make_module_mapper

0833242

Merge branch 'master' into i7113-decode-cache-lib

021e20c

This was referenced Jan 16, 2025

[drmemtrace analyzer] Segfault in module mapped by worker thread #7192

Open

i#7192 module map race: Skip unnecessary munmap in elf_loader_map_phdrs #7193

Merged

abhinav92003 added 5 commits January 22, 2025 15:16

Changes to allow 3p import, and other misc

c8a38b3

Throw error on different module_file_path

b01854b

Update copyright year

1347aef

Merge branch 'master' into i7113-decode-cache-lib

d171074

Address reviewer comments.

29684cb

abhinav92003 added 4 commits January 22, 2025 19:55

Merge branch 'master' into i7113-decode-cache-lib

8a67f4f

Pass verbosity to module_mapper

12da01f

Revert empty file

cfd3519

Mark init() as virtual for easier downstream use

31f4146

abhinav92003 requested a review from derekbruening January 23, 2025 18:08

derekbruening approved these changes Jan 23, 2025

abhinav92003 added 2 commits January 23, 2025 19:26

Address reviewer-suggested edits.

70339cd

Cast trace_pc before printing

49345b9
