[cirqflow] Quantum runtime skeleton - part 3 #4584

mpharrigan · 2021-10-18T20:50:43Z

This contains a skeleton of an execution loop that consumes `QuantumExecutable` and saves `ExecutableResult`.

MichaelBroughton · 2021-10-18T23:09:51Z

cirq-google/cirq_google/workflow/quantum_runtime.py

+def execute(
+    rt_config: QuantumRuntimeConfiguration,
+    executable_group: QuantumExecutableGroup,
+    base_data_dir: str = ".",


Nit: Could we call this checkpoint_dir?

Also: why does this default to the current directory? I would have expected no checkpointing by default, unless a user explicitly requests it by providing a path to store the data.

mpharrigan · 2021-10-20

Maybe this is a question of intent, but I imagine the users of this workflow infrastructure will always be saving their data. I would like to encourage a pattern where data always makes a trip to persistent storage before analysis and plotting, to enforce separation of data generation from data processing. I've considered removing the return value from execute so users have to do it this way, but that's probably too drastic.

As such, the files aren't checkpoints(*) but rather the main point of the execute function.

(*) My view of checkpoint files is that they are a way of restarting a long computation from a known place, which is not the intent of the files written here.
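The "persist first, analyze from disk" pattern described in this reply could be sketched roughly as follows. This uses only the standard library as a stand-in for `cirq.to_json_gzip`; `generate_data`, `analyze`, the file name, and the data itself are hypothetical and not part of the PR.

```python
import gzip
import json
import os
import tempfile


def generate_data(data_dir: str) -> str:
    """Hypothetical data-generation step: the result goes straight to disk."""
    result = {"bitstrings": [[0, 1], [1, 1], [0, 0]]}  # made-up stand-in data
    path = os.path.join(data_dir, "result.json.gz")
    with gzip.open(path, "wt") as f:
        json.dump(result, f)
    return path


def analyze(path: str) -> float:
    """Hypothetical analysis step: it only sees what was persisted."""
    with gzip.open(path, "rt") as f:
        result = json.load(f)
    first_bits = [row[0] for row in result["bitstrings"]]
    return sum(first_bits) / len(first_bits)


with tempfile.TemporaryDirectory() as data_dir:
    saved_path = generate_data(data_dir)
    # Analysis takes a path, not an in-memory object, so it can be re-run later.
    mean_first_bit = analyze(saved_path)
```

Because `analyze` takes a path rather than a live object, the analysis can be repeated without re-running the (potentially expensive) data generation.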

MichaelBroughton · 2021-10-18T23:10:07Z

cirq-google/cirq_google/workflow/quantum_runtime.py

+    Args:
+        rt_config: The `cg.QuantumRuntimeConfiguration` specifying how to execute
+            `executable_group`.
+        executable_group: The `QuantumExecutableGroup` containing the executables to execute.


Nit: missing "cg."

MichaelBroughton · 2021-10-18T23:11:53Z

cirq-google/cirq_google/workflow/quantum_runtime.py

+    if rt_config.run_id is None:
+        run_id = str(uuid.uuid4())
+    else:
+        run_id = rt_config.run_id


How does one identify runs if they lose the code after runtime? Should we have a way to allow a user to specify a name they prefer?

mpharrigan · 2021-10-20

If you specify a run_id in rt_config (the QuantumRuntimeConfiguration), execute uses that.

Otherwise, you can look at your filesystem: the directory is named according to the run_id.
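A minimal sketch of how the run_id resolution and directory naming discussed here might fit together. The `if ... is None` logic mirrors the diff above; the helper names `resolve_run_id` and `make_run_dir` and the example run name are hypothetical.

```python
import os
import tempfile
import uuid
from typing import Optional


def resolve_run_id(configured_run_id: Optional[str]) -> str:
    """Use the caller's run_id if one was given, else a random UUID4 string."""
    if configured_run_id is None:
        return str(uuid.uuid4())
    return configured_run_id


def make_run_dir(base_data_dir: str, run_id: str) -> str:
    """Create the per-run directory, named after the run_id."""
    run_dir = os.path.join(base_data_dir, run_id)
    os.makedirs(run_dir)  # raises if the directory already exists
    return run_dir


with tempfile.TemporaryDirectory() as base_data_dir:
    named = make_run_dir(base_data_dir, resolve_run_id("my-experiment-v1"))
    anonymous = make_run_dir(base_data_dir, resolve_run_id(None))
    # Listing the base directory recovers every run_id, named or generated.
    run_ids_on_disk = sorted(os.listdir(base_data_dir))
```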

MichaelBroughton · 2021-10-18T23:15:12Z

cirq-google/cirq_google/workflow/quantum_runtime.py

+    for i, exe in enumerate(executable_group):
+        runtime_info = RuntimeInfo(execution_index=i)
+
+        if exe.params != tuple():
+            raise NotImplementedError("Circuit params are not yet supported.")
+
+        circuit = exe.circuit
+
+        if not hasattr(exe.measurement, 'n_repetitions'):
+            raise NotImplementedError("Only `BitstringsMeasurement` are supported.")
+
+        sampler_run_result = sampler.run(circuit, repetitions=exe.measurement.n_repetitions)
+
+        exe_result = ExecutableResult(
+            spec=exe.spec,
+            runtime_info=runtime_info,
+            raw_data=sampler_run_result,
+        )
+        cirq.to_json_gzip(exe_result, f'{base_data_dir}/{run_id}/ExecutableResult.{i}.json.gz')
+        exegroup_result.executable_results.append(exe_result)
+        print(f'\r{i+1} / {n_executables}', end='', flush=True)


If we expect this loop to grow in complexity as these Executables take on more features and functionality, do you think it might be worth structuring things a little more carefully? For example, we could allow a user to provide a list of callbacks that we guarantee to call at the end of each loop iteration. These callbacks could include things like logging, saving, timing, or any other functionality one might want to run after each iteration.

mpharrigan · 2021-10-20

Yes, yes, and yes.

The QuantumRuntimeConfiguration will gain new fields whose objects' methods serve as callbacks. Logging for sure will be factored out into a callback-like object.

My plan is to get this skeleton merged. The addition of callbacks with defaults and helper functions can be done stepwise and in a backwards-compatible manner.
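One way the callback idea from this thread could look, as a rough sketch only. `RuntimeConfigSketch`, `execute_sketch`, and the callback signature are hypothetical and are not the API added by this PR; the executables are plain callables standing in for `QuantumExecutable`.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

# Hypothetical callback signature: (execution index, result of that iteration).
IterationCallback = Callable[[int, Any], None]


@dataclass
class RuntimeConfigSketch:
    """Hypothetical stand-in for a runtime configuration carrying callbacks."""
    # An empty default keeps existing callers working unchanged.
    callbacks: List[IterationCallback] = field(default_factory=list)


def execute_sketch(config: RuntimeConfigSketch, executables: List[Callable[[], Any]]) -> List[Any]:
    results = []
    for i, exe in enumerate(executables):
        result = exe()
        results.append(result)
        # Every callback is called at the end of every iteration, in order.
        for callback in config.callbacks:
            callback(i, result)
    return results


n_executables = 3
progress_log: List[str] = []
saved: Dict[int, Any] = {}


def log_progress(i: int, result: Any) -> None:
    progress_log.append(f"{i + 1} / {n_executables}")


def save_result(i: int, result: Any) -> None:
    saved[i] = result  # a real callback might write to disk instead


executables = [lambda k=k: k * k for k in range(n_executables)]
results = execute_sketch(RuntimeConfigSketch([log_progress, save_result]), executables)
```

Defaulting `callbacks` to an empty list is what would let such a field be added later in a backwards-compatible way, as the reply suggests.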

MichaelBroughton · 2021-10-18T23:18:30Z

cirq-google/cirq_google/workflow/quantum_runtime.py

+        shared_runtime_info=SharedRuntimeInfo(run_id=run_id),
+        executable_results=list(),
+    )
+    cirq.to_json_gzip(exegroup_result, f'{base_data_dir}/{run_id}/ExecutableGroupResult.json.gz')


The GroupResult only gets written once, when it is empty? Do we want to write it again after it's been filled?

mpharrigan · 2021-10-20

Yeah, this is weird. Let me try to write up the motivation for the current structure and think about how to make it less weird.

Here's the motivation.

My proposed solution is to call cirq.to_json on the constituent parts of ExecutableGroupResult rather than ever saving the whole ExecutableGroupResult object. This is because of the different "lifetimes" of the constituent objects.

In this case, I'd like to introduce a bookkeeping dataclass

    @dataclasses.dataclass
    class ExecutableGroupFilesystemRecord:
        runtime_configuration_fn: str
        shared_runtime_info_fn: str
        executable_result_fns: List[str]

        def load(self):
            return ExecutableGroupResult(
                runtime_configuration=cirq.read_json(self.runtime_configuration_fn),
                shared_runtime_info=cirq.read_json(self.shared_runtime_info_fn),
                executable_results=[cirq.read_json(exe_fn) for exe_fn in self.executable_result_fns],
            )

to keep things under control.

Would you like me to implement that in the current PR or as a follow-on PR? It might be easier to review if this is kept in its own follow-on PR.
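To illustrate the bookkeeping idea, here is a self-contained sketch in which each constituent part is written to its own file and a record of filenames reassembles them. It substitutes the standard `json` module for `cirq.to_json` / `cirq.read_json` and plain dicts for the real result classes; `FilesystemRecordSketch`, `write_json`, `read_json`, and the file names are hypothetical.

```python
import dataclasses
import json
import os
import tempfile
from typing import Any, Dict, List


def write_json(obj: Any, path: str) -> str:
    """Stand-in for cirq.to_json: write one part and return its filename."""
    with open(path, "w") as f:
        json.dump(obj, f)
    return path


def read_json(path: str) -> Any:
    """Stand-in for cirq.read_json."""
    with open(path) as f:
        return json.load(f)


@dataclasses.dataclass
class FilesystemRecordSketch:
    runtime_configuration_fn: str
    shared_runtime_info_fn: str
    executable_result_fns: List[str] = dataclasses.field(default_factory=list)

    def load(self) -> Dict[str, Any]:
        # Reassemble the group result from its separately written parts.
        return {
            "runtime_configuration": read_json(self.runtime_configuration_fn),
            "shared_runtime_info": read_json(self.shared_runtime_info_fn),
            "executable_results": [read_json(fn) for fn in self.executable_result_fns],
        }


with tempfile.TemporaryDirectory() as run_dir:
    # Parts known up front are written once, before the loop.
    record = FilesystemRecordSketch(
        runtime_configuration_fn=write_json(
            {"run_id": "demo"}, os.path.join(run_dir, "config.json")),
        shared_runtime_info_fn=write_json(
            {"run_id": "demo"}, os.path.join(run_dir, "shared.json")),
    )
    # Each result gets its own file; earlier files are never rewritten.
    for i in range(2):
        fn = write_json({"execution_index": i},
                        os.path.join(run_dir, f"ExecutableResult.{i}.json"))
        record.executable_result_fns.append(fn)
    loaded = record.load()
```

Writing parts separately means a failure partway through the loop leaves the earlier result files intact and loadable.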

mpharrigan · 2021-10-21T18:33:10Z

@MichaelBroughton PTAL

Following #4584 comment.

![ExecutableGroupResult](https://user-images.githubusercontent.com/4967059/138186981-ef4c82fc-6f05-400e-a761-d9a9f0b3257d.png)

- Update the constituent parts only when necessary to avoid re-writing the full `ExecutableGroupResult` when only a part has changed. Re-writing would cause a performance hit and potential corruption risk.
- Add a new dataclass to keep track of filenames to make loading in the data easier; see the test.


google-cla bot added the cla: yes Makes googlebot stop complaining. label Oct 18, 2021

CirqBot added the size: L 250< lines changed <1000 label Oct 18, 2021

mpharrigan added the area/workflow label Oct 18, 2021

[cirqflow] Quantum runtime skeleton - part 3

6d944a4

mpharrigan force-pushed the 2021-10-execution-c branch from 57b53e0 to 6d944a4 Compare October 18, 2021 20:54

mpharrigan requested a review from MichaelBroughton October 18, 2021 21:07

mpharrigan marked this pull request as ready for review October 18, 2021 21:07

mpharrigan requested review from cduck, vtomole, wcourtney and a team as code owners October 18, 2021 21:07

mpharrigan assigned MichaelBroughton Oct 18, 2021

MichaelBroughton reviewed Oct 18, 2021


mpharrigan added 2 commits October 20, 2021 16:17

cg

5e854b0

Merge remote-tracking branch 'origin/master' into 2021-10-execution-c

1ccdb29

MichaelBroughton approved these changes Oct 25, 2021


mpharrigan added the automerge Tells CirqBot to sync and merge this PR. (If it's running.) label Oct 25, 2021

CirqBot added the front_of_queue_automerge CirqBot uses this label to indicate (and remember) what's being merged next. label Oct 25, 2021

Merge branch 'master' into 2021-10-execution-c

095530d

MichaelBroughton removed the front_of_queue_automerge CirqBot uses this label to indicate (and remember) what's being merged next. label Oct 25, 2021

CirqBot added the front_of_queue_automerge CirqBot uses this label to indicate (and remember) what's being merged next. label Oct 25, 2021

CirqBot merged commit 954a7b6 into quantumlib:master Oct 25, 2021

CirqBot removed automerge Tells CirqBot to sync and merge this PR. (If it's running.) front_of_queue_automerge CirqBot uses this label to indicate (and remember) what's being merged next. labels Oct 25, 2021

This was referenced Oct 25, 2021

[cirqflow] Execution Loop I/O #4599

Merged

[cirqflow] Quantum runtime skeleton #4556

Closed

rht pushed a commit to rht/Cirq that referenced this pull request May 1, 2023

[cirqflow] Quantum runtime skeleton - part 3 (quantumlib#4584)

bb8a3cb

This contains a skeleton of an execution loop that consumes `QuantumExecutable` and saves `ExecutableResult`
