Perform parsing off the main thread when Tree-sitter is enabled #17339

maxbrunsfeld · 2018-05-16T21:06:31Z

Overview

Currently, all parsing in Atom takes place on the main thread, whether you're using the new Tree-sitter system or the default TextMate system. In some circumstances, parsing can take long enough that it reduces the app's responsiveness. This PR makes it so that when you've enabled Tree-sitter, parsing will take place on a background thread so that it can never delay UI responses.

This will also improve Atom's overall parsing speed, because the native parser is now reading directly from the native superstring TextBuffer instance without any intermediate C++ -> JS -> C++ calls which add overhead and generate garbage.

Tasks

Update Atom to use the new Tree-sitter API that allows for multi-threaded parsing
When the buffer changes, parse asynchronously
When the buffer changes during a parse, enqueue another parse and record the changes
Parse buffers asynchronously when initially opening them

Related PRs

tree-sitter/tree-sitter#165
tree-sitter/node-tree-sitter#11
atom/superstring#65
tree-sitter/tree-sitter#170

maxbrunsfeld · 2018-05-23T17:11:21Z

When editing, parsing no longer impacts the app's frame rate at all:

maxbrunsfeld · 2018-05-23T23:03:27Z

Another interesting twist: parsing asynchronously is great for Atom's frame rate and responsiveness, but can cause Atom to do more work overall (bad for battery life) because for many edits, it has to render twice: once immediately after the edit, and then again once the parsing completes in order to update the syntax highlighting.

Here's a flame graph of this behavior:

I've just added a refinement to the algorithm to address this. Now, we perform a limited amount of parsing work immediately on the main thread. If this completes, we can re-render with the correct highlighting from the start. Only when parsing takes a long time do we need to do it in the background. The underlying Tree-sitter API is described in this PR.

Here's the new flame graph for the same editing action as above (the scale is different):

Parsing and resolving the parse promise only took half of a millisecond, so it's better for the user's energy usage to finish the parse up front and render only once.

maxbrunsfeld · 2018-05-23T23:09:26Z

The amount of synchronous work that we allow the parser to do is specified as a number of abstract 'parsing operations'. These don't correspond directly to any unit of time, but they just serve as a rough way to limit the the work we do that has very low overhead to keep track of.

I've currently hard-coded it to 1000 parsing operations. If we want, we could do something more sophisticated like measuring how long parsing operations take on average on the user's machine and tuning the limit appropriately. I don't think the exact number is too important though. We just want some limit on the amount of synchronous work we'll do.

maxbrunsfeld force-pushed the mb-async-parsing branch 4 times, most recently from cd6d86e to 9988e64 Compare May 22, 2018 18:47

maxbrunsfeld added 4 commits May 23, 2018 08:11

⬆️ text-buffer, tree-sitter

a66120a

Start work on async parsing

aced30d

Reparse again if there were changes since the last parse started

f6d2d57

Fix bug w/ empty node handling, comment TreeSitterHighlightIterator

3548abe

maxbrunsfeld force-pushed the mb-async-parsing branch 2 times, most recently from dd06761 to d297d50 Compare May 23, 2018 16:37

maxbrunsfeld added 2 commits May 23, 2018 09:42

🐎 Parse asynchronously when opening buffers

d4d57c2

Rename out-of-date property: layer -> languageMode

7a26674

maxbrunsfeld force-pushed the mb-async-parsing branch from d297d50 to 7a26674 Compare May 23, 2018 16:42

Allow some synchronous parsing to avoid unnecessary re-renders

53dfa83

maxbrunsfeld merged commit a78d682 into master May 23, 2018

maxbrunsfeld deleted the mb-async-parsing branch May 23, 2018 23:45

daviwil mentioned this pull request May 29, 2018

Iteration Plan: May 29, 2018 #17426

Closed

17 tasks

maxbrunsfeld mentioned this pull request Aug 24, 2018

Update Tree-sitter syntax highlighting synchronously for parses that complete sync #17923

Merged

maxbrunsfeld mentioned this pull request Nov 7, 2018

Handle changes to the included ranges set when re-parsing after an edit tree-sitter/tree-sitter#225

Merged

maxbrunsfeld mentioned this pull request Mar 14, 2019

Replace operation limit API with a clock-based timeout API tree-sitter/tree-sitter#301

Merged

maxbrunsfeld mentioned this pull request Mar 27, 2019

Update tree-sitter to v0.14.0 #19060

Merged

19 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Perform parsing off the main thread when Tree-sitter is enabled #17339

Perform parsing off the main thread when Tree-sitter is enabled #17339

maxbrunsfeld commented May 16, 2018 •

edited

Loading

maxbrunsfeld commented May 23, 2018

maxbrunsfeld commented May 23, 2018 •

edited

Loading

maxbrunsfeld commented May 23, 2018

Perform parsing off the main thread when Tree-sitter is enabled #17339

Perform parsing off the main thread when Tree-sitter is enabled #17339

Conversation

maxbrunsfeld commented May 16, 2018 • edited Loading

Overview

Tasks

Related PRs

maxbrunsfeld commented May 23, 2018

maxbrunsfeld commented May 23, 2018 • edited Loading

maxbrunsfeld commented May 23, 2018

maxbrunsfeld commented May 16, 2018 •

edited

Loading

maxbrunsfeld commented May 23, 2018 •

edited

Loading