Cache parsed SQL in `buildDiagram` Function #1418

ahtrotta · 2023-10-05T20:31:09Z

When loading a sequence diagram, buildDiagram is one of the most expensive function calls. When it uses the hash getter on an instance of Event, the buildStableHash function is called, which ultimately parses the event's SQL, if it has any. Parsing SQL is a relatively expensive operation, so this adds up to a lot of computation when the sequence diagram contains many events with SQL. In a flame graph of the load, peg$parseRule accounts for 41% of buildDiagram.

This PR creates an LRU cache in the buildDiagram function and passes it into the getHashWithSqlCache method on the Event class, so that we don't repeat the computation needlessly.
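
As a rough illustration of that flow, here is a minimal sketch. Every name in it (SketchEvent, hashWithSqlCache, parseSql, buildDiagramSketch) is a hypothetical stand-in and not the actual @appland/models or @appland/sequence-diagram code. A plain Map stands in for the LRU cache to keep the sketch short, and the "parser" only normalizes whitespace and case.

```typescript
// Hypothetical stand-ins: none of these names are the real @appland/models API.
type SqlCache = Map<string, string>;

let parseCalls = 0;

// Stand-in for the expensive SQL parser; it only normalizes whitespace and case.
function parseSql(sql: string): string {
  parseCalls += 1;
  return sql.trim().replace(/\s+/g, ' ').toLowerCase();
}

class SketchEvent {
  readonly id: number;
  readonly sql?: string;

  constructor(id: number, sql?: string) {
    this.id = id;
    this.sql = sql;
  }

  // Similar in spirit to getHashWithSqlCache: reuse the parsed form when it is cached.
  hashWithSqlCache(cache: SqlCache): string {
    if (this.sql === undefined) return `event:${this.id}`;
    let parsed = cache.get(this.sql);
    if (parsed === undefined) {
      parsed = parseSql(this.sql);
      cache.set(this.sql, parsed);
    }
    return `sql:${parsed}`;
  }
}

// The cache is created per call, so it lives only as long as one diagram build.
function buildDiagramSketch(events: SketchEvent[]): string[] {
  const cache: SqlCache = new Map(); // the PR uses an LRU cache here
  return events.map((event) => event.hashWithSqlCache(cache));
}

const hashes = buildDiagramSketch([
  new SketchEvent(1, 'SELECT * FROM users'),
  new SketchEvent(2, 'SELECT * FROM users'),
  new SketchEvent(3),
  new SketchEvent(4, 'SELECT * FROM orders'),
]);
console.log(hashes, parseCalls);
```

In this sketch, the two events that share a query string trigger only one parse, so three SQL events cost two parser calls.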

This will need to be split up into two PRs so that we can release a version of @appland/models that will get used in @appland/sequence-diagram.

kgilpin · 2023-10-05T23:43:37Z

Take a look at appmap-js/packages/scanner/src/appMapIndex.ts (line 6 in f0ede48):

    const ASTBySQLString = new LRUCache<string, QueryAST | 'parse-error'>({ max: 1000 });

which also caches the parse tree.
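
The value type in that line, QueryAST | 'parse-error', suggests that failed parses are cached as well, so an unparseable query is not retried. The sketch below illustrates that idea under stated assumptions: SketchAST, parseOrThrow and parseCached are hypothetical names, the parser is a toy, and a plain Map stands in for the LRUCache.

```typescript
// Hypothetical types and parser; the scanner's real QueryAST and parser differ.
type SketchAST = { tables: string[] };
type CachedParse = SketchAST | 'parse-error';

// The scanner uses an LRUCache with max: 1000; a Map keeps this sketch dependency-free.
const astBySql = new Map<string, CachedParse>();
let parseAttempts = 0;

// Toy parser: extracts the table name after FROM and throws when there is none.
function parseOrThrow(sql: string): SketchAST {
  parseAttempts += 1;
  const table = /\bfrom\s+(\w+)/i.exec(sql)?.[1];
  if (table === undefined) throw new Error(`cannot parse: ${sql}`);
  return { tables: [table] };
}

// Returns the cached AST, or undefined when the query is known not to parse.
function parseCached(sql: string): SketchAST | undefined {
  let entry = astBySql.get(sql);
  if (entry === undefined) {
    try {
      entry = parseOrThrow(sql);
    } catch {
      entry = 'parse-error'; // remember the failure so it is not retried
    }
    astBySql.set(sql, entry);
  }
  return entry === 'parse-error' ? undefined : entry;
}

const good = parseCached('SELECT * FROM users');
parseCached('SELECT * FROM users'); // served from the cache
const bad = parseCached('not sql');
parseCached('not sql'); // the cached failure is reused, no second parse
console.log(good, bad, parseAttempts);
```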

kgilpin · 2023-10-05T23:44:11Z

packages/models/src/event.js

@@ -19,6 +19,8 @@ function alias(obj, prop, alias) {
 // This class supercedes `CallTree` and `CallNode`. Events are stored in a flat
 // array and can also be traversed like a tree via `parent` and `children`.
 export default class Event {
+  static parsedSqlCache = {};


Consider a Map instead of Object.
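
One general reason to prefer a Map for a string-keyed cache, sketched below with made-up keys: a plain object inherits properties from Object.prototype, so a lookup can report a hit for a key that was never stored, while a Map only knows the keys that were set. This illustrates the language behavior and is not code from the PR.

```typescript
// A plain object used as a cache inherits keys from Object.prototype.
const objectCache: Record<string, unknown> = {};
// Nothing was stored, yet the lookup finds the inherited Object constructor.
const falseHit = objectCache['constructor'] !== undefined;

// A Map only reports keys that were explicitly set, and it tracks its own size.
const mapCache = new Map<string, unknown>();
const mapHit = mapCache.has('constructor');
mapCache.set('SELECT 1', { parsed: true });

console.log(falseHit, mapHit, mapCache.size);
```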

kgilpin

It’s problematic to keep a cache of every SQL query in memory because tools that process many AppMaps will end up consuming a lot of memory doing this. That’s why the scanner uses an LRU cache.

Adding a cache to the Event will impact a lot of code in addition to the code we are trying to optimize.

Can we add the cache we need in a way that’s more specific to the use case? E.g., in the DiagramComponent, or as an LRU cache in buildDiagram.
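
To make the memory argument concrete, here is a minimal LRU sketch built on Map insertion order. TinyLru is a hypothetical class written for illustration, not the lru-cache package the scanner uses. The point is that the number of entries never exceeds max, no matter how many distinct queries pass through.

```typescript
// Minimal LRU cache built on Map's insertion order; a sketch, not the lru-cache package.
class TinyLru<K, V> {
  private readonly entries = new Map<K, V>();
  private readonly max: number;

  constructor(max: number) {
    this.max = max;
  }

  get(key: K): V | undefined {
    if (!this.entries.has(key)) return undefined;
    const value = this.entries.get(key) as V;
    // Re-insert so this key becomes the most recently used.
    this.entries.delete(key);
    this.entries.set(key, value);
    return value;
  }

  set(key: K, value: V): void {
    this.entries.delete(key);
    this.entries.set(key, value);
    if (this.entries.size > this.max) {
      // The first key in iteration order is the least recently used.
      const oldest = this.entries.keys().next();
      if (!oldest.done) this.entries.delete(oldest.value);
    }
  }

  get size(): number {
    return this.entries.size;
  }
}

const lru = new TinyLru<string, string>(2);
lru.set('SELECT 1', 'ast-1');
lru.set('SELECT 2', 'ast-2');
lru.get('SELECT 1'); // touch: 'SELECT 2' is now the least recently used
lru.set('SELECT 3', 'ast-3'); // exceeds max, so 'SELECT 2' is evicted
console.log(lru.size);
```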

ahtrotta · 2023-10-09T21:11:17Z

> It’s problematic to keep a cache of every SQL query in memory because tools that process many AppMaps will end up consuming a lot of memory doing this. That’s why the scanner uses an LRU cache.
>
> Adding a cache to the Event will impact a lot of code in addition to the code we are trying to optimize.
>
> Can we add the cache we need in a way that’s more specific to the use case? E.g., in the DiagramComponent, or as an LRU cache in buildDiagram.

I just changed how this works: buildDiagram now creates an LRU cache and passes it into the new getHashWithSqlCache method on the Event class, which can be used instead of the hash getter. This won't impact any of the other tools.

ahtrotta force-pushed the feat/cache-parsed-sql branch from e6b6fbc to 5d6f222 Compare October 5, 2023 20:32

ahtrotta changed the title ~~feat: Cache parsed SQL in Event class~~ Cache parsed SQL in Event class Oct 5, 2023

ahtrotta mentioned this pull request Oct 4, 2023

Improve sequence diagram load time #1412

Closed

ahtrotta marked this pull request as ready for review October 5, 2023 21:28

kgilpin reviewed Oct 5, 2023

kgilpin reviewed Oct 6, 2023

ahtrotta force-pushed the feat/cache-parsed-sql branch 2 times, most recently from 76ca47a to e6e3cdd Compare October 9, 2023 21:01

kgilpin requested changes Oct 9, 2023

feat: Cache parsed SQL when building sequence diagram

a6b3766

ahtrotta force-pushed the feat/cache-parsed-sql branch from e6e3cdd to a6b3766 Compare October 9, 2023 21:20

ahtrotta changed the title ~~Cache parsed SQL in Event class~~ Cache parsed SQL in buildDiagram Function Oct 9, 2023
