Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replace Marshal by Granular_marshal in ocaml-index #1889

Open
wants to merge 8 commits into
base: main
Choose a base branch
from

Conversation

voodoos
Copy link
Collaborator

@voodoos voodoos commented Jan 17, 2025

Initial PR presentation by @Lucccyo:

The current implementation of ocaml-index uses Marshal to store on the disk the data.
Searching for occurrences on massive projects is time-consuming because the search loads all the data structures from the disk to perform the search.

This Pull Request aims to replace Marshal with a granular version to make the ocaml-index more efficient in reading.
It comes with two granular implementations of the data structures set and map, based on the Stdlib implementation.
During a search operation, the program lazily loads only the required part of the ocaml-index.
It works because the heavy nodes of the granular_map and granular_set have link indirections,
introducing serialization boundaries, which allows Marshal to delay the deserialization of their children.

voodoos added a commit to voodoos/merlin that referenced this pull request Jan 21, 2025
voodoos added a commit to voodoos/merlin that referenced this pull request Jan 22, 2025
Lucccyo and others added 2 commits January 22, 2025 16:46
Optimize use of reachable_words
Fix fd leak: always close granlar_marshal in_channel
Compressed index_format by de-duplicating filenames/longidents
Fix fd leak: keep latest in_channel open for granular_marshal
voodoos added a commit to voodoos/merlin that referenced this pull request Jan 22, 2025
voodoos added a commit to voodoos/merlin that referenced this pull request Jan 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants