Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Binary (BEVE) support lacking for object keys #1248

Open
RazielXYZ opened this issue Aug 7, 2024 · 8 comments
Open

Binary (BEVE) support lacking for object keys #1248

RazielXYZ opened this issue Aug 7, 2024 · 8 comments
Labels
enhancement New feature or request

Comments

@RazielXYZ
Copy link

RazielXYZ commented Aug 7, 2024

A bit of an odd one as it seems weirdly specific.

Given a relatively simple repro such as:

struct kTest {
    int a, b;
    bool operator==(const kTest&) const = default;
};

template <>
struct std::hash<kTest> {
    std::size_t operator()(const kTest& s) const noexcept {
        return std::hash<int>{}(s.a) ^ (std::hash<int>{}(s.b) << 1);
    }
};

int main() {
    const std::unordered_map<kTest, int> testmap {{kTest{3, 7}, 0}};
    std::string buf;
    const auto ec = glz::write_binary(testmap, buf);
    std::cout << buf << std::endl;
    return 0;
}

The code fails to build with glaze 3.1.9 (from conan) as well as latest 3.2.1 (manually added to the project) on MSVC latest (17.10.5), with /std:c++latest (or 20, but that's no longer supported by glaze officially anyway).

Write_json works fine on the same data.

The errors shown are the same in both versions:

1>glaze\binary\write.hpp(102,57): error C2903: 'no_header': symbol is neither a class template nor a function template
1>glaze\binary\write.hpp(102,88): error C2143: syntax error: missing ')' before '('
1>glaze\binary\write.hpp(102,78): error C2888: 'forward': symbol cannot be defined within namespace 'detail'
1>glaze\binary\write.hpp(102,102): error C2888: 'forward': symbol cannot be defined within namespace 'detail'
1>glaze\binary\write.hpp(103,78): error C2888: 'forward': symbol cannot be defined within namespace 'detail'
1>glaze\binary\write.hpp(103,98): error C2888: 'forward': symbol cannot be defined within namespace 'detail'
1>glaze\binary\write.hpp(103,113): error C2059: syntax error: ')'

A similar error happens on gcc 13.2 or 14.0 as well.

@stephenberry
Copy link
Owner

Thanks for reporting this bug! Will look into it soon.

@stephenberry stephenberry added the bug Something isn't working label Aug 8, 2024
@stephenberry
Copy link
Owner

So, the problem here is that your key type is an object (struct) and not a string (or integer). JSON requires string keys and because BEVE (binary) must map to JSON it has the same requirement (although it allows integers are these can be quoted in JSON).

Being able to switch between JSON and BEVE and represent the same data is very important.

For your use case, what do you think a good approach would be to solve your problem?

@stephenberry stephenberry removed the bug Something isn't working label Aug 8, 2024
@RazielXYZ
Copy link
Author

RazielXYZ commented Aug 8, 2024

That would make sense, except replacing write_binary with write_json in that case appears to both build and work fine.

Would it maybe make sense to use the hash (as a string) as the key? I assume that would only work for hash-based containers though.
Alternatively, perhaps representing the key as some sort of custom formatted string containing the string representation of every component of the key - not sure if this would have to be user defined (some sort of ToString method for any object used as a key) or generated by glaze.

@RazielXYZ
Copy link
Author

RazielXYZ commented Aug 8, 2024

This is the output from write_json on the same map:
{"{\"a\":3,\"b\":7}":0}
Which seems to be pretty much an automatically generated to string from all members on the key, almost like it's using the key's serialization to json and just wrapping it up in a string again - which makes sense to me! So, not sure why the same would not work for write_binary.
It also works the same way for a normal (ordered) std::map as well (but write_binary still does not work on that).

@stephenberry
Copy link
Owner

Yes, for JSON we serialize the key type in JSON string form. But, it is a bad idea to do this with binary for a number of reasons:

  • It would make the binary specification dependent on the JSON specification, which is not desirable and would be making the specification more complex
  • Implementations of the binary format (BEVE) would require a JSON parser to work
  • It would be significantly less efficient than a binary formatted version of the key
  • It would make translation to other formats more complicated and slower

Could you explain what your use case is? In this example you could just pack your ints into a uint64_t and the binary would work fine and you'd get better hashing performance. But, I expect you have more complex objects you want to use as keys.

@RazielXYZ
Copy link
Author

Correct, the use case is just generally being able to serialize maps with more complex keys (that don't fit into 64 bits) to BEVE as well, primarily for better performance where needed.
I assume I could use a custom read/write step for key objects where that is the case, to pack them in whatever way seems fit for the given object (which, most likely, will end up being a string for most of them, considering the requirement for a key to be a string or int), or I can adjust the way I structure the data to make it more usable for this. Or I can try to figure out why my performance with json isn't as good as I'd like, in the cases where it's an issue.

@stephenberry
Copy link
Owner

As I think about this more, I think serializing the object key to BEVE and treating it as a BEVE string and then using that as the key would make sense. The problem is that this wouldn't translate into a meaningful string in JSON.

I would recommend handling this key serialization to/from strings right now yourself, as you've described. As, I don't want to rush into a solution in the binary specification itself.

Thanks for bringing this up and sharing your thoughts. As you figure out a solution feel free to post thoughts here. I'll keep this issue open and just reword it to address the core issue.

@stephenberry stephenberry changed the title write_binary fails when writing map with user-defined key Binary (BEVE) support lacking for object keys Aug 8, 2024
@stephenberry stephenberry added the enhancement New feature or request label Aug 8, 2024
@RazielXYZ
Copy link
Author

That sounds good! And I agree, no need to rush. If I get any useful ideas I'll add them here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants