SAX-like API spec #465
Replies: 1 comment
-
I've been thinking about this, and I don't think I would consider this valuable as a specification. All our current specs are very specifically about ensuring data interchange, but specifying a SAX-like interface seems to me kinda like specifying a concrete format for format-preserving implementations. This very much seems to be the kind of thing that individual implementations might want to differentiate themselves with in unique ways that have different cost/benefit tradeoffs, not something that would be one-size-fits-all (particularly when you're talking about impls in many different languages with different idiomatic representations of e.g. enumerated constants, or heterogenous or sum types. Just my initial thoughts. I think this conversation is worth continuing and far be it from me to shoot stuff down 💚 |
Beta Was this translation helpful? Give feedback.
-
The ckdl parser is event-based, inspired by SAX. I wonder if it is worth standardising this API by creating a language-agnostic spec for it.
The benefit to having this API spec would be to encourage other implementations to expose a streaming API to facilitate processing of very large KDL documents without running into memory issues, or efficiently parse a KDL document being read over a slow IO stream.
This API would go hand-in-hand with the Data Model.
Currently, this is what ckdl's events API looks like:
EOF
— Regular end of document (do not continue reading)PARSE_ERROR
— Parse error (do not continue reading)START_NODE
— Start of a nodeEND_NODE
— End of a node. EverySTART_NODE
event is followed, some time later, by anEND_NODE
event (barring a parse error)ARGUMENT
— An argument to the most recently started nodePROPERTY
— A property of the most recently started nodeCOMMENT
— Normally not produced (parser's can emit this if they choose to)NOTE: ckdl specifically defines
KDL_EVENT_COMMENT
as 0x10000 in order to bitwise OR with the other event types, facilitating slashdash commented elements. This may be considered an implementation detail and therefore may not be stipulated in the spec. It could also be represented as a booleancommented
property in the event data.Each event is associated with certain event data:
event
— The event typename
— The name of the node (withSTART_NODE
), or the name of the property (withPROPERTY
)value
— The value of the argument (withARGUMENT
) or of the property (withPROPERTY
), including any potential type annotation.Note: ckdl also uses
value
to store the type annotation of theSTART_NODE
event, encoded as a(type)#null
KDL value. I feel this is a bit strange from an API point of view. While I do see the benefit of re-using thevalue
property of the event data, I feel the API might make more sense to have an explicittype
property, even if this results in redundancy due to a KDL Value itself also storing type annotation. Another possible alternative is to representSTART_NODE
using anode
property, which holds the name and type annotation with empty values forarguments
,properties
, andchildren
.Potential names for this proposed API:
Beta Was this translation helpful? Give feedback.
All reactions