
Consider spans in output #35

Open
lizgzil opened this issue Apr 29, 2020 · 5 comments
@lizgzil (Contributor) commented Apr 29, 2020

The split_parser, split, and parser commands output tokens and the predictions for those tokens.

It may be worth considering a different type of output that gives the span (start and end offsets) of each reference/token rather than the tokens themselves.
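For illustration, here is a minimal sketch of what converting token predictions into span-based output could look like, assuming each token is a contiguous substring of the original text. The helper name and output fields are hypothetical, not part of the codebase:

```python
def tokens_to_spans(text, tokens, labels):
    """Turn (token, label) predictions into (start, end, label) spans.

    Hypothetical helper: it locates each token in the original text so
    the output carries character offsets instead of the token strings.
    """
    spans = []
    cursor = 0
    for token, label in zip(tokens, labels):
        # Search from the cursor so repeated tokens map to the right position.
        start = text.index(token, cursor)
        end = start + len(token)
        spans.append({"start": start, "end": end, "label": label})
        cursor = end
    return spans


spans = tokens_to_spans(
    "Smith J. 2019.",
    ["Smith", "J.", "2019."],
    ["author", "author", "year"],
)
# Each span points back into the original text, e.g. the first one is
# {"start": 0, "end": 5, "label": "author"}
```

A downstream consumer can then recover the surface form with `text[span["start"]:span["end"]]`, which also makes merging adjacent tokens of the same label trivial.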

@lizgzil lizgzil changed the title Consider output type Consider spans in output Apr 29, 2020
@nsorros commented Apr 30, 2020

I am not sure how controversial this would be, but it would definitely eliminate the need to merge tokens afterwards, as the algorithm would extract the start and end for each component in a QA fashion.

@ivyleavedtoadflax (Contributor) commented Apr 30, 2020

I thought of these outputs as placeholders. None of those scripts is suitable for production because they instantiate the model every time they make a prediction, so their utility is somewhat limited. That said, I think I implemented an --output flag which will dump the output to a JSON file.

@lizgzil (Contributor, Author) commented May 1, 2020

@ivyleavedtoadflax ok that makes sense re outputs.

In terms of the instantiation of the model, is it not true that

splitter_parser = SplitParser(config_file=MULTITASK_CFG)

instantiates the model and then you could do

reference_predictions = splitter_parser.split_parse(text)

as many times as you wanted without having to reinstantiate the model?
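The instantiate-once, predict-many pattern being described can be sketched as follows. This uses a dummy stand-in for SplitParser (the real class loads an actual model from the config file), so the class body here is illustrative only:

```python
class SplitParser:
    """Dummy stand-in for the real SplitParser, to illustrate the pattern."""

    load_count = 0  # tracks how many times the (expensive) model was built

    def __init__(self, config_file):
        # Expensive setup (loading the model) happens once, here.
        SplitParser.load_count += 1
        self.config_file = config_file

    def split_parse(self, text):
        # Cheap per-call prediction; the model is not reloaded.
        return [(token, "token") for token in text.split()]


splitter_parser = SplitParser(config_file="multitask.cfg")  # instantiate once
for doc in ("reference one", "reference two"):
    reference_predictions = splitter_parser.split_parse(doc)  # reuse freely
```

However many documents are fed through the loop, `__init__` (and hence the model load) runs exactly once.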

@nsorros commented May 1, 2020

> @ivyleavedtoadflax ok that makes sense re outputs.
>
> In terms of the instantiation of the model, is it not true that
>
> splitter_parser = SplitParser(config_file=MULTITASK_CFG)
>
> instantiates the model and then you could do
>
> reference_predictions = splitter_parser.split_parse(text)
>
> as many times as you wanted without having to reinstantiate the model?

Even though it is unrelated to this issue, I am almost 100% sure you are right. @ivyleavedtoadflax can confirm.

@ivyleavedtoadflax (Contributor)

Yup exactly right @lizgzil. That's not how I had done it in the split, parse, split_parse commands, which is why they are no good for prod.

@nsorros nsorros removed their assignment Feb 21, 2023