You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In interactive mode, the __repr__ for a Table differs depending on the run-time environment. The __repr__ includes Unicode characters and ANSI escape codes for colours when repr(table) is called from a Jupyter notebook but not when called from the Python or IPython interpreter.
outputs 3262 when run from Jupyter but 1206 when run from IPython or as a regular Python script.
I find both of these facts surprising:
that the __repr__ differs depending on the runtime environment
that the __repr__ sometimes contains non-ASCII characters
I expect that both are likely to cause problems in various workflows in ways that are hard to anticipate. But one simple example is when rendering notebooks via LaTeX. If the following cell appears in a notebook called ibis_str.ipynb along with its output:
currently fails with a very different exception when run in Python / IPython:
ParserException: Parser Error: zero-length delimited identifier at or near """"
versus the ValueError raised in a Jupyter notebook (as reported in issue #10514):
ValueError: Target schema's field names are not matching the table's field names: ...
I believe the standard approach would be for the Table class to have a single code path for __repr__ that produces the same ASCII string independent of the runtime environment and to define IPython-compatible methods like ._repr_pretty_, _repr_html_, and _repr_latex_ for fancier output in IPython and Jupyter.
If you agree that this would be an improvement, I can volunteer a PR as my first contribution to the project.
What version of ibis are you using?
9.5.0
What backend(s) are you using, if any?
No response
Relevant log output
No response
Code of Conduct
I agree to follow this project's Code of Conduct
The text was updated successfully, but these errors were encountered:
Hey @edschofield -- thanks for the thorough report!
I haven't delved into the repr lately. It would probably be worth first clarifying which of these behaviors is coming from rich, which we use for table formatting, and which are coming from Ibis, or Ibis' (possibly odd) use of rich.
Also, I don't view the use of unicode as a bug. Even if our display skeleton didn't have any unicode characters, a great number of our backends will emit them in result-sets.
but not when called from the Python or IPython interpreter.
This is not correct. Both Python and IPython will display unicode box characters for the table repr. Python will not render escape codes by default, but IPython will. This is probably due to a rich behavior or related configuration that we're using.
That said, I think the approach you're describing is the right one: ASCII for __repr__ (even if data would contain unicode), independent of runtime environment, and then we can get fancier with the Jupyter/IPython methods.
What happened?
In interactive mode, the
__repr__
for a Table differs depending on the run-time environment. The__repr__
includes Unicode characters and ANSI escape codes for colours whenrepr(table)
is called from a Jupyter notebook but not when called from the Python or IPython interpreter.For example, this code:
outputs
3262
when run from Jupyter but1206
when run from IPython or as a regular Python script.I find both of these facts surprising:
__repr__
differs depending on the runtime environment__repr__
sometimes contains non-ASCII charactersI expect that both are likely to cause problems in various workflows in ways that are hard to anticipate. But one simple example is when rendering notebooks via LaTeX. If the following cell appears in a notebook called
ibis_str.ipynb
along with its output:then converting it to PDF as follows fails with a LaTeX error due to the use of Unicode characters:
Another very surprising effect of the different code paths taken for
__repr__
depending on the run-time environment is that this code:currently fails with a very different exception when run in Python / IPython:
versus the
ValueError
raised in a Jupyter notebook (as reported in issue #10514):I believe the standard approach would be for the
Table
class to have a single code path for__repr__
that produces the same ASCII string independent of the runtime environment and to define IPython-compatible methods like._repr_pretty_
,_repr_html_
, and_repr_latex_
for fancier output in IPython and Jupyter.If you agree that this would be an improvement, I can volunteer a PR as my first contribution to the project.
What version of ibis are you using?
9.5.0
What backend(s) are you using, if any?
No response
Relevant log output
No response
Code of Conduct
The text was updated successfully, but these errors were encountered: