fix weird boto docstrings #656

thomasballinger · 2016-11-18T03:56:09Z

Boto is doing something pretty weird: in Python 3, it makes it possible to end up with bytestring docstrings. We fix this here by always assuming utf8 in this case. Previously we assumed ascii, and did it implicitly by letting string.split(u'\n') turn it into unicode, which was no good.

sebastinas · 2016-11-18T11:29:54Z

bpython/curtsiesfrontend/replpainter.py

+    elif isinstance(docstring, str if py3 else unicode):
+        pass
+    else:
+        return []


Is the elif and else really necessary? Or in other words: does the elif really cover all valid cases?

The cases to cover:

Py2 bytes -> decode
Py2 unicode -> nop
Py2 something else (integer etc) -> abort

Py3 bytes -> shouldn't happen, but decode
Py3 bytes -> nop
Py3 something else -> abort

Might be nicer to:

if unicode: pass else: try: docstring = docstring.decode

To answer your question, docstrings should always be unicode in python 3, and in Python 2 they should always be bytestrings. (since we're getting them from pydoc.getdoc, which does this normalization) If we got a unicode string somehow in Python 2 that would be ok, but I don't know how that would happen. If we got a bytestring in Python3, which shouldn't happen, we would try to decode. So this does cover all valid cases, but it covers some extra too.

Now that I see where docstring comes from (pydoc.getdoc) I agree that the else isn't necessary.

The correct thing to do here is to find out the encoding of the source file the docstring comes from, since it doesn't have to be utf8, or at least catch errors here so a bad docstring doesn't crash bpython.

fix weird boto docstrings

ab1cbec

thomasballinger mentioned this pull request Nov 18, 2016

Something about boto3 crashes bpython #653

Closed

sebastinas reviewed Nov 18, 2016

View reviewed changes

sebastinas merged commit f4f05b2 into master Nov 19, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix weird boto docstrings #656

fix weird boto docstrings #656

Uh oh!

thomasballinger commented Nov 18, 2016 •

edited

Loading

Uh oh!

sebastinas Nov 18, 2016 •

edited

Loading

Uh oh!

thomasballinger Nov 18, 2016 •

edited

Loading

Uh oh!

thomasballinger Nov 18, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

fix weird boto docstrings #656

fix weird boto docstrings #656

Uh oh!

Conversation

thomasballinger commented Nov 18, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sebastinas Nov 18, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

thomasballinger Nov 18, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

thomasballinger Nov 18, 2016

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

thomasballinger commented Nov 18, 2016 •

edited

Loading

sebastinas Nov 18, 2016 •

edited

Loading

thomasballinger Nov 18, 2016 •

edited

Loading