add zlib each_char by danini-the-panini · Pull Request #8608 · jruby/jruby

danini-the-panini · 2025-01-31T12:48:24Z

Resolves #8351

headius

Nice, new spec. 👍

enebo

Shoot, I forgot to submit this. As I said in the comments, this is landable as-is; I'm just adding these comments for posterity.

enebo · 2025-01-31T17:02:39Z

core/src/main/java/org/jruby/ext/zlib/JZlibRubyGzipReader.java

+                position++;
+                // TODO: must handle encoding. Move encoding handling methods to util class from RubyIO and use it.
+                // TODO: StringIO needs a love, too.
+                block.yield(context, runtime.newString(String.valueOf((char) (value & 0xFF))));


I totally understand how this code got here, looking at getc, but this has to be wrong: it uses a byte-oriented API and returns each byte as a char. Perhaps no one reads multi-byte chars through this API; I would think #getc would have had a bug report by now, since it has the same logic in it.
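To illustrate the concern, here is a minimal, self-contained sketch. It is not JRuby code, and the class and method names are hypothetical. It applies the same per-byte cast as the diff to the UTF-8 bytes of a single two-byte character:

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

class ByteAsCharDemo {
    // Mirrors the per-byte logic in the diff: every byte becomes its own
    // one-character String, regardless of the stream's encoding.
    static List<String> perByte(byte[] data) {
        List<String> out = new ArrayList<>();
        for (byte value : data) {
            out.add(String.valueOf((char) (value & 0xFF)));
        }
        return out;
    }

    public static void main(String[] args) {
        // U+00E9 (e with acute accent) encodes to two bytes in UTF-8: 0xC3 0xA9.
        byte[] data = "\u00e9".getBytes(StandardCharsets.UTF_8);
        List<String> pieces = perByte(data);
        // Two strings come back (U+00C3 and U+00A9) instead of one character.
        System.out.println(pieces.size());
    }
}
```

With this logic a caller iterating "chars" would see two unrelated Latin-1 characters where the inflated data contains one.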

Another amusing thing to notice is MRI unit tests have no tests for this method. No wonder we are missing it. You will be the first person to actually add a test for this!

I am OK landing this if you open an issue noting that both each_char and getc are not reconstructing chars but are yielding single bytes. It appears initialize will process opts and set the encoding, so I think we just need to use that and do something like RubyString#enumerateChars, where we use StringSupport to find the char's length and take that many bytes as a new char.
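A rough sketch of that idea follows, assuming UTF-8 only. The hand-written lead-byte lookup is a stand-in for whatever StringSupport would compute from the reader's actual encoding, and none of the names below come from JRuby:

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

class Utf8CharSplitter {
    // Hypothetical stand-in for an encoding-aware length lookup: the number of
    // bytes in the UTF-8 sequence that begins with this lead byte.
    static int utf8Length(byte lead) {
        int b = lead & 0xFF;
        if (b < 0x80) return 1;
        if ((b & 0xE0) == 0xC0) return 2;
        if ((b & 0xF0) == 0xE0) return 3;
        if ((b & 0xF8) == 0xF0) return 4;
        return 1; // invalid lead byte: fall back to a single byte
    }

    // Walks the byte array, taking as many bytes as the lead byte announces
    // and emitting them together as one character.
    static List<String> eachChar(byte[] data) {
        List<String> out = new ArrayList<>();
        int pos = 0;
        while (pos < data.length) {
            // Clamp so a truncated trailing sequence cannot overrun the array.
            int len = Math.min(utf8Length(data[pos]), data.length - pos);
            out.add(new String(data, pos, len, StandardCharsets.UTF_8));
            pos += len;
        }
        return out;
    }

    public static void main(String[] args) {
        // 'a' (1 byte), U+00E9 (2 bytes), U+20AC euro sign (3 bytes).
        byte[] data = "a\u00e9\u20ac".getBytes(StandardCharsets.UTF_8);
        System.out.println(eachChar(data).size());
    }
}
```

A real fix would presumably read the length from the configured encoding rather than hard-coding UTF-8, and would have to handle a character whose bytes straddle two reads from the inflater.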

I should add: since there are no tests, perhaps each_char really does only return bytes. It would be a pretty big misnaming if so.

In MRI it does return multi-byte chars, and it should work the same as calling each_char on the inflated string.

@danini-the-panini so then perhaps you want to take a look at fixing that for both each_char and getc in zlib? It will largely just be looking at our String-related code and doing the same thing.

(as a new issue)

Issue created: #8621

add zlib each_char

41459fe

danini-the-panini force-pushed the zlib-fixes branch from f5d0f17 to 41459fe Compare January 31, 2025 12:50

headius added this to the JRuby 9.4.13.0 milestone Feb 8, 2025

headius approved these changes Feb 8, 2025

headius merged commit b81b8ea into jruby:master Feb 8, 2025
95 checks passed

enebo approved these changes Feb 9, 2025

danini-the-panini deleted the zlib-fixes branch February 11, 2025 09:00

headius merged 1 commit into jruby:master from danini-the-panini:zlib-fixes

danini-the-panini commented Jan 31, 2025

headius left a comment

enebo left a comment

enebo Jan 31, 2025

enebo Jan 31, 2025

danini-the-panini Feb 11, 2025

enebo Feb 11, 2025

enebo Feb 11, 2025

danini-the-panini Feb 11, 2025

3 participants
