fix: content encoding inference from meta tag, especially for various unusual format by edenw97 · Pull Request #8031 · mitmproxy/mitmproxy

edenw97 · 2025-12-30T23:29:28Z

fix the content encoding inference from meta tag, especially for various unusual format(unquoted, whitespace), follows https://html.spec.whatwg.org/#extracting-character-encodings-from-meta-elements

Description

this PR address multiple edge case when we infer content-encoding in the meta tag

Checklist

I have updated tests where applicable.
I have added an entry to the CHANGELOG.

…ous unusual format(unquoted, whitespace), follows https://html.spec.whatwg.org/#extracting-character-encodings-from-meta-elements

mhils

Thanks!

mitmproxy/net/http/headers.py

mhils · 2026-01-01T23:22:14Z

mitmproxy/net/http/headers.py

+            match = meta_charset.group(2) or meta_charset.group(3)
+            if match:
+                enc = match.decode("ascii", "ignore").strip()
+            else:


We already validated that meta_charset is not None, so can this case ever happen?

(charset="" would be an option, but then we should move the .strip() before the if as well)

Hi, do you mean move the strip() to before the decode?

Co-authored-by: Maximilian Hils <github@maximilianhils.com>

Yidong Wei and others added 7 commits December 30, 2025 15:21

fix the content encoding inference from meta tag, especially for vari…

9cd184d

…ous unusual format(unquoted, whitespace), follows https://html.spec.whatwg.org/#extracting-character-encodings-from-meta-elements

update test case

27ce7c1

nit

88efe5f

[autofix.ci] apply automated fixes

fdd95f3

changlog

feedbf8

nit

87b356f

update one more test case

20ba66f

mhils reviewed Jan 1, 2026

View reviewed changes

edenw97 and others added 6 commits January 5, 2026 11:26

Update mitmproxy/net/http/headers.py

5c1ea8c

Co-authored-by: Maximilian Hils <github@maximilianhils.com>

add test case

612f362

move strip to before decode

3103d7e

Merge branch 'main' into content-encoding-meta

098fa63

[autofix.ci] apply automated fixes

2732fe7

fix missing '>' in 2 test cases

a11dbd2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: content encoding inference from meta tag, especially for various unusual format#8031

fix: content encoding inference from meta tag, especially for various unusual format#8031
edenw97 wants to merge 13 commits intomitmproxy:mainfrom
edenw97:content-encoding-meta

edenw97 commented Dec 30, 2025 •

edited

Loading

Uh oh!

mhils left a comment

Uh oh!

Uh oh!

mhils Jan 1, 2026

Uh oh!

edenw97 Jan 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

edenw97 commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

mhils left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mhils Jan 1, 2026

Choose a reason for hiding this comment

Uh oh!

edenw97 Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

edenw97 commented Dec 30, 2025 •

edited

Loading

edenw97 Jan 5, 2026 •

edited

Loading