don't tolerate wrong te headers #3260

benoitc · 2024-08-06T21:02:51Z

Just follow the new specification here and accept introducing a breaking change. Also support multiple encodings on the same line.

benoitc · 2024-08-06T21:03:05Z

cc @pajod @tilgovi

Changes:

- Just follow the new TE specification (https://datatracker.ietf.org/doc/html/rfc9112#name-transfer-encoding) here and accept introducing a breaking change.
- Handle multiple TE values on one line.

**Breaking changes**: invalid headers and positions will now return an error.
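
A minimal sketch of the list handling described above, pieced together from the diff fragments quoted in this thread. It is not the merged code: `parse_transfer_encoding` and the two exception classes are hypothetical stand-ins, and the duplicate-`chunked` check is an assumption based on the RFC 9112 rule that chunked must not be applied more than once.

```python
class InvalidHeader(Exception):
    """Hypothetical stand-in: the header violates the spec."""


class UnsupportedTransferCoding(Exception):
    """Hypothetical stand-in: the coding is not handled by the server."""


def parse_transfer_encoding(value):
    """Return (chunked, force_close) for one Transfer-Encoding field value."""
    chunked = False
    force_close = False
    # Naive comma split, as in the quoted diff; quoted-strings are not handled.
    for val in (v.strip().lower() for v in value.split(",")):
        if val == "chunked":
            # Assumption: chunked applied twice is refused.
            if chunked:
                raise InvalidHeader("TRANSFER-ENCODING")
            chunked = True
        elif val == "identity":
            # Nothing may follow chunked.
            if chunked:
                raise InvalidHeader("TRANSFER-ENCODING")
        elif val in ("compress", "deflate", "gzip"):
            # chunked has to be the last coding in the list.
            if chunked:
                raise InvalidHeader("TRANSFER-ENCODING")
            force_close = True
        else:
            # Includes the empty element produced by "" or a stray comma.
            raise UnsupportedTransferCoding(value)
    return chunked, force_close
```

With this sketch, `gzip, chunked` is accepted and flags the connection for closing, `chunked, gzip` raises `InvalidHeader`, and an empty value falls through to `UnsupportedTransferCoding`.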

pajod · 2024-08-07T12:49:45Z

gunicorn/http/message.py

+                        if chunked:
+                            raise InvalidHeader("TRANSFER-ENCODING", req=self)
+                        self.force_close()
+                    else:


Now empty headers (or leading/trailing comma) are still reported as Unsupported, while you deliberately changed the other cases where the spec sanctions our outright refusal to Invalid. Maybe worth adding back the extra branch to avoid the (not wrong, but also not immediately clear) Unsupported transfer coding: "" message?

Do you mean we need to support an empty value for Transfer-Encoding?

Ie. Put back this:

    elif value.lower() == "":
        # lacking security review on this case
        # offer the option to restore previous behaviour, but refuse by default, for now
        self.force_close()

Shouldn't we rather handle special parsing for trailers instead? Though why an empty value?

I do not suggest a change in what we accept and what we refuse (here). I merely suggest revisiting the choice of Exception.

Some cases could be (theoretically) valid per the spec, yet unsupported by us. Others are Invalid. I reckon the first instance of encountering an empty val is always a case of Invalid, as the rfc9112 ABNF for Transfer-Encoding does not allow for empty headers (or consecutive commas outside quoting). So that branch (otherwise processed in else) should be added back. Not changing the fact that we refuse those. Merely swapping the exception.
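
To make the distinction concrete, here is a small sketch that only labels list elements and leaves the accept/refuse decision untouched. The `classify` helper and the `SUPPORTED` set are hypothetical, and treating an empty element as "invalid" follows the reading of the ABNF given in the comment above rather than an independent check of the spec.

```python
# Hypothetical set of codings the server recognises.
SUPPORTED = {"chunked", "identity", "compress", "deflate", "gzip"}


def classify(value):
    """Label each comma-separated element of a Transfer-Encoding value."""
    labels = []
    for val in (v.strip().lower() for v in value.split(",")):
        if val == "":
            # Empty header, or a leading/trailing/doubled comma.
            labels.append("invalid")
        elif val in SUPPORTED:
            labels.append("ok")
        else:
            # Possibly valid per the spec, but not handled here.
            labels.append("unsupported")
    return labels
```

Both "invalid" and "unsupported" elements would still be refused; the label only decides which exception is raised, which is the point of the suggestion.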

gunicorn/http/message.py

benoitc · 2024-08-07T12:53:28Z

We will ditch compatibility. It's a major version. Either we follow the spec or not, imo. I don't think it will create a major issue, especially if people use a proxy in front of gunicorn, which is encouraged. I prefer the simplicity there. We need to be clear about this change in the changelog.

For "", it's still handled: when you split by ",", each element can then either be supported (chunked, identity, deflate, ...) or not. Do you mean we need to handle the return differently for this case? To which part of the spec does it refer? We can add back this case if needed.

pajod · 2024-08-07T12:57:19Z

gunicorn/http/message.py

-                    if not self.cfg.tolerate_dangerous_framing:
+                # T-E can be a list
+                # https://datatracker.ietf.org/doc/html/rfc9112#name-transfer-encoding
+                vals = [v.strip() for v in value.split(',')]


Sorry, I was wrong earlier. This is not immediately incorrect (as everything containing quotes is rejected below). Might send a patch later to re-verbosify the comments warning about the complex nature of the list.

@pajod ok for me. I think it would be good if we can make a release later this week :)

What complex nature of the list, though?

@benoitc We split the list in unexpected places because of the comma in the quoted-string inside transfer-parameter inside transfer-coding. I suggested comments in #3273.
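
A short illustration of the split problem, using a made-up coding name and parameter. `naive_split` mirrors the list comprehension from the diff; `split_or_reject` is a hypothetical helper that makes the "reject anything with quotes" behaviour explicit, whereas in the discussed code that rejection happens implicitly because the fragments match no known coding.

```python
def naive_split(value):
    """Split on every comma, as the list comprehension in the diff does."""
    return [v.strip() for v in value.split(",")]


def split_or_reject(value):
    """Split on commas only when no quoted-string can be present."""
    if '"' in value:
        # A quoted transfer-parameter may contain commas, so a naive split
        # would cut it in the wrong place. Refuse instead of guessing.
        raise ValueError("quoted transfer-parameter not supported")
    return naive_split(value)


# 'foo' and 'param' are invented for the example.
fragments = naive_split('foo;param="a,b", chunked')
```

Here `fragments` comes out as three pieces, with the quoted parameter cut in half, even though the header carries only two transfer codings.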

pajod · 2025-01-15T23:48:30Z

gunicorn/http/message.py

+                        # safe option: nuke it, its never needed
+                        if chunked:
+                            raise InvalidHeader("TRANSFER-ENCODING", req=self)
+                    elif val.lower() in ('compress', 'deflate', 'gzip'):


I still do not see how this leads to anything but trouble. We read a list of encodings, handle one (even though, as hop-by-hop headers, they are all our responsibility only), act as if the others do not matter, and then leave it up to the app to figure out how to interpret the body. How could an application possibly deal with this behavior correctly?

https://peps.python.org/pep-3333/#other-http-features

https://datatracker.ietf.org/doc/html/rfc9110#section-7.6.1

WSGI servers must handle any supported inbound “hop-by-hop” headers on their own
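
The concern can be reproduced with a toy example. This is only a sketch: `chunk_encode` and `chunk_decode` are simplified, hypothetical helpers (no chunk extensions, no trailers), and the payload is made up. It simulates a request sent with `Transfer-Encoding: gzip, chunked` where the server removes only the chunked layer.

```python
import gzip


def chunk_encode(data, size=8):
    """Apply a simplified chunked coding: hex size, CRLF, data, CRLF."""
    out = b""
    for i in range(0, len(data), size):
        piece = data[i:i + size]
        out += b"%x\r\n" % len(piece) + piece + b"\r\n"
    return out + b"0\r\n\r\n"


def chunk_decode(data):
    """Undo chunk_encode; chunk extensions and trailers are not handled."""
    body = b""
    pos = 0
    while True:
        eol = data.index(b"\r\n", pos)
        size = int(data[pos:eol], 16)
        if size == 0:
            return body
        start = eol + 2
        body += data[start:start + size]
        pos = start + size + 2  # skip the chunk data and its trailing CRLF


payload = b"hello=world"                     # what the client meant to send
wire = chunk_encode(gzip.compress(payload))  # codings applied in list order
after_server = chunk_decode(wire)            # server strips only "chunked"
```

In this sketch `after_server` is still the gzip stream rather than `payload`, so the application would have to call `gzip.decompress` itself to recover the original body.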

benoitc changed the title ~~don't tolerate wrong te heade~~ don't tolerate wrong te headers Aug 6, 2024

benoitc force-pushed the fix-te branch from af1210d to 2ef4440 Compare August 6, 2024 21:46

benoitc force-pushed the fix-te branch from 2ef4440 to 555d2fa Compare August 6, 2024 21:47

benoitc merged commit ff2109e into master Aug 6, 2024
46 checks passed

pajod reviewed Aug 7, 2024

pajod reviewed Aug 7, 2024

iamsobanjaved mentioned this pull request Aug 17, 2024

chore: Upgrade Python requirements openedx/edx-platform#35305

Closed

pajod mentioned this pull request Jan 15, 2025

[bug] gunicorn re-chunks responses which were already chunked #3322

Open

pajod reviewed Jan 15, 2025
