Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PARQUET-2369: Clarify GZIP Support for Multiple GZIP Members #218

Merged
merged 1 commit into from
Nov 14, 2023

Conversation

tustvold
Copy link
Contributor

@tustvold tustvold commented Oct 19, 2023

See apache/parquet-testing#41 for more context

@tustvold tustvold changed the title Clarify GZIP Support for Multiple GZIP Members PARQUET-2369: Clarify GZIP Support for Multiple GZIP Members Oct 19, 2023
Copy link
Member

@pitrou pitrou left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, but can we perhaps surface this on the ML to get more exposure?

@tustvold
Copy link
Contributor Author

Posted to the mailing list - https://lists.apache.org/thread/zzdh20j8fhrrt3dmx6ow4dbbccmvh277

@emkornfield
Copy link
Contributor

+1 from me.

@amassalha
Copy link

Posted to the mailing list - https://lists.apache.org/thread/zzdh20j8fhrrt3dmx6ow4dbbccmvh277

@tustvold I don't see much progress in the mailing list, do you have a deadline for waiting for responds on it and moving forward?

@shangxinli
Copy link

LGTM, thanks for working on it.

@amassalha
Copy link

amassalha commented Oct 31, 2023

+1 from me.
@emkornfield
not sure if this what holds this merge but your review is still requested, can you please click "approve" if you have no comments?

@wgtmac
Copy link
Member

wgtmac commented Nov 2, 2023

+1 from me.
@emkornfield
not sure if this what holds this merge but your review is still requested, can you please click "approve" if you have no comments?

I will merge this if there is no objection by the end of this week.

@@ -58,6 +58,10 @@ formats) defined by [RFC 1952](https://tools.ietf.org/html/rfc1952).
If any ambiguity arises when implementing this format, the implementation
provided by the [zlib compression library](https://zlib.net/) is authoritative.

Readers should support reading pages containing multiple GZIP members, however,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Might be worth linking to the RFC of GZIP members: https://datatracker.ietf.org/doc/html/rfc1952#section-2.2

Copy link

@amassalha amassalha Nov 4, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

its already mentioned:

GZIP

A codec based on the GZIP format (not the closely-related "zlib" or "deflate"
formats) defined by [RFC 1952](https://tools.ietf.org/html/rfc1952).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Right to clarify I'm suggesting that we link "members" to its specific section where it is defined.

@amassalha
Copy link

@wgtmac are you planning to merge? I see all required approved were achieved

@wgtmac
Copy link
Member

wgtmac commented Nov 13, 2023

@wgtmac are you planning to merge? I see all required approved were achieved

I thought there is a minor suggestion: #218 (comment). Would you mind adding that?

@amassalha
Copy link

amassalha commented Nov 13, 2023

@wgtmac are you planning to merge? I see all required approved were achieved

I thought there is a minor suggestion: #218 (comment). Would you mind adding that?

@emkornfield @wgtmac
the suggesed link (adding https://datatracker.ietf.org/doc/html/rfc1952#section-2.2 instead of just https://tools.ietf.org/html/rfc1952) , leads to same page as the exists link https://tools.ietf.org/html/rfc1952
seems that this tools.ietf.org site doesn't provide an option to link to a spesific page/section
so I don't see a point to duplicate the link

@pitrou
Copy link
Member

pitrou commented Nov 13, 2023

seems that this tools.ietf.org site doesn't provide an option to link to a spesific page/section
so I don't see a point to duplicate the link

Agreed, let's not overdo this.

@emkornfield
Copy link
Contributor

seems that this tools.ietf.org site doesn't provide an option to link to a spesific page/section
so I don't see a point to duplicate the link

Agreed, let's not overdo this.

I'm ok with this.

@wgtmac wgtmac merged commit 6f3f909 into apache:master Nov 14, 2023
3 checks passed
@wgtmac
Copy link
Member

wgtmac commented Nov 14, 2023

I just merged this. Thanks everyone!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

9 participants