Decimal Support for Binary Precision #91

wilwade · 2023-06-23T12:45:29Z

Currently this library only supports DECIMAL reading and writing when the precision is <= 18

To annotate the Parquet Spec: https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#decimal

DECIMAL can be used to annotate the following types:

int32: for 1 <= precision <= 9
int64: for 1 <= precision <= 18; precision < 10 will produce a
warning
fixed_len_byte_array: precision is limited by the array size. Length n
can store <= floor(log_10(2^(8*n - 1) - 1)) base-10 digits
binary: precision is not limited, but is required. The minimum number of
bytes to store the unscaled value should be used.

Test Files:

additional support for decimal type #81 (comment)
See Decimal test files here: https://github.com/apache/parquet-testing/tree/master/data

Related Issues:

Thanks to @nirmal82 in additional support for decimal type #81 bringing this up
Current Decimal Read Support Added: Add ability to read decimal columns #79
Current Decimal Write Support Added: Decimal Writer Support #90
Byte array Support added: Add support to byte array decimal fields #97

The text was updated successfully, but these errors were encountered:

YECHUNAN · 2023-08-13T21:21:11Z

I made a PR attempting to add rudimentary support for Decimal fields that are represented by byte arrays, which may have precision over 18.

Problem ======= Address #91 Solution ======== When encountering such byte array represented "Decimal" fields, parse them into raw buffers. Change summary: --------------- - Added code to parse "Decimal" type fields represented by byte arrays (fixed length or non-fixed length) into raw buffer values for further client side processing. - Added two test cases verifying the added code. - Loosen the precision check to allow values greater than 18 for byte array represented "Decimal" fields. Steps to Verify: ---------------- - Use the library to open a parquet file which contains a "Decimal" field represented by a byte array whose precision is greater than 18. - Before the change, library will throw an error saying precision cannot be greater than 18. - After the change, library will parse those fields to their raw buffer values and return records normally. --------- Co-authored-by: Wil Wade <[email protected]>

craxal · 2024-03-12T17:56:33Z

I suspect that the earlier pull request has caused some regression issues related to DECIMAL values. Some folks are reporting the following error:

missing option: typeLength (required for FIXED_LEN_BYTE_ARRAY)

From what I can gather, this occurs even if there are no FIXED_LEN_BYTE_ARRAY backed DECIMAL values (only INT64 in one case).

wilwade · 2024-03-13T12:55:02Z

@craxal the fix from @JasonYeMSFT released in v1.6.1 (just this morning) should fix it.

craxal · 2024-03-13T18:08:31Z

@wilwade Ah, yes, I think it does. Just tested it myself. Sorry, I thought the pull request had already been released.

craxal · 2024-09-09T21:50:04Z

Is there any status update on this item? We're hoping we can start parsing fixed length array decimals in the near future.

wilwade added good first issue Good for newcomers help wanted Extra attention is needed labels Jun 23, 2023

wilwade mentioned this issue Jun 23, 2023

additional support for decimal type #81

Closed

JasonYeMSFT mentioned this issue Jun 30, 2023

Unable to view new .parquet files using the new "Preview" function microsoft/AzureStorageExplorer#6990

Closed

3 tasks

JasonYeMSFT mentioned this issue Jul 7, 2023

Parquet Preview: Support decimal field with precision > 18 microsoft/AzureStorageExplorer#7042

Closed

YECHUNAN mentioned this issue Aug 11, 2023

Add support to byte array decimal fields #97

Merged

wilwade changed the title ~~Decimal Support for Precision > 18~~ Decimal Support for Binary Precision Aug 14, 2023

craxal mentioned this issue Mar 12, 2024

Unable to preview xxx .parquet file microsoft/AzureStorageExplorer#7807

Closed

3 tasks

craxal mentioned this issue Sep 9, 2024

Incorrect preview of parquet files with decimals microsoft/AzureStorageExplorer#7957

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decimal Support for Binary Precision #91

Decimal Support for Binary Precision #91

wilwade commented Jun 23, 2023 •

edited

Loading

YECHUNAN commented Aug 13, 2023

craxal commented Mar 12, 2024 •

edited

Loading

wilwade commented Mar 13, 2024

craxal commented Mar 13, 2024

craxal commented Sep 9, 2024

Decimal Support for Binary Precision #91

Decimal Support for Binary Precision #91

Comments

wilwade commented Jun 23, 2023 • edited Loading

YECHUNAN commented Aug 13, 2023

craxal commented Mar 12, 2024 • edited Loading

wilwade commented Mar 13, 2024

craxal commented Mar 13, 2024

craxal commented Sep 9, 2024

wilwade commented Jun 23, 2023 •

edited

Loading

craxal commented Mar 12, 2024 •

edited

Loading