GH-533: Adaptive Lossless Floating-Point (ALP) Encoding #548

alamb · 2026-01-14T19:50:30Z

This is a proposed implementation of the ALP described in

"ALP: Adaptive Lossless floating-Point Compression" (SIGMOD 2024, https://dl.acm.org/doi/10.1145/3626717

It is based on (largely a reformatted version of) @prtkgaur 's ALP Encoding Specification Google Doc

See rendered preview here: https://github.com/alamb/parquet-format/blob/alamb/alp/Encodings.md#adaptive-lossless-floating-point-adaptive_lossless_floating_point--10

Rationale for this change

This encoding has the following properties:

Targets real-world floating-point (IEEE 754) data.
It achieves higher compression ratios (close to ZSTD)
Much faster to decompress than zstd (and other floating point algorithms)

See Mailing List Discussion: https://lists.apache.org/thread/tjtln1mmjqfoql1ls2dw9xpdk91r1909

Source ALP Results Document

(Todo summarize the mailing list discussion here)

What changes are included in this PR?

Closes [Proposal] Add ALP encoding support in parquet file format #533

Do these changes have PoC implementations?

Yes

C/C++: GH-48701: [C++][Parquet] Add ALPpd encoding arrow#48345

apacheGH-533: Add ALP Encoding

f70742c

alamb force-pushed the alamb/alp branch from a7d986c to f70742c Compare January 14, 2026 20:17

alamb marked this pull request as ready for review January 14, 2026 20:17

alamb changed the title ~~GH-533: Add ALP Encoding~~ GH-533: Adaptive Lossless Floating-Point (ALP) Encoding Jan 14, 2026

Add to parquet.thrift

c0638f1

alamb mentioned this pull request Jan 14, 2026

[Proposal] Add ALP encoding support in parquet file format #533

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GH-533: Adaptive Lossless Floating-Point (ALP) Encoding #548

GH-533: Adaptive Lossless Floating-Point (ALP) Encoding #548

Uh oh!

alamb commented Jan 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

GH-533: Adaptive Lossless Floating-Point (ALP) Encoding #548

Are you sure you want to change the base?

GH-533: Adaptive Lossless Floating-Point (ALP) Encoding #548

Uh oh!

Conversation

alamb commented Jan 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rationale for this change

What changes are included in this PR?

Do these changes have PoC implementations?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

alamb commented Jan 14, 2026 •

edited

Loading