Skip to content

Conversation

@jwils
Copy link
Collaborator

@jwils jwils commented Oct 6, 2025

This migrates the existing square internal gem to the open source elasticgraph repo.

@jwils jwils force-pushed the joshuaw/warehouse_lambda branch from d66bc3e to d4a0486 Compare October 6, 2025 17:56
This migrates the existing square internal gem to the open source elasticgraph repo.
@jwils jwils force-pushed the joshuaw/warehouse_lambda branch from d4a0486 to 5cfe8e8 Compare October 6, 2025 18:01
params = payload_body.fetch(:script).fetch(:params)
data = params.fetch("data").merge({
"id" => params.fetch("id"),
"__eg_version" => params.fetch("version")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the only metadata that needs to be set within EG? Can a comment be added for this?

# `elasticgraph-indexer` so that it can be used in place of the standard datastore indexing router:
#
# https://github.com/squareup/elasticgraph-public/blob/v0.18.0.5/elasticgraph-indexer/lib/elastic_graph/indexer/datastore_indexing_router.rb#L76
class WarehouseDumper
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will this only support JSONL or can parquet/avro/other options be supported in the future?


def compress(jsonl_data)
io = ::StringIO.new
gz = ::Zlib::GzipWriter.new(io)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Aside: Was snappy ever considered over gzip?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants