Add JSON Reading/Writing Support for Parquet Files #79649

Merged
Avogar merged 34 commits into ClickHouse:master from nihalzp:parquet-json-support on Jun 3, 2025

@nihalzp nihalzp commented Apr 27, 2025

Changelog category (leave one):

  • New Feature

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Support writing ClickHouse JSON columns to Parquet and reading Parquet JSON columns directly as ClickHouse JSON columns.

Documentation entry for user-facing changes

  • Documentation is written (mandatory for new features)

Usage

Write:

CREATE TABLE example (
    id        Int32,
    json_str  String,
    json_col  JSON
) ENGINE = MergeTree
ORDER BY id;

INSERT INTO example (id, json_str, json_col) VALUES
    (1, '{"user":"alice","age":30}',         '{"user":"alice","age":30}'),
    (2, '{"items":[1,2,3],"flag":true}',     '{"items":{"yes":[1,2,3]},"flag":true}'),
    (3, '{"meta":{"x":10,"y":20}}',          null);

SELECT * FROM example INTO OUTFILE 'json_to_pq.parquet' TRUNCATE FORMAT Parquet;

-- json_col should have the Parquet logical type JSON
SELECT * FROM file('json_to_pq.parquet', ParquetMetadata);

Read:

SET input_format_parquet_enable_json_parsing = 1;

-- json_col should have the ClickHouse type JSON
DESCRIBE TABLE (
SELECT *
FROM file(
    'json_to_pq.parquet',
    Parquet
));
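
As a follow-up check that is not part of the original description, the query below is a minimal sketch of reading individual JSON paths straight from the file. It assumes `json_col` is inferred as the ClickHouse JSON type (as the DESCRIBE above is expected to show) and that the paths `user` and `age` exist, as they do in the first sample row. The aliases `user_name` and `user_age` are illustrative only.

```sql
-- Enable direct parsing of Parquet JSON columns into the ClickHouse JSON type.
SET input_format_parquet_enable_json_parsing = 1;

-- Read selected JSON paths as subcolumns; rows that lack a path return NULL for it.
SELECT
    id,
    json_col.user AS user_name,
    json_col.age  AS user_age
FROM file('json_to_pq.parquet', Parquet)
ORDER BY id;
```

Reading paths as subcolumns avoids re-parsing the JSON text in the query, which is the main reason to prefer the JSON type over a plain String column here.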

nihalzp commented Apr 27, 2025

Closes #78992

alexey-milovidov added the "can be tested" label on Apr 27, 2025

clickhouse-gh bot commented Apr 27, 2025

Workflow [PR], commit [fa87f7b]

clickhouse-gh bot added the "pr-feature" label on Apr 27, 2025

nihalzp commented Apr 28, 2025

The failed tests appear to be unrelated.

Avogar self-assigned this on Apr 28, 2025
baibaichen pushed a commit to Kyligence/ClickHouse that referenced this pull request Jun 29, 2025
Fix rebase issue:
- 20250626 ClickHouse#80931
- 20250604 ClickHouse#79649
- 20250502 ClickHouse#79180
- 20250416 ClickHouse#78485
- 20250306 ClickHouse#76662

Co-authored-by: liuneng1994

Labels

  • can be tested: Allows running workflows for external contributors
  • pr-feature: Pull request with new product feature
  • pr-synced-to-cloud: The PR is synced to the cloud repo

Development

Successfully merging this pull request may close these issues.

JSON support for Parquet import / export

4 participants