Skip to content

refactor(read): split ArrowReader into DataFileReader, DataEvolutionReader and TableRead#246

Merged
JingsongLi merged 2 commits intoapache:mainfrom
JingsongLi:refactor_readers
Apr 15, 2026
Merged

refactor(read): split ArrowReader into DataFileReader, DataEvolutionReader and TableRead#246
JingsongLi merged 2 commits intoapache:mainfrom
JingsongLi:refactor_readers

Conversation

@JingsongLi
Copy link
Copy Markdown
Contributor

Purpose

Move the monolithic arrow/reader.rs into three focused modules under table:

  • data_file_reader.rs: regular Parquet file reading with schema evolution
  • data_evolution_reader.rs: column-merge reading for data evolution mode
  • table_read.rs: TableRead extracted from read_builder.rs

Also replaces assert! with proper error returns and adds bounds checking in attach_row_id.

Brief change log

Tests

API and Format

Documentation

…eader and TableRead

Move the monolithic arrow/reader.rs into three focused modules under table/:
- data_file_reader.rs: regular Parquet file reading with schema evolution
- data_evolution_reader.rs: column-merge reading for data evolution mode
- table_read.rs: TableRead extracted from read_builder.rs

Also replaces assert! with proper error returns and adds bounds checking
in attach_row_id.
Copy link
Copy Markdown

@jerry-024 jerry-024 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@JingsongLi JingsongLi merged commit fa8091e into apache:main Apr 15, 2026
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants