Skip to content

[Python] CSV reader returns different values in 23.0.0 #49003

@rhshadrach

Description

@rhshadrach

Describe the bug, including details regarding any error messages, version, and platform.

This is currently causing failures on the pandas CI.

import pyarrow.csv

table = pyarrow.csv.read_csv(io.BytesIO(f"data\n10E-617".encode()))
print(table)
# pyarrow.Table  <-- pyarrow 22.0.0
# data: double
# ----
# data: [[0]]

# pyarrow.Table  <-- pyarrow 23.0.0
# data: string
# ----
# data: [["10E-617"]]

It's not clear to me if this was an intentional change. The only thing that looked possibly related from the release notes was

#47925

but I haven't bisected to find if this was the cause.

Component(s)

Python

Metadata

Metadata

Assignees

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions