Skip to content

Conversation

@phofl
Copy link
Member

@phofl phofl commented Dec 30, 2022

- Performance improvement when iterating over pyarrow and nullable dtypes (:issue:`49825`, :issue:`49851`)
- Performance improvements to :func:`read_sas` (:issue:`47403`, :issue:`47405`, :issue:`47656`, :issue:`48502`)
- Memory improvement in :meth:`RangeIndex.sort_values` (:issue:`48801`)
- Performance improvement in :meth:`Series.to_numpy` if ``copy=True`` but data was copied anyway (:issue:`24345`)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

data was copied anyway is kinda confusing since I would expect copy=True to have made a copy of the data. Is this supposed to refer to the fact that the data is no longer copied twice?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not sure what I was thinking here, this makes no sense, your interpretation is correct. Will adjust accordingly

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

@mroeschke mroeschke added Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate NA - MaskedArrays Related to pd.NA and nullable extension arrays labels Jan 3, 2023
@mroeschke mroeschke added this to the 2.0 milestone Jan 5, 2023
@mroeschke mroeschke merged commit 60dc3f5 into pandas-dev:main Jan 5, 2023
@mroeschke
Copy link
Member

Thanks @phofl

@phofl phofl deleted the 48951 branch January 6, 2023 07:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Missing-data np.nan, pd.NaT, pd.NA, dropna, isnull, interpolate NA - MaskedArrays Related to pd.NA and nullable extension arrays

Projects

None yet

Development

Successfully merging this pull request may close these issues.

BUG: 'Series.to_numpy(dtype=, na_value=)' behaves differently with 'pd.NA' and 'np.nan' Avoid double copy in Series/Index.to_numpy

2 participants