The PyArrow revolution in Pandas
- Track: PyData: Software Packages & Jupyter
- Type: Talk
- Level: intermediate
- Room: Terrace 2A
- Start: 14:35 on 10 July 2024
- Duration: 30 minutes
Abstract
Pandas has long used NumPy for its back-end storage. But things are changing, and the future of Pandas will likely be tied closely to PyArrow. What are Arrow and PyArrow? How do they affect Pandas users today, and how will they affect us in the future? In this talk, I introduce PyArrow, explain what it does, show how we can already use it in our Pandas work, and discuss whether that’s a good idea.