
The PyArrow revolution in Pandas

Track: PyData: Software Packages & Jupyter
Type: Talk
Level: intermediate
Room: Terrace 2A
Start: 14:35 on 10 July 2024
Duration: 30 minutes

Abstract

Pandas has long used NumPy for its back-end storage. But things are changing, and the future of Pandas will likely be closely tied to PyArrow. What are Arrow and PyArrow? How do they affect Pandas users today, and how will they affect us in the future? In this talk, I introduce PyArrow, tell you what it does, how we can already use it in our Pandas work, and whether that’s a good idea.