Apache Arrow (Python)
=====================

Arrow is a columnar in-memory analytics layer designed to accelerate big data workloads. It provides a set of canonical in-memory representations of flat and hierarchical data, along with bindings in multiple languages for manipulating those structures. It also provides inter-process communication (IPC) and implementations of common algorithms.

This is the documentation of the Python API of Apache Arrow. For more details on the format and other language bindings, see the main page for Arrow. Here we will only detail the usage of the Python API for Arrow and the leaf libraries that add functionality, such as reading Apache Parquet files into Arrow structures.

.. toctree::
   :maxdepth: 2
   :caption: Getting Started

   install
   development
   memory
   data
   ipc
   filesystems
   plasma
   pandas
   parquet
   extending
   api
   getting_involved