Demonstrating PyIceberg

Tabular

description

In this video, we demonstrate how to use the PyIceberg CLI. For the demo, we use the docker-spark-iceberg setup (https://github.com/tabular-io/docker-spark-iceberg). You can also read the companion article at https://tabular.medium.com/reading-apache-iceberg-from-python-with-pyiceberg-8b8cff36f4f0

First, we create a table with Spark from the Jupyter notebook. Next, we browse the catalog using the pyiceberg CLI. We install pyiceberg with pip: pip install "pyiceberg[pyarrow]".

For a complete overview of all the installation options, please refer to the documentation: https://py.iceberg.apache.org/
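
For readers who prefer the Python API over the CLI, here is a minimal sketch of the same catalog browsing. It is not taken from the video. The connection properties are assumptions based on the usual docker-spark-iceberg defaults (REST catalog on port 8181, MinIO on port 9000, credentials admin/password), and the helper names connect and browse are made up for illustration.

```python
# Sketch: browse an Iceberg catalog from Python, similar to what the CLI lists.
# ASSUMPTION: the endpoints and credentials below match your local
# docker-spark-iceberg stack; adjust them if your setup differs.
ASSUMED_PROPERTIES = {
    "uri": "http://localhost:8181",
    "s3.endpoint": "http://localhost:9000",
    "s3.access-key-id": "admin",
    "s3.secret-access-key": "password",
}


def connect(name="default", properties=None):
    """Return a PyIceberg catalog (hypothetical helper).

    Requires: pip install "pyiceberg[pyarrow]"
    """
    # Imported lazily so the rest of the sketch works without the package.
    from pyiceberg.catalog import load_catalog

    return load_catalog(name, **(properties or ASSUMED_PROPERTIES))


def browse(catalog):
    """Map each namespace (dotted name) to the sorted names of its tables."""
    overview = {}
    for namespace in catalog.list_namespaces():
        # list_tables returns identifiers as tuples, e.g. ("nyc", "taxis").
        tables = catalog.list_tables(namespace)
        overview[".".join(namespace)] = sorted(".".join(t) for t in tables)
    return overview
```

With the Docker stack running, print(browse(connect())) should show the namespaces and tables created from the notebook.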

Next, we demonstrate several commands, such as list, describe, and files, to retrieve information about the Iceberg tables. At the end, we show how easy it is to accidentally drop a table using the CLI.
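
On the command line these steps look like pyiceberg list, pyiceberg describe nyc.taxis, pyiceberg files nyc.taxis, and pyiceberg drop table nyc.taxis, where nyc.taxis is a hypothetical table name. The sketch below shows rough Python API counterparts; the output is not identical to the CLI (for example, data_files only covers the current snapshot). The function names are illustrative, and the confirmation check before dropping is an added safeguard, not something PyIceberg provides.

```python
# Sketch: rough Python counterparts of the describe, files, and drop commands.
# `catalog` is a PyIceberg catalog, e.g. from pyiceberg.catalog.load_catalog.
# All function names here are hypothetical helpers, not PyIceberg APIs.


def describe(catalog, identifier):
    """Return the table schema and partition spec as text."""
    table = catalog.load_table(identifier)
    return {"schema": str(table.schema()), "spec": str(table.spec())}


def data_files(catalog, identifier):
    """Return the data file paths a full scan of the current snapshot reads."""
    table = catalog.load_table(identifier)
    return [task.file.file_path for task in table.scan().plan_files()]


def drop_table_checked(catalog, identifier, confirm):
    """Drop a table only if `confirm` repeats the identifier exactly.

    Returns True if the table was dropped, False if the check failed.
    """
    if confirm != identifier:
        return False
    catalog.drop_table(identifier)
    return True
```

Requiring the caller to repeat the identifier is one simple way to make an accidental drop less likely, since a dropped table cannot simply be loaded again from the catalog.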

If you have any questions, please reach out on the Iceberg Slack (https://iceberg.apache.org/community/) or open an issue or pull request on GitHub: https://github.com/apache/iceberg ... https://www.youtube.com/watch?v=XNoUXO_yucM

created

2023-04-15

File size

15753901 Bytes