Demonstrating PyIceberg
Tabular
In this video, we demonstrate how to use the PyIceberg CLI. For the demo, we use the docker-spark-iceberg setup that's available here: https://github.com/tabular-io/docker-spark-iceberg You can also read the companion article at https://tabular.medium.com/reading-apache-iceberg-from-python-with-pyiceberg-8b8cff36f4f0
First, we create a table using Spark through the Jupyter notebook.
Next, we browse the catalog using the pyiceberg
CLI. We install pyiceberg from pip using pip install "pyiceberg[pyarrow]"
.
For a complete overview of all the installation options, please refer to the documentation: https://py.iceberg.apache.org/
Next we demonstrate several commands like list, describe, and files to retrieve information about the iceberg tables. In the end, we show how easy it is to accidentally drop a table using the CLI.
If there are any questions, please reach out using the Iceberg Slack: https://iceberg.apache.org/community/
or open an issue or pull request on Github https://github.com/apache/iceberg ... https://www.youtube.com/watch?v=XNoUXO_yucM
15753901 Bytes