Data Science with Python! Analyzing File Types from Avro to Stata
Adrian Dolinay
A tutorial comparing the write times, read times, and file sizes of various file types using Python and pandas.
The notebook is in the "Data Science with Python" folder of the repo below. GitHub Repo: https://github.com/ad17171717/YouTube-Tutorials
CONNECT:
LinkedIn: https://www.linkedin.com/in/adrian-dolinay-frm-96a289106/
GitHub: https://github.com/ad17171717
Twitter: https://twitter.com/DolinayG
Odysee: https://odysee.com/@adriandolinay:0
------Video Chapters------
0:00 - Intro
0:04 - Installing required packages
0:47 - Overview of the file types
2:44 - Text vs binary files
7:39 - Corrupted text vs binary files
10:41 - Reading stock data into a pandas DataFrame
11:22 - Timing the write times of different file types
19:39 - Timing the read times of different file types
23:59 - Analyzing the size of different file types
28:24 - Timing the write/read times as the number of rows grows
35:11 - Timing the write/read times as the number of columns grows
37:26 - Issues with writing to SAS
38:59 - References and additional learning
...
https://www.youtube.com/watch?v=7FX6_p5Qafg
131629408 Bytes
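
For readers who want to try the kind of comparison the chapters describe (write time, read time, and file size for a text versus a binary format), here is a minimal sketch. It is not the notebook's code: it uses only the Python standard library (csv and pickle) rather than pandas, the data is synthetic, and the function name time_write_read and the file names are made up for illustration.

```python
import csv
import os
import pickle
import tempfile
import time


def time_write_read(rows, directory):
    """Write the same rows as CSV (text) and pickle (binary).

    Returns a dict of write/read timings in seconds, file size in bytes,
    and the number of rows read back, keyed by format name.
    """
    results = {}

    # Text format: CSV. Values come back as strings when read.
    csv_path = os.path.join(directory, "sample.csv")
    start = time.perf_counter()
    with open(csv_path, "w", newline="") as f:
        csv.writer(f).writerows(rows)
    write_s = time.perf_counter() - start

    start = time.perf_counter()
    with open(csv_path, newline="") as f:
        csv_rows = list(csv.reader(f))
    read_s = time.perf_counter() - start

    results["csv"] = {
        "write_s": write_s,
        "read_s": read_s,
        "bytes": os.path.getsize(csv_path),
        "rows": len(csv_rows),
    }

    # Binary format: pickle. Python types are preserved on read.
    pkl_path = os.path.join(directory, "sample.pkl")
    start = time.perf_counter()
    with open(pkl_path, "wb") as f:
        pickle.dump(rows, f)
    write_s = time.perf_counter() - start

    start = time.perf_counter()
    with open(pkl_path, "rb") as f:
        pkl_rows = pickle.load(f)
    read_s = time.perf_counter() - start

    results["pickle"] = {
        "write_s": write_s,
        "read_s": read_s,
        "bytes": os.path.getsize(pkl_path),
        "rows": len(pkl_rows),
    }
    return results


# Synthetic rows (an assumption; the video uses stock data instead).
rows = [[i, i * 0.5, f"ticker_{i % 10}"] for i in range(50_000)]

with tempfile.TemporaryDirectory() as tmp:
    results = time_write_read(rows, tmp)

for name, stats in results.items():
    print(
        f"{name}: write {stats['write_s']:.4f}s, "
        f"read {stats['read_s']:.4f}s, {stats['bytes']} Bytes"
    )
```

Timings vary by machine and run, so treat a single run as a rough indication only. Repeating the measurement and raising the row count gives a steadier picture of how each format scales.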