NLP with Python! Term Frequency-Inverse Document Frequency (tf-idf)
Adrian Dolinay
Tutorial on calculating the term frequency inverse document frequency from scratch. Learn how to preprocess a corpus of documents and create functions for the term frequency, inverse document frequency and combine them to calculate the tf-idf.
Stop Words Removal Tutorial - https://www.youtube.com/watch?v=0D7ae7OaaHQ&t=3s Bag of Words Tutorial - https://www.youtube.com/watch?v=0D7ae7OaaHQ&t=3s
The notebook can be found in the "NLP" folder within the below repo. GitHub Repo - https://github.com/ad17171717/YouTube-Tutorials
CONNECT: LinkedIn: https://www.linkedin.com/in/adrian-dolinay-frm-96a289106/ GitHub: https://github.com/ad17171717 Twitter: https://twitter.com/DolinayG Odysee: https://odysee.com/@adriandolinay:0
|-Video Chapters-| 0:00 - Intro 0:19 - Reviewing the data set 0:32 - Preprocessing text data 2:11 - Term frequency (tf) function 7:35 - Inverse document frequency (idf) function 11:59 - Term Frequency-Inverse Document Frequency (tf-idf) function 13:55 - Saving the tf-idf into a DataFrame 14:28 - Comparing our function to Scikit-learn 16:26 - References and additional learning ... https://www.youtube.com/watch?v=iHeOrDH3v1k
60461706 Bytes