Talend Project - Parse a webpage (zacks.com) - Part 1/4
Sanjay Kattimani
#Talend #ETL This mini project in Talend uses several components to parse a webpage, extract only the text we need, and format it. In this example, we extract a stock's ratings as published by Zacks.com. We first download the tHTMLParse component from Talend Exchange, then use string formatting to pull the ratings out of the parsed text. Finally, the tNormalize component converts the ratings held in one row into 4 records.
The Java code that goes into the tJavaRow component is available at http://sanjaykattimani.blogspot.com/2016/12/talend-project-to-parse-webpage-zackscom.html
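The blog post linked above has the actual tJavaRow code. As a rough, standalone illustration of the idea only (not the code from the video), the sketch below pulls ratings out of a line of text and then splits the single delimited value into one record per rating, which is roughly what tNormalize does with a delimited column. The class name, method names, regular expression, and sample text are all assumptions made up for this example; the real page text and field names may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RatingsSketch {
    // Assumed text shape: a label, a colon, then a single letter grade A-F.
    private static final Pattern RATING =
            Pattern.compile("([A-Za-z]+)\\s*:\\s*([A-F])\\b");

    // String-formatting step: collect every "label: grade" pair
    // into one comma-delimited value such as "Value=B,Growth=A".
    static String extractRatings(String pageText) {
        StringBuilder out = new StringBuilder();
        Matcher m = RATING.matcher(pageText);
        while (m.find()) {
            if (out.length() > 0) {
                out.append(',');
            }
            out.append(m.group(1)).append('=').append(m.group(2));
        }
        return out.toString();
    }

    // Normalize step: turn the one delimited value into one
    // {label, grade} record per rating.
    static List<String[]> normalize(String delimited) {
        List<String[]> rows = new ArrayList<>();
        if (delimited.isEmpty()) {
            return rows;
        }
        for (String item : delimited.split(",")) {
            rows.add(item.split("=", 2));
        }
        return rows;
    }

    public static void main(String[] args) {
        // Made-up sample text, not real data from any site.
        String sample = "Value: B | Growth: A | Momentum: C | VGM: A";
        String joined = extractRatings(sample);
        for (String[] row : normalize(joined)) {
            System.out.println(row[0] + " -> " + row[1]);
        }
    }
}
```

Inside a Talend job, the extraction logic would live in tJavaRow and write the delimited string to an output column, leaving the row splitting to tNormalize; the normalize method here only imitates that step so the sketch runs on its own.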
Check out my son's videos on HTML basics - https://www.youtube.com/watch?v=K2coauDHAi8 ... https://www.youtube.com/watch?v=Zit7Pu_az8E
2021-12-31
0.0 LBC
Copyrighted (contact publisher)
33524369 Bytes