Python Readability - Strip Webpage of Junk Content - Linux CLI
gotbletu
gaminglearningtechnologyapplicationsarchlinuxbasedbashscriptsbloatbodyclicodingcommandlinecomputerconsolecontentcursescygwindebiandesktopdistributionsdistrofedoraframebuffergotbletuguihelphowtohtmlinterfacejunklinkslinuxlxmlmainmintncursesnewsopenopensuseoperatingprogrammingprogramspythonreadabilityscriptingshellsoftwarestripsystemterminaltermuxtexttuitutorialubuntuuserwebpagezsh
https://github.com/gotbletu/shownotes/blob/master/w3m-python-readability.md
https://pypi.org/project/readability-lxml/ readability-lxml Given a html document, it pulls out the main body text and cleans it up.
nodejs readability-cli version https://youtu.be/_j-p0z2AQp4 ... https://www.youtube.com/watch?v=qPiE1JUgsBg
2020-10-09
17.17505156 LBC
Copyrighted (contact publisher)
49185512 Bytes