×

Loading...

Topic

This topic has been archived. It cannot be replied.
  • 工作学习 / IT技术讨论 / Help: How to get pure text from HTML page?
    I don't want to write my own codes to get rid of all tags because that's not accurate. Is there any easy way to do this job?
    Thanks a lot.
    • it is easy, open your html file in browse, seelct all text in the brwose, paste then into txt file
      • code! i want to make it work in automation.
        • try perl module, i remember there is one perl module can do this. but i don't have my perl book with me, check www.perl.com
        • goto www.yahoo.com , find the keyword html2txt
    • sed 's/<[^>]*>//g' input.html > out.txt