Get text without #JavaScript scripts, #css styles and #html tags -- only the text of the page.
You can get cleared text on the link if the link is open and does not require authorization.
Or copy the entire source text of the page (page code) and clean it.