Jump to content
IGNORED

Wget


Dementia

Recommended Posts

Anyone ever use Wget or something similar to crawl an online journal database?

 

What if I wanted to extract every single .pdf and have them placed neatly into sub-directories, organized by journal name?

Link to comment
Share on other sites

wget -r -np -A.pdf whatever-the-url-is

 

Erm possibly - haven't used it in years, don't blame me if you end up downloading the whole internet ... I seem to remember something about robots.txt preventing crawling (mass downloading) on sites as well

Link to comment
Share on other sites

Archived

This topic is now archived and is closed to further replies.

  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.