A tool for scraping files from imageboards’ threads.
Go to file
2021-05-04 04:04:22 +04:00
scrapthechan Version changed to 0.5.1. 2021-05-04 03:58:02 +04:00
.gitignore Initial commit with all the files. 2020-07-08 22:53:39 +04:00
CHANGELOG.md Changelog updated with 0.5.1 changes. 2021-05-04 04:04:22 +04:00
COPYING Initial commit with all the files. 2020-07-08 22:53:39 +04:00
Makefile Version changed to 0.5.1 in a Makefile. 2021-05-04 03:58:46 +04:00
README.md README updated. lolifox.cc removed. Option --skip-posts added. 2021-05-03 02:45:41 +04:00
setup.cfg Lolifox removed. Development Status classifier is changed to Alpha. Python 3.7 classifier left to represent oldest supported version. 2021-05-03 02:43:49 +04:00
setup.py Initial commit with all the files. 2020-07-08 22:53:39 +04:00

This is a tool for scraping files from imageboards' threads.

It extracts the files from a JSON representation of a thread. And then downloads 'em in a specified output directory or if it isn't specified then creates following directory hierarchy in a working directory:

<imageboard name>
|-<board name>


scrapthechan [OPTIONS] (<url> | <imageboard> <board> <thread>)

<url> -- URL of a thread.

<imageboard> <board> <thread> -- imageboard name, board name and thread ID separately. E.g. 4chan b 1100500.

-o, --output-dir -- output directory where all files will be dumped to.

-N, --no-op -- by default OP's post will be saved in a !op.txt file. This flag disables this behaviour. An exclamation mark ! in a name is for so this file will be on the top of a directory listing.

-S <num>, --skip-posts <num> -- skip given number of posts.

-v, --version prints the version of the program.

-h, --help prints help for a program.

Supported imageboards


  • Sane rewrite of a program;
  • Thread watcher.