This is a tool for scraping files from imageboards' threads.
It extracts the files from the JSON version of a thread and then downloads them into the specified output directory. If no output directory is specified, it creates the following directory hierarchy in the working directory:
<imageboard name>
  |-<board name>
    |-<thread>
      |-[!op.txt]
      |-...
      |-...
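The extraction step and the default directory layout described above can be sketched roughly as follows. This is a minimal illustration and not scrapthechan's actual code. It assumes 4chan's public JSON format, where a post with an attachment carries `tim`, `ext` and `filename` fields and the file is served from `i.4cdn.org`. The function names `extract_files` and `default_output_dir` are hypothetical, and other imageboards use different JSON layouts.

```python
from pathlib import Path
from typing import List, Tuple


def extract_files(thread_json: dict, board: str) -> List[Tuple[str, str]]:
    """Return (original file name, download URL) pairs for a thread.

    Assumes 4chan-style JSON: {"posts": [{"tim": ..., "ext": ..., ...}]}.
    """
    files = []
    for post in thread_json.get("posts", []):
        # Posts without an attachment have no "tim" field; skip them.
        if "tim" not in post:
            continue
        name = f"{post['filename']}{post['ext']}"
        url = f"https://i.4cdn.org/{board}/{post['tim']}{post['ext']}"
        files.append((name, url))
    return files


def default_output_dir(imageboard: str, board: str, thread: str) -> Path:
    """Build <imageboard name>/<board name>/<thread> under the working directory."""
    return Path.cwd() / imageboard / board / thread
```

Keeping the extraction separate from the downloading means the parsing logic can be checked against a saved JSON file without touching the network.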
Usage
scrapthechan [OPTIONS] (<url> | <imageboard> <board> <thread>)
<url> -- URL of a thread.

<imageboard> <board> <thread> -- imageboard name, board name and thread ID given separately, e.g. 4chan b 1100500.

-o, --output-dir -- output directory where all files will be dumped to.

--no-op -- by default the OP's post is saved in a !op.txt file. This flag disables this behaviour. I decided to put an ! in the name so that this file appears at the top of a directory listing.

-v, --version prints the version of the program, and -h, --help prints help for the program.
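For illustration, a command line of this shape could be parsed with Python's standard `argparse` module along the following lines. This is only a sketch under assumptions: the real program may parse its arguments differently, the function name `parse_args` and the `target` attribute are hypothetical, and the version string is a placeholder.

```python
import argparse


def parse_args(argv):
    """Parse: scrapthechan [OPTIONS] (<url> | <imageboard> <board> <thread>)."""
    parser = argparse.ArgumentParser(prog="scrapthechan")
    # One positional means a thread URL; three mean imageboard, board, thread.
    parser.add_argument("target", nargs="+", metavar="TARGET",
                        help="<url> or <imageboard> <board> <thread>")
    parser.add_argument("-o", "--output-dir", default=None,
                        help="directory where all files will be dumped")
    parser.add_argument("--no-op", action="store_true",
                        help="do not save the OP's post to !op.txt")
    # Placeholder version string; -h/--help is added by argparse itself.
    parser.add_argument("-v", "--version", action="version",
                        version="%(prog)s (placeholder version)")
    args = parser.parse_args(argv)
    if len(args.target) not in (1, 3):
        parser.error("expected <url> or <imageboard> <board> <thread>")
    return args
```

With this sketch, `parse_args(["-o", "out", "4chan", "b", "1100500"])` yields three `target` values and `output_dir` set to `"out"`, while a single URL argument yields one `target` value.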
Supported imageboards
- 4chan.org since 0.1.0
- lainchan.org since 0.1.0
- 2ch.hk since 0.1.0
- 8kun.top since 0.2.2
- lolifox.cc since 0.3