1
0
Fork 0
A tool for scraping files from imageboards’ threads.
Go to file
Alexander Andreev cdcc184de8
Lolifox removed. Development Status classifier is changed to Alpha. Python 3.7 classifier left to represent oldest supported version.
2021-05-03 02:43:49 +04:00
scrapthechan Copyright, date, and version are updated. 2021-05-03 02:41:32 +04:00
.gitignore Initial commit with all the files. 2020-07-08 22:53:39 +04:00
CHANGELOG.md Changelog update for 0.4.1. 2021-04-28 02:49:26 +04:00
COPYING Initial commit with all the files. 2020-07-08 22:53:39 +04:00
Makefile Version in a Makefile is changed to 0.4.1. 2021-04-28 02:50:38 +04:00
README.md A little fix for README. 2020-11-25 03:36:31 +04:00
setup.cfg Lolifox removed. Development Status classifier is changed to Alpha. Python 3.7 classifier left to represent oldest supported version. 2021-05-03 02:43:49 +04:00
setup.py Initial commit with all the files. 2020-07-08 22:53:39 +04:00

README.md

This is a tool for scraping files from imageboards' threads.

It extracts the files from a JSON representation of a thread. And then downloads 'em in a specified output directory or if it isn't specified then creates following directory hierarchy in a working directory:

<imageboard name>
|-<board name>
  |-<thread>
    |-[!op.txt]
    |-...
  |-...

Usage

scrapthechan [OPTIONS] (<url> | <imageboard> <board> <thread>)

<url> -- URL of a thread.

<imageboard> <board> <thread> -- imageboard name, board name and thread ID separately. E.g. 4chan b 1100500.

-o, --output-dir -- output directory where all files will be dumped to.

-N, --no-op -- by default OP's post will be saved in a !op.txt file. This flag disables this behaviour. An exclamation mark ! in a name is for so this file will be on the top of a directory listing.

-v, --version prints the version of the program.

-h, --help prints help for a program.

Supported imageboards