1
0
Fork 0
A tool for scraping files from imageboards’ threads.
Go to file
Alexander Andreev f3ef07af68
Rewrite of Parser class because it was fucked up. Now there's no problems with inheritance and its subclasses now more pleasant to write. ThreadNotFoundError now has a reason field.
2021-05-03 02:38:46 +04:00
scrapthechan Rewrite of Parser class because it was fucked up. Now there's no problems with inheritance and its subclasses now more pleasant to write. ThreadNotFoundError now has a reason field. 2021-05-03 02:38:46 +04:00
.gitignore Initial commit with all the files. 2020-07-08 22:53:39 +04:00
CHANGELOG.md Changelog update for 0.4.1. 2021-04-28 02:49:26 +04:00
COPYING Initial commit with all the files. 2020-07-08 22:53:39 +04:00
Makefile Version in a Makefile is changed to 0.4.1. 2021-04-28 02:50:38 +04:00
README.md A little fix for README. 2020-11-25 03:36:31 +04:00
setup.cfg In classifiers licence was fixed and added more topics related to a program. 2020-11-25 03:37:24 +04:00
setup.py Initial commit with all the files. 2020-07-08 22:53:39 +04:00

README.md

This is a tool for scraping files from imageboards' threads.

It extracts the files from a JSON representation of a thread. And then downloads 'em in a specified output directory or if it isn't specified then creates following directory hierarchy in a working directory:

<imageboard name>
|-<board name>
  |-<thread>
    |-[!op.txt]
    |-...
  |-...

Usage

scrapthechan [OPTIONS] (<url> | <imageboard> <board> <thread>)

<url> -- URL of a thread.

<imageboard> <board> <thread> -- imageboard name, board name and thread ID separately. E.g. 4chan b 1100500.

-o, --output-dir -- output directory where all files will be dumped to.

-N, --no-op -- by default OP's post will be saved in a !op.txt file. This flag disables this behaviour. An exclamation mark ! in a name is for so this file will be on the top of a directory listing.

-v, --version prints the version of the program.

-h, --help prints help for a program.

Supported imageboards