Greasy Fork is available in English.
Fetch and download main content on current page, provide special support for chinese novel
Устаревшая версия за
Lightweight crawling script, used to download the main content of the webpage, theoretically suitable for any non-Ajax writing novel website, forum, etc. without writing any rules for this
The script will automatically retrieve the main content on the page and download it.
If you are in the novel directory page, it will traverse all the chapters and sort them and save them as TXT files.
https://xxx.xxx/book-**[20-99]**.html, https://xxx.xxx/book-**[01- 10]**.html
, which means download https://xxx.xxx/book-20.html to https://xxx.xxx/book-99.html, and https://xxx.xxx/book- 01.html to https://xxx.xxx/book-10.html, [1-10] does not add 0.l_chaptname>a
, but the text cannot be downloaded through the url. The text is downloaded by ajax through articlescontent. At this time, you can use @@articles@@articlescontent
to change the chapter. Replace articles in url with articlescontent, .l_chaptname>a@@articles@@articlescontent
can be adapted to this site. The first article can use regularity, for example, @@articles\d+@@$1content
means to replace "articles1", "articles2", etc. in the link with "1content" "2content".mask
, which means to delete the element whose class is mask