75 Commits

Author SHA1 Message Date
3ead328a0f Merge branch 'master' into scraper 2025-10-09 20:37:55 +02:00
fb90935d42 Remove non image formats 2025-10-09 20:37:39 +02:00
f55f118271 Merge branch 'master' into scraper 2025-10-09 20:07:02 +02:00
3279e8e3c0 Scan video formats aswell 2025-10-09 20:06:51 +02:00
1dc17d7670 Don't create py34.Post for downloading images
py34.Post does extension scan, that is not required as it is saved
inside the block data.
2025-10-09 20:05:36 +02:00
6d6e13c231 Merge branch 'master' into scraper 2025-10-09 19:25:12 +02:00
34753bc011 Fix typo 2025-10-09 19:25:05 +02:00
a4e3ced8ab Download videos too, prioritize video over image 2025-10-09 19:18:26 +02:00
0b3293eaa8 Merge branch 'master' into scraper 2025-10-09 19:17:26 +02:00
f07f2ce32d Removed underscore from VIDEO_FORMATS 2025-10-09 19:17:11 +02:00
5473f92470 Merge branch 'master' into scraper 2025-10-03 00:09:06 +02:00
ee4ba5d5f2 Create file after the image was retrieved
`post.get_image_data()` can throw, creating file and later calling the function can create empty "image" files, confusing the program.
2025-10-02 23:40:33 +02:00
bc28807aeb Fixed a typo 2025-10-02 23:36:10 +02:00
217595fcdd Download and cache images 2025-09-23 12:44:36 +02:00
be6740a24c Added api route for getting raw block 2025-09-23 02:24:57 +02:00
71efaf7c25 Merge branch 'master' into scraper 2025-08-27 19:22:17 +02:00
1e8e8e9b40 Sleep causes threads to hang (python internal issue) 2025-08-27 19:22:06 +02:00
b81684de19 Merge branch 'master' into scraper 2025-08-27 17:47:50 +02:00
e7a8dcfcb3 Disambiguate image from video servers 2025-08-27 17:47:42 +02:00
f9fe6af292 Merge branch 'master' into scraper 2025-08-26 18:20:06 +02:00
461031aa3a Exit retry loop 2025-08-26 18:19:56 +02:00
6d1b1628f1 Merge branch 'master' into scraper 2025-08-26 18:15:14 +02:00
dd4447f756 Get response not body 2025-08-26 18:15:03 +02:00
0143cc3999 Merge branch 'master' into scraper 2025-08-26 18:02:07 +02:00
a2bb0e38cf Don't die on missing thumbnails 2025-08-26 18:01:53 +02:00
a39bfcd0b1 Merge branch 'master' into scraper 2025-08-15 19:32:33 +02:00
31b7ce76df Fixed invalid function name 2025-08-15 19:32:27 +02:00
826c5c3473 Merge branch 'master' into scraper 2025-08-15 19:31:40 +02:00
965384c3f2 Fixed invalid function names 2025-08-15 19:31:33 +02:00
265ec8d58e Merge branch 'master' into scraper 2025-08-15 19:30:42 +02:00
6dddf7985c Fixed missing "self" 2025-08-15 19:30:33 +02:00
241bc90e82 Merge branch 'master' into scraper 2025-08-15 19:28:08 +02:00
081efaeb9f Added more http methods to scraper 2025-08-15 19:27:21 +02:00
bdf77b2920 Merge remote-tracking branch 'refs/remotes/origin/scraper' into scraper 2025-08-15 18:48:23 +02:00
5f4358a3d1 Probe with HEAD requests, not GET 2025-08-15 18:47:58 +02:00
8e3a7b105a Fixed invalid extension check 2025-08-10 15:50:40 +02:00
d0dec584a8 Removed debug print 2025-08-10 15:49:03 +02:00
fb06339cc7 Added flask 2025-08-10 15:26:14 +02:00
48d17fcf7b Merge branch 'master' into scraper 2025-08-09 17:23:49 +02:00
c1b8be46aa Initial scraper commit 2025-08-09 17:15:20 +02:00
6fc820b397 Added ListURL 2025-08-09 17:12:56 +02:00
68b6a505c1 Close handle when deleted 2025-08-09 17:00:56 +02:00
462d24ab5e Ignore removed posts 2025-08-09 17:00:38 +02:00
abdcef0a80 Fixed handle leak 2025-08-09 16:13:25 +02:00
a0e38071d8 When bot detection kicks in, timeout and reload CloudScraper 2025-08-09 05:12:15 +02:00
12038f9477 Fixed view identification 2025-08-09 04:31:37 +02:00
dddb0e9583 Fixed handle leak 2025-08-09 04:25:24 +02:00
c9928342c2 Added ViewURL 2025-08-09 04:24:37 +02:00
14ed66aad7 Added empty Post constructor 2025-08-09 04:24:16 +02:00
8f228bde36 Close response handles 2025-08-09 02:27:23 +02:00