Commit Graph

88 Commits

SHA1 Message Date
ac8aa3418a Merge branch 'scraper' into scraper-dev 2025-10-09 20:38:07 +02:00
3ead328a0f Merge branch 'master' into scraper 2025-10-09 20:37:55 +02:00
fb90935d42 Remove non image formats 2025-10-09 20:37:39 +02:00
889e9b69a8 Remove non image formats 2025-10-09 20:36:06 +02:00
66bef9279f Merge branch 'scraper' into scraper-dev 2025-10-09 20:07:14 +02:00
f55f118271 Merge branch 'master' into scraper 2025-10-09 20:07:02 +02:00
3279e8e3c0 Scan video formats as well 2025-10-09 20:06:51 +02:00
1dc17d7670 Don't create py34.Post for downloading images 2025-10-09 20:05:36 +02:00
    py34.Post does an extension scan, which is not required because the extension is saved inside the block data.
97a63937df Merge branch 'scraper' into scraper-dev 2025-10-09 19:25:28 +02:00
9247ab28ce Merge branch 'master' into scraper-dev 2025-10-09 19:25:17 +02:00
6d6e13c231 Merge branch 'master' into scraper 2025-10-09 19:25:12 +02:00
34753bc011 Fix typo 2025-10-09 19:25:05 +02:00
5cedbb2526 Merge branch 'scraper' into scraper-dev 2025-10-09 19:18:37 +02:00
a4e3ced8ab Download videos too, prioritize video over image 2025-10-09 19:18:26 +02:00
0b3293eaa8 Merge branch 'master' into scraper 2025-10-09 19:17:26 +02:00
f07f2ce32d Removed underscore from VIDEO_FORMATS 2025-10-09 19:17:11 +02:00
bf9237750b Merge branch 'scraper' into scraper-dev 2025-10-03 00:09:36 +02:00
5473f92470 Merge branch 'master' into scraper 2025-10-03 00:09:06 +02:00
ee4ba5d5f2 Create file after the image was retrieved 2025-10-02 23:40:33 +02:00
    `post.get_image_data()` can throw; creating the file first and calling the function afterwards can leave empty "image" files that confuse the program.
bc28807aeb Fixed a typo 2025-10-02 23:36:10 +02:00
a3dca99a5d Merge branch 'scraper' into scraper-dev 2025-09-23 12:44:52 +02:00
217595fcdd Download and cache images 2025-09-23 12:44:36 +02:00
bb7d010694 Merge branch 'scraper' into scraper-dev 2025-09-23 02:27:13 +02:00
be6740a24c Added api route for getting raw block 2025-09-23 02:24:57 +02:00
ee236445c2 Made single threaded, added back delay 2025-08-28 21:38:32 +02:00
71efaf7c25 Merge branch 'master' into scraper 2025-08-27 19:22:17 +02:00
1e8e8e9b40 Sleep causes threads to hang (python internal issue) 2025-08-27 19:22:06 +02:00
f0efc80fd8 Removed sleeping between service overload attempts. 2025-08-27 19:06:31 +02:00
    https://stackoverflow.com/questions/47147328/thread-wait-for-tstate-lock-never-returns#55643196
54fc8fc213 Merge branch 'scraper' into scraper-dev 2025-08-27 17:47:59 +02:00
b81684de19 Merge branch 'master' into scraper 2025-08-27 17:47:50 +02:00
e7a8dcfcb3 Disambiguate image from video servers 2025-08-27 17:47:42 +02:00
64908f298e Added verbosity to requests 2025-08-27 17:10:17 +02:00
f9fe6af292 Merge branch 'master' into scraper 2025-08-26 18:20:06 +02:00
461031aa3a Exit retry loop 2025-08-26 18:19:56 +02:00
6d1b1628f1 Merge branch 'master' into scraper 2025-08-26 18:15:14 +02:00
dd4447f756 Get response not body 2025-08-26 18:15:03 +02:00
0143cc3999 Merge branch 'master' into scraper 2025-08-26 18:02:07 +02:00
a2bb0e38cf Don't die on missing thumbnails 2025-08-26 18:01:53 +02:00
a39bfcd0b1 Merge branch 'master' into scraper 2025-08-15 19:32:33 +02:00
31b7ce76df Fixed invalid function name 2025-08-15 19:32:27 +02:00
826c5c3473 Merge branch 'master' into scraper 2025-08-15 19:31:40 +02:00
965384c3f2 Fixed invalid function names 2025-08-15 19:31:33 +02:00
265ec8d58e Merge branch 'master' into scraper 2025-08-15 19:30:42 +02:00
6dddf7985c Fixed missing "self" 2025-08-15 19:30:33 +02:00
241bc90e82 Merge branch 'master' into scraper 2025-08-15 19:28:08 +02:00
081efaeb9f Added more http methods to scraper 2025-08-15 19:27:21 +02:00
bdf77b2920 Merge remote-tracking branch 'refs/remotes/origin/scraper' into scraper 2025-08-15 18:48:23 +02:00
5f4358a3d1 Probe with HEAD requests, not GET 2025-08-15 18:47:58 +02:00
8e3a7b105a Fixed invalid extension check 2025-08-10 15:50:40 +02:00
d0dec584a8 Removed debug print 2025-08-10 15:49:03 +02:00