eyedeekay
Holy cow, scrapers did not waste any time. And in-network too.
orignal
AI shit?
orignal
I got tired of banning them
eyedeekay
Yeah looks like it, they were eating gitlab alive too, and are also pounding on the mirrors
eyedeekay
But they're fucking up the whole internet so why should gitea be special I guess
eyedeekay
Fortunately for me I've got way more options with gitea than gitlab to mess with them
eyedeekay
Going to try a honeypot thing soon, they're looking at stuff that isn't real and they're doing it pretty early, so if somebody tries to look for like, acme stuff in .well-known then I can pretty much conclude they're a bot and ban them in real-ish time
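A minimal sketch of the honeypot idea, assuming a Go HTTP service: any request for a path the service never serves marks the client as a bot, and later requests from that client are refused. The decoy list, the `banList` type, and the helper names are hypothetical, not taken from any real deployment. In-network, the "client" would be an I2P destination rather than an IP, so `clientID` falls back to the raw address string when there is no port to strip.

```go
package main

import (
	"net"
	"net/http"
	"strings"
	"sync"
)

// decoyPrefixes are paths this hypothetical service never serves, so any
// request for them is treated as a bot probe. The list is an assumption.
var decoyPrefixes = []string{
	"/.well-known/acme-challenge/",
	"/.env",
}

// isDecoy reports whether path falls under one of the decoy prefixes.
func isDecoy(path string) bool {
	for _, p := range decoyPrefixes {
		if strings.HasPrefix(path, p) {
			return true
		}
	}
	return false
}

// banList is a concurrency-safe set of banned client identifiers.
type banList struct {
	mu     sync.Mutex
	banned map[string]bool
}

func newBanList() *banList { return &banList{banned: make(map[string]bool)} }

func (b *banList) ban(id string) {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.banned[id] = true
}

func (b *banList) isBanned(id string) bool {
	b.mu.Lock()
	defer b.mu.Unlock()
	return b.banned[id]
}

// clientID strips the port from a host:port address. Addresses without a
// port (for example an I2P destination) are returned unchanged.
func clientID(remoteAddr string) string {
	host, _, err := net.SplitHostPort(remoteAddr)
	if err != nil {
		return remoteAddr
	}
	return host
}

// honeypot bans any client that touches a decoy path and answers every
// banned client with 403; everyone else reaches next.
func honeypot(b *banList, next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		id := clientID(r.RemoteAddr)
		if isDecoy(r.URL.Path) {
			b.ban(id)
		}
		if b.isBanned(id) {
			http.Error(w, "forbidden", http.StatusForbidden)
			return
		}
		next.ServeHTTP(w, r)
	})
}
```

Wrapping the real handler as `honeypot(newBanList(), mux)` would be enough to try it. Bans here live only in memory and never expire, which is probably too blunt for clients that rotate identities.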
StormyCloud
Dooooo it nuke them all
orignal
no, recognize them and redirect to pages with shit
orignal
let this crap learn on crap
eyedeekay
Whatever gets them out of my hair
eyedeekay
Since I'm doing the I2P side by plugging in SAMv3 Listeners, I can pretty much put any Listener middleware in between the service and the net, including ones I write, so I have a lot of strategic agility here
orignal
this shit must be eliminated at all costs
orignal
if AI is as smart as these fucking idiots declare, why does it have to abuse the sites made by people for people
orignal
not enough material in the fucking facebook database?
orignal
assholes
orignal
thieves
snex
create a black hole and redirect them to that so they stay stuck connected to you forever
orignal
that's what I suggest
orignal
or let them learn on completely useless garbage
orignal
Zuckerberg and Altman must eat their own shit
eyedeekay
As long as I don't waste I2P network resources on it I'm going to be as harsh as possible with them. If I can just trickle them a stream of garbage that's a worthwhile plan.
eyedeekay
But job one right now is giving the service a way to reliably spot them.
snex
respond with a "file" that is just /dev/urandom at 1B/s
eyedeekay
Described above is plan A^
snex
a while back i saw an article on how to run an ssh server that prevents the client from ever disconnecting but i cant seem to find it
eyedeekay
I remember that
RN
ohhhh, niiiiiice
eyedeekay
Maybe I'll give it the ability to trickle out some... "random" .gif from somewhere, at 1B/s
eyedeekay
user-supplied of course
dr|z3d
from someone we know, eyedeekay: ramble.i2p/f/Tech/5684/open-source-devs-say-ai-crawlers-dominate-traffic-forcing
eyedeekay
Yeah I read that. Also experienced it, they're doing basically the same thing to all the stuff we host AFAICT
eyedeekay
I had occasion to look at the logs of one of the mirrors yesterday, it's wild
dr|z3d
any obvious user agents?
eyedeekay
No not really, but it's obvious by what they're downloading, you'll see one IP from AWS download like, I2P 1.8.1 and an old version of imule and the source code for an old version of android
eyedeekay
stuff that nobody seeks out organically
orignal
fail2ban solves the problem
eyedeekay
More-or-less yeah, but they rotate identities pretty quickly too
eyedeekay
Oh I see planet.i2p looking for RSS feeds, I'll get new addresses for those
dr|z3d
nginx request throttling is probably worth deploying.
dr|z3d
shouldn't impact normal users, but bots can get throttled to a single request per minute, or longer, once they hit a certain threshold.
snex
have you all looked at anubis?
snex
proof-of-work captcha thingy
dr|z3d
that's what I linked earlier, no?
snex
maybe?
dr|z3d
> from someone we know, eyedeekay: ramble.i2p/f/Tech/5684/open-source-devs-say-ai-crawlers-dominate-traffic-forcing
snex
ive seen it in several places
snex
now if only you could make anubis challenges count as monero blocks...
snex
at the very least they need to let you easily theme it yourself. that tan page is ugly af