The Watcher

prologic

twtxt.net

@movq Yes

prologic

twtxt.net

@movq Yes

prologic

twtxt.net

@kat What do you use for this btw? 🤔

prologic

twtxt.net

@kat What do you use for this btw? 🤔

prologic

twtxt.net

05 Jan 25 13:44 UTC

So I need to figure out how to block ASN(s)...

Additionally, I' thinking of; How to detect DDoS attachs?

Here's one way I've come up that's quite simple:

> Detecting DDoS attacks by tracking requests across multiple IPs in a sliding window. If total requests exceed a threshold in a given time, flag as potential DDoS.

prologic

twtxt.net

05 Jan 25 13:44 UTC

prologic

twtxt.net

05 Jan 25 10:49 UTC

prologic

twtxt.net

05 Jan 25 10:49 UTC

prologic

twtxt.net

05 Jan 25 10:36 UTC

@lyse Cool 👌

prologic

twtxt.net

05 Jan 25 10:36 UTC

@lyse Cool 👌

prologic

twtxt.net

05 Jan 25 10:35 UTC

Hmmm so I've sustained two DDoS attacks on my Gitea server today. A few hours apar. Still analyzing the traffic...

prologic

twtxt.net

05 Jan 25 10:35 UTC

Hmmm so I've sustained two DDoS attacks on my Gitea server today. A few hours apar. Still analyzing the traffic...

prologic

twtxt.net

05 Jan 25 06:09 UTC

For the time being... I've just blocked all of OpenAI(s) Bots. They (_thankfully_) publish a JSON endpoint that you can use to block all OpenAI crawlers from reaching your server (_in my case, blocking it at the edge_). Example:


proxy-1:~# curl -qs https://openai.com/gptbot.json | jq -r '.prefixes[].ipv4Prefix' | xargs -I{} ./block-ip.sh {}

Where block-ip.sh is simply:


#!/bin/sh

ufw insert 1 deny from "$1" to any

prologic

twtxt.net

05 Jan 25 06:09 UTC


proxy-1:~# curl -qs https://openai.com/gptbot.json | jq -r '.prefixes[].ipv4Prefix' | xargs -I{} ./block-ip.sh {}

Where block-ip.sh is simply:


#!/bin/sh

ufw insert 1 deny from "$1" to any

prologic

twtxt.net

05 Jan 25 05:44 UTC

@aelaraji Yes! 👏 This is exactly what it is! 🤣 I will of course soon™ be hosting this service, likely at validator.twtxt.net 😅😅

prologic

twtxt.net

05 Jan 25 05:44 UTC

@aelaraji Yes! 👏 This is exactly what it is! 🤣 I will of course soon™ be hosting this service, likely at validator.twtxt.net 😅😅

prologic

twtxt.net

05 Jan 25 04:56 UTC

@kat Haha 🤣 If someone figures this out, please let me know 🙏🙏 -- In the meantime, I'm going to very soon™ write a daemon that will watch the audit log for repeated violations and add to the network firewall.

prologic

twtxt.net

05 Jan 25 04:56 UTC

prologic

twtxt.net

05 Jan 25 04:55 UTC

This is better:


proxy-1:~# ./audit-log-by-ip.sh 4.227.36.76 | coraza-log-formatter -m -
2025/01/04 23:17:04 4.227.36.76 58982 GET /external?aff-HY0BLO=&f=mediaonly&f=noreplies&nick=g1n&uri=https%3A%2F%2Fthe-president-codes.linegames.org null 0  On OWASP_CRS/4.7.0
Actionset: OWASP_CRS/4.7.0
Message: Bad User Agent
Severity: 0
Raw: SecRule REQUEST_HEADERS:User-Agent "@pmFromFile /etc/caddy/waf/bad_user_agents.txt" "id:2000,log,phase:1,deny,msg:'Bad User Agent'"

prologic

twtxt.net

05 Jan 25 04:55 UTC

This is better:


proxy-1:~# ./audit-log-by-ip.sh 4.227.36.76 | coraza-log-formatter -m -
2025/01/04 23:17:04 4.227.36.76 58982 GET /external?aff-HY0BLO=&f=mediaonly&f=noreplies&nick=g1n&uri=https%3A%2F%2Fthe-president-codes.linegames.org null 0  On OWASP_CRS/4.7.0
Actionset: OWASP_CRS/4.7.0
Message: Bad User Agent
Severity: 0
Raw: SecRule REQUEST_HEADERS:User-Agent "@pmFromFile /etc/caddy/waf/bad_user_agents.txt" "id:2000,log,phase:1,deny,msg:'Bad User Agent'"

prologic

twtxt.net

05 Jan 25 04:43 UTC

Nice! I wrote another useful tool 👌


proxy-1:~# ./audit-log-by-ip.sh 4.227.36.76 | coraza-log-formatter -m -
Actionset: OWASP_CRS/4.7.0
Message: Bad User Agent
Severity: 0
Raw: SecRule REQUEST_HEADERS:User-Agent "@pmFromFile /etc/caddy/waf/bad_user_agents.txt" "id:2000,log,phase:1,deny,msg:'Bad User Agent'"

prologic

twtxt.net

05 Jan 25 04:43 UTC

Nice! I wrote another useful tool 👌


proxy-1:~# ./audit-log-by-ip.sh 4.227.36.76 | coraza-log-formatter -m -
Actionset: OWASP_CRS/4.7.0
Message: Bad User Agent
Severity: 0
Raw: SecRule REQUEST_HEADERS:User-Agent "@pmFromFile /etc/caddy/waf/bad_user_agents.txt" "id:2000,log,phase:1,deny,msg:'Bad User Agent'"

prologic

twtxt.net

05 Jan 25 04:07 UTC

How in da fuq do you _actually_ make these fucking useless AI bots go way?


proxy-1:~# jq '. | select(.request.remote_ip=="4.227.36.76")' /var/log/caddy/access/mills.io.log | jq -s '. | last' | caddy-log-formatter -
4.227.36.76 - [2025-01-05 04:05:43.971 +0000] "GET /external?aff-QNAXWV=&f=mediaonly&f=noreplies&nick=g1n&uri=https%3A%2F%2Fmy-hero-ultra-impact-codes.linegames.org HTTP/2.0" 0 0
proxy-1:~# date
Sun Jan  5 04:05:49 UTC 2025

😱

prologic

twtxt.net

05 Jan 25 04:07 UTC

How in da fuq do you _actually_ make these fucking useless AI bots go way?


proxy-1:~# jq '. | select(.request.remote_ip=="4.227.36.76")' /var/log/caddy/access/mills.io.log | jq -s '. | last' | caddy-log-formatter -
4.227.36.76 - [2025-01-05 04:05:43.971 +0000] "GET /external?aff-QNAXWV=&f=mediaonly&f=noreplies&nick=g1n&uri=https%3A%2F%2Fmy-hero-ultra-impact-codes.linegames.org HTTP/2.0" 0 0
proxy-1:~# date
Sun Jan  5 04:05:49 UTC 2025

😱

prologic

twtxt.net

Done.

prologic

twtxt.net

Done.

prologic

twtxt.net

@lyse Oh good! It works haha 🤣 I'll bump it up a bit 👌

prologic

twtxt.net

@lyse Oh good! It works haha 🤣 I'll bump it up a bit 👌

prologic

twtxt.net

04 Jan 25 23:39 UTC

And now I've applied rate limits on every site to reasonable values 👌

prologic

twtxt.net

04 Jan 25 23:39 UTC

And now I've applied rate limits on every site to reasonable values 👌

prologic

twtxt.net

04 Jan 25 22:13 UTC

@bender Isn't that why um yarning my progress 🤣

prologic

twtxt.net

04 Jan 25 22:13 UTC

@bender Isn't that why um yarning my progress 🤣

prologic

twtxt.net

04 Jan 25 16:16 UTC

@kat I've actually moved most of my stuff of of Cloudflare now 🤣 I'm actually very happy with my edge proxy setup that reverse proxies, caches and acts as a web application firewall 🥳

prologic

twtxt.net

04 Jan 25 16:16 UTC

@kat I've actually moved most of my stuff of of Cloudflare now 🤣 I'm actually very happy with my edge proxy setup that reverse proxies, caches and acts as a web application firewall 🥳

prologic

twtxt.net

04 Jan 25 16:15 UTC

@kat Have you seen the SSG that I built and use on all my static sites? zs 🤔

prologic

twtxt.net

04 Jan 25 16:15 UTC

@kat Have you seen the SSG that I built and use on all my static sites? zs 🤔

prologic

twtxt.net

04 Jan 25 16:14 UTC

Oh gawd. I can't enable caching on my edge proxy everywhere 😱 Some shit™ doesn't deal with a caching reverse proxy in front of it very well for some reason I don't have time to dig into right now 🤔

prologic

twtxt.net

04 Jan 25 16:14 UTC

prologic

twtxt.net

04 Jan 25 15:50 UTC

What's a reasonable per second or per minute rate limit that I could apply in general at my edge proxy for all clients? (_no matter what_) ... LIke a good reasonable upper bound? 🤔

prologic

twtxt.net

04 Jan 25 15:50 UTC

What's a reasonable per second or per minute rate limit that I could apply in general at my edge proxy for all clients? (_no matter what_) ... LIke a good reasonable upper bound? 🤔

prologic

twtxt.net

04 Jan 25 14:30 UTC

@movq Yeah I swear to god the engineers that write this shit™ don't know how to write distributed cralwers that don't happy the shit™ out of their targets 🤦‍♂️

prologic

twtxt.net

04 Jan 25 14:30 UTC

@movq Yeah I swear to god the engineers that write this shit™ don't know how to write distributed cralwers that don't happy the shit™ out of their targets 🤦‍♂️

prologic

twtxt.net

04 Jan 25 12:48 UTC

@doesnm No. I generally don't put up any robots.txt files at all really, because they mostly get ignored. I don't generally mind if "normal" web crawlers crawl things. But LLM(s) can go fuck themselves 🤣

prologic

twtxt.net

04 Jan 25 12:48 UTC

prologic

twtxt.net

04 Jan 25 12:26 UTC

@movq Yeah it's starting to piss me off too 🤣 Not nearly as much as that guy, but stil. Anyway I'm having fun! Now I just need to find a good IP/Subnet list that I can blacklist entirely, ideally one that's updated frequently so I can refresh firewall rules.

prologic

twtxt.net

04 Jan 25 12:26 UTC

prologic

twtxt.net

04 Jan 25 12:09 UTC

Bloody fucking hell. I _think_ one of Google's GenAI crawlers was just hitting my Gitea instance quite hard. Fuck 🤬 Geez

prologic

twtxt.net

04 Jan 25 12:09 UTC

Bloody fucking hell. I _think_ one of Google's GenAI crawlers was just hitting my Gitea instance quite hard. Fuck 🤬 Geez

prologic

twtxt.net

@movq Oh 🤦‍♂️

prologic

twtxt.net

@movq Oh 🤦‍♂️

prologic

twtxt.net

I just banned 41 bad user agents from accessing any of my services. 😱

prologic

twtxt.net

I just banned 41 bad user agents from accessing any of my services. 😱

prologic

twtxt.net

04 Jan 25 11:18 UTC

@movq How do you manage to get those skulines on your photos? 🤔

prologic

twtxt.net

04 Jan 25 11:18 UTC

@movq How do you manage to get those skulines on your photos? 🤔

prologic

twtxt.net

04 Jan 25 07:32 UTC

@doesnm No, it's only designed for yarnd. What did you have in mind here? 🤔

prologic

twtxt.net

04 Jan 25 07:32 UTC

@doesnm No, it's only designed for yarnd. What did you have in mind here? 🤔

prologic

twtxt.net

04 Jan 25 07:06 UTC

@doesnm It is the same API that yarnc the command-line client uses.

prologic

twtxt.net

04 Jan 25 07:06 UTC

@doesnm It is the same API that yarnc the command-line client uses.

prologic

twtxt.net

04 Jan 25 02:50 UTC

i.e: Not much point in running a WAF on a static site. But OTOH if there's enough abuse from shitty assholes, there might be 🤔🤔

prologic

twtxt.net

04 Jan 25 02:50 UTC

i.e: Not much point in running a WAF on a static site. But OTOH if there's enough abuse from shitty assholes, there might be 🤔🤔

prologic

twtxt.net

04 Jan 25 02:49 UTC

I'm just basically learning now how ModSecurity rules work and how to write my own.

The builtin OWASP rules are already working nicely 👌 -- And yeah I won't include the WAF on every site block, probably just my main/primary domain where I tend to run demo services and other things.

prologic

twtxt.net

04 Jan 25 02:49 UTC

prologic

twtxt.net

04 Jan 25 02:48 UTC

@kat If you've been following my yarns the other day about me getting off of Clownflare and building my own WAF, Proxy and effectively my own Edge network, you'll know I'm doing this at the very edge 🤣🤣

prologic

twtxt.net

04 Jan 25 02:48 UTC

prologic

twtxt.net

04 Jan 25 02:22 UTC

Having a lot of fun with Coraza today. A Web Application Firewall library written in Go that also happens to have a Caddy module.

prologic

twtxt.net

04 Jan 25 02:22 UTC

Having a lot of fun with Coraza today. A Web Application Firewall library written in Go that also happens to have a Caddy module.

prologic

twtxt.net

04 Jan 25 02:21 UTC

@bender Hey ! 👋

prologic

twtxt.net

04 Jan 25 02:21 UTC

@bender Hey ! 👋

prologic

twtxt.net

04 Jan 25 01:14 UTC

@eapl.me And here I always lived by:

> Problems are solved by method.
-- Dr. Don Abel.

prologic

twtxt.net

04 Jan 25 01:14 UTC

@eapl.me And here I always lived by:

> Problems are solved by method.
-- Dr. Don Abel.

prologic

twtxt.net

04 Jan 25 01:05 UTC

🥱 morning y'all 👋 Soo tired 🥱 Need coffee!!! ☕️☕️☕️☕️

prologic

twtxt.net

04 Jan 25 01:05 UTC

🥱 morning y'all 👋 Soo tired 🥱 Need coffee!!! ☕️☕️☕️☕️

prologic

twtxt.net

04 Jan 25 01:02 UTC

@lyse It does not 🤣 Shsll I enable it? 🤣

prologic

twtxt.net

04 Jan 25 01:02 UTC

@lyse It does not 🤣 Shsll I enable it? 🤣

prologic

twtxt.net

03 Jan 25 16:38 UTC

@bender It's true! 🤣 It's a total garbage nonsense title. But the actual research paper that the video references is real. Apple did in fact do a bunch of research and proved what we already know 🤣 -- That is, AI is stupid 🤣

prologic

twtxt.net

03 Jan 25 16:38 UTC

prologic

twtxt.net

03 Jan 25 16:14 UTC

@movq Amend 🙏

prologic

twtxt.net

03 Jan 25 16:14 UTC

@movq Amend 🙏

prologic

twtxt.net

03 Jan 25 14:20 UTC

But to be fair, we already knew this... I've observed it first hand, we knew it at the beginning. I'll just leave you with this:

> Stochastic Parrot

or put simply:

> Artificial Incompetence

prologic

twtxt.net

03 Jan 25 14:20 UTC

But to be fair, we already knew this... I've observed it first hand, we knew it at the beginning. I'll just leave you with this:

> Stochastic Parrot

or put simply:

> Artificial Incompetence

prologic

twtxt.net

03 Jan 25 14:18 UTC

Apple DROPS AI BOMBSHELL: LLMS CANNOT Reason - YouTube

prologic

twtxt.net

03 Jan 25 14:18 UTC

Apple DROPS AI BOMBSHELL: LLMS CANNOT Reason - YouTube

prologic

twtxt.net

03 Jan 25 10:18 UTC

@movq Fuxking awesome 🙃😍

prologic

twtxt.net

03 Jan 25 10:18 UTC

@movq Fuxking awesome 🙃😍

prologic

twtxt.net

03 Jan 25 07:01 UTC

@movq Yup! 😅

prologic

twtxt.net

03 Jan 25 07:01 UTC

@movq Yup! 😅

prologic

twtxt.net

I can walk you through some examples later tonight when I get back if you like?

prologic

twtxt.net

I can walk you through some examples later tonight when I get back if you like?

prologic

twtxt.net

A pointer is basically a reference to a variable. It is typically used with structs and especially in pointer receiver methods so that you can modify fields of a struct.

prologic

twtxt.net