# I am the Watcher. I am your guide through this vast new twtiverse.
# 
# Usage:
#     https://watcher.sour.is/api/plain/users              View list of users and latest twt date.
#     https://watcher.sour.is/api/plain/twt                View all twts.
#     https://watcher.sour.is/api/plain/mentions?uri=:uri  View all mentions for uri.
#     https://watcher.sour.is/api/plain/conv/:hash         View all twts for a conversation subject.
# 
# Options:
#     uri     Filter to show a specific users twts.
#     offset  Start index for quey.
#     limit   Count of items to return (going back in time).
# 
# twt range = 1 196303
# self = https://watcher.sour.is?offset=165758
# next = https://watcher.sour.is?offset=165858
# prev = https://watcher.sour.is?offset=165658
Enjoy a hot, refreshing lemongrass tea in the sun on the terrace in pleasant temperatures.
On my blog: Free Culture Book Club — Nevada, part 5 https://john.colagioia.net/blog/2024/07/06/nevada-5.html #freeculture #bookclub
[47°09′29″S, 126°43′48″W] 4097 days without news from Herve
/https://baldo.cat/media/photos/IMG_1338.jpg) #catsoftwtxt
#catsoftwtxt
#catsoftwtxt
[47°09′01″S, 126°43′55″W] Saalmi, retransmit, please
[47°09′33″S, 126°43′28″W] Non-significative results -- sampling finished
I just blocked the following ASN(s) from being able to hit twtxt.net or mills.io:


16509 - AMAZON-02
32934 - FACEBOOK


Why? Because the Claude Bot web crawler from facebookexternalhit and Meta's facebookexternalhit web crawler are both behaving badly for pages that have no cache headers. Not sure if this is malicious, an oversight, a bug or me just being stupid and not ensuring every web resource or page had appropriate Cache headers? 🤔 In any case, until I hear back from at least facebookexternalhit (_whom I've reached out to_), these ASN(s) will remain entirely blocked.

That is the entirety of Amazon Web Services and Facebook.
I just blocked the following ASN(s) from being able to hit twtxt.net or mills.io:


16509 - AMAZON-02
32934 - FACEBOOK


Why? Because the Claude Bot web crawler from facebookexternalhit and Meta's facebookexternalhit web crawler are both behaving badly for pages that have no cache headers. Not sure if this is malicious, an oversight, a bug or me just being stupid and not ensuring every web resource or page had appropriate Cache headers? 🤔 In any case, until I hear back from at least facebookexternalhit (_whom I've reached out to_), these ASN(s) will remain entirely blocked.

That is the entirety of Amazon Web Services and Facebook.
I just blocked the following ASN(s) from being able to hit twtxt.net or mills.io:


16509 - AMAZON-02
Investigate in Security center
32934 - FACEBOOK
16


Why? Because the Claude Bot web crawler from facebookexternalhit and Meta's facebookexternalhit web crawler are both behaving badly for pages that have no cache headers. Not sure if this is malicious, an oversight, a bug or me just being stupid and not ensuring every web resource or page had appropriate Cache headers? 🤔 In any case, until I hear back from at least facebookexternalhit (_whom I've reached out to_), these ASN(s) will remain entirely blocked.

That is the entirety of Amazon Web Services and Facebook.
Good tip!
🧮 USERS:1 FEEDS:2 TWTS:1022 ARCHIVED:76617 CACHE:2289 FOLLOWERS:17 FOLLOWING:14
hehehe Post no relato ao vivo no Público: "A França avança para as meias-finais, onde vai defrontar a França. Portugal ficou pelo caminho no desempate por penáltis." Destaque para o pormenor que a França vai defrontar a França screenshot do street fighter 2, com o Ryu contra o Ryu
hehehe Post no relato ao vivo no Público: "A França avança para as meias-finais, onde vai defrontar a França. Portugal ficou pelo caminho no desempate por penáltis." Destaque para o pormenor que a França vai defrontar a França screenshot do street fighter 2, com o Ryu contra o Ryu
On my blog: Toots 🦣 from 07/01 to 07/05 https://john.colagioia.net/blog/2024/07/05/week.html #linkdump #mastodon #socialmedia #week
@bender Hah! You edited this quite late didn't you 🤣
@shreyan they did, too, had their “Independence Day”. 🥳
[47°09′19″S, 126°43′58″W] Bad satellite signal -- switching to analog communication
Got to fix that broken parsing on URLs with parenthesis in it. This one, inexplicably, works.
[47°09′09″S, 126°43′51″W] --no signal--
[47°09′17″S, 126°43′42″W] --white noise--
@bender Oh my 🤣
@bender Oh my 🤣
@prologic “as a political term, Tory was an insult (derived from the Middle Irish word tóraidhe, modern Irish tóraí, meaning "outlaw", "robber", from the Irish word tóir, meaning "pursuit" since outlaws were "pursued men")”

Source: https://en.m.wikipedia.org/wiki/Tories_(British_political_party)#:~:text=As%2520a%2520political%2520term%252C%2520Tory,Bill%2520crisis%2520of%25201678%E2%80%931681.~=
@prologic “as a political term, Tory was an insult (derived from the Middle Irish word tóraidhe, modern Irish tóraí, meaning "outlaw", "robber", from the Irish word tóir, meaning "pursuit" since outlaws were "pursued men")”

Source: https://en.m.wikipedia.org/wiki/Tories_(British_political_party)
❤️ 🎶: RU BI : Sorrowful Tears by Ahn Ye Eun
❤️ 🎶: Right Now by NewJeans
By the way, why are they called Tories?
By the way, why are they called Tories?
[47°09′06″S, 126°43′22″W] Reading: 1.67000 PPM
❤️ 🎶: My Wish by Lena Park
❤️ 🎶: 캐논의 아침 by Baek A Yeon
❤️ 🎶: You Are My Everything by DAVICHI
[47°09′44″S, 126°43′59″W] Sample analyzing complete -- starting transfer
Yeah, though sometimes the most clever devs aren't always the best to deal with on a personal level. I seem to remember the (former?) lead dev on GrapheneOS (IIRC) was an ass hat and threw tantrums at the smallest things and would get stalkery and weird if someone criticised him, but he's undeniably a brilliant coder and problem solver. Some people need to be more self aware of how their efforts might be harmed with their behaviour though.
🧮 USERS:1 FEEDS:2 TWTS:1021 ARCHIVED:76613 CACHE:2295 FOLLOWERS:17 FOLLOWING:14
Congratulations to the British for getting rid of the Tories tyranny, and electing the forward thinking Labour party! 🥳
- ¡Mira! Soy igual de mono que el otro. -
/https://duque-terron.cat/media/photos/photo_18804-07-2024_21-43-28.jpg) #catsoftwtxt
#catsoftwtxt
#catsoftwtxt
/https://baldo.cat/media/photos/photo_18704-07-2024_21-43-28.jpg) #catsoftwtxt
- ¡Mira! Soy igual de mono que el otro. -
#catsoftwtxt
- ¡Mira! Soy igual de mono que el otro. -
#catsoftwtxt
On my blog: Real Life in Star Trek, Unification Part 2 https://john.colagioia.net/blog/2024/07/04/unification-part-2.html #scifi #startrek #closereading
Este fds há #FestivalElétrico de beats open air no parque da Pasteleira no Porto, alguém vai? Eu vou arrastar-me do sofá no sábado para conseguir ir ver Tiga e Michael Mayer às 18
Este fds há #FestivalElétrico de beats open air no parque da Pasteleira no Porto, alguém vai? Eu vou arrastar-me do sofá no sábado para conseguir ir ver Tiga e Michael Mayer às 18
Independence Day: 0.12 miles, 03:40:05 average pace, 00:27:21 duration

#swimming
Independence Day: 0.12 miles, 03:40:05 average pace, 00:27:21 duration

#swimming
Independence Day: 0.12 miles, 03:40:05 average pace, 00:27:21 duration

#swimming
@prologic

> So basically it seems that Cloudflare has enough data that they can do machine learning to figure out whether the traffic behavior and patterns of bots even ones that fake their identity are really bots or not right?

That would be quite ironic. Using “AI” to fight “AI”, huh? 🤪

(I haven’t read the article in depth, because I don’t use Cloudflare.)
@prologic

> So basically it seems that Cloudflare has enough data that they can do machine learning to figure out whether the traffic behavior and patterns of bots even ones that fake their identity are really bots or not right?

That would be quite ironic. Using “AI” to fight “AI”, huh? 🤪

(I haven’t read the article in depth, because I don’t use Cloudflare.)
@prologic

> So basically it seems that Cloudflare has enough data that they can do machine learning to figure out whether the traffic behavior and patterns of bots even ones that fake their identity are really bots or not right?

That would be quite ironic. Using “AI” to fight “AI”, huh? 🤪

(I haven’t read the article in depth, because I don’t use Cloudflare.)
@prologic

> So basically it seems that Cloudflare has enough data that they can do machine learning to figure out whether the traffic behavior and patterns of bots even ones that fake their identity are really bots or not right?

That would be quite ironic. Using “AI” to fight “AI”, huh? 🤪

(I haven’t read the article in depth, because I don’t use Cloudflare.)
There’s something special about writing your own programs for OS/2 in C and finally getting it to work after sifting through lots of ancient docs. ✨

I’d be totally lost without KO Myung-Hun's website and Open Watcom v2. 🙏

(I’m making a little tool to dump floppy disks to image files. I know these programs already exist – I’m doing it for fun and to learn. The task itself is not complicated, but finding the correct docs is.)

https://movq.de/v/13597a4d87/os2dump.jpg
There’s something special about writing your own programs for OS/2 in C and finally getting it to work after sifting through lots of ancient docs. ✨

I’d be totally lost without KO Myung-Hun's website and Open Watcom v2. 🙏

(I’m making a little tool to dump floppy disks to image files. I know these programs already exist – I’m doing it for fun and to learn. The task itself is not complicated, but finding the correct docs is.)

https://movq.de/v/13597a4d87/os2dump.jpg
There’s something special about writing your own programs for OS/2 in C and finally getting it to work after sifting through lots of ancient docs. ✨

I’d be totally lost without KO Myung-Hun's website and Open Watcom v2. 🙏

(I’m making a little tool to dump floppy disks to image files. I know these programs already exist – I’m doing it for fun and to learn. The task itself is not complicated, but finding the correct docs is.)

https://movq.de/v/13597a4d87/os2dump.jpg
There’s something special about writing your own programs for OS/2 in C and finally getting it to work after sifting through lots of ancient docs. ✨

I’d be totally lost without KO Myung-Hun's website and Open Watcom v2. 🙏

(I’m making a little tool to dump floppy disks to image files. I know these programs already exist – I’m doing it for fun and to learn. The task itself is not complicated, but finding the correct docs is.)

https://movq.de/v/13597a4d87/os2dump.jpg
[47°09′47″S, 126°43′52″W] Analyzing samples
❤️ 🎶: See the Moon by LeeZe
Quick life hack regarding dead gopherholes: Try to contact the owner. Sometimes there is a httpd listening on the same host which points at a way to contact the owner. So
@bender wut da fuq?!
@bender wut da fuq?!
@bender Yeah 😅
@bender Yeah 😅
@aelaraji @prologic @bender They're also AI-ing this, so I doubt that it really works. Just another shit show to lure more people into routing the traffic through Clownflare in my opinion.
[47°09′25″S, 126°43′59″W] Re-taking samples
https://www.punk.ist/ Punkist
https://www.punk.ist/ Punkist
@prologic that, sure, it is a determinant factor, I agree.
@prologic I don't know if there is/will be a Crowdsec bouncer to handle something like that. 🤔
@prologic I don't know if there is/will be a Crowdsec bouncer to handle something like that. 🤔
There is, also, a small controversy going around for something that should have been a small change, but that Kling (SerenityOS, and Ladybird creator) handled quite badly: https://github.com/SerenityOS/serenity/pull/6814.

Seemingly small things like this divide, and have the potential to harm a project.
@bender I think it's the massive data analytics and machine learning that allows them to distinguish these fake bots 🤔
@bender I think it's the massive data analytics and machine learning that allows them to distinguish these fake bots 🤔
@eldersnake how many browsers are out there, that use a unique “engine”? There seems to be quite a few: https://en.m.wikipedia.org/wiki/Comparison_of_browser_engines. Sure, another one won’t hurt. Would I use it? Probably not. 😅
@eldersnake how many browsers are out there, the use a unique “engine”? There seems to be quite a few: https://en.m.wikipedia.org/wiki/Comparison_of_browser_engines. Sure, another one won’t hurt. Would I use it? Probably not. 😅
@prologic oh, then it can’t be applied to self-hosters. Unless they spin up a similar-to-CloudFlare infrastructure. 😂
Base: 7.09 miles, 00:09:56 average pace, 01:10:25 duration
feeling slow so running slow
#running #treadmill
Base: 7.09 miles, 00:09:56 average pace, 01:10:25 duration
feeling slow so running slow
#running #treadmill
Base: 7.09 miles, 00:09:56 average pace, 01:10:25 duration
feeling slow so running slow
#running #treadmill
@bender What I mean is: without using or relying on Cloudflare!
@bender What I mean is: without using or relying on Cloudflare!
DISCLAIMER: I don’t use CloudFlare other than as a Domain Registrar, and their DNS. I might be saying wrong/utterly inaccurate things.
@prologic I think the same way the do DNS proxy. They clean all traffic for you, and your self hosted infra gets it.
Not sure how this can be applied for self hosters?
Not sure how this can be applied for self hosters?
@ basically it seems that Cloudflare has enough data that they can do machine learning to figure out whether the traffic behavior and patterns of bots even ones that fake their identity are really bots or not right?
So basically it seems that Cloudflare has enough data that they can do machine learning to figure out whether the traffic behavior and patterns of bots even ones that fake their identity are really bots or not right?
So basically it seems that Cloudflare has enough data that they can do machine learning to figure out whether the traffic behavior and patterns of bots even ones that fake their identity are really bots or not right?
❤️ 🎶: One ring by Solji
@eldersnake That's actually awesome! Happy seeing the pace at which it's starting to pick up momentum as far as getting more eyes on the project, especially after the "Ladybird Browser Initiative" announcement, that surely got a lot more people talking.
@eldersnake That's actually awesome! Happy seeing the pace at which it's starting to pick up momentum as far as getting more eyes on the project, especially after the "Ladybird Browser Initiative" announcement, that surely got a lot more people talking.
I don't remember who was looking for a way to block A.I bots/scrappers. But here's an article by Cloudflare "Declare your AIndependence: block AI bots, scrapers and crawlers with a single click" offering a way to do so even for the ones spoofing their User-Agent and such.
I don't remember who was looking for a way to block A.I bots/scrappers. But here's an article by Cloudflare "Declare your AIndependence: block AI bots, scrapers and crawlers with a single click" offering a way to do so even for the ones spoofing their User-Agent and such.
[47°09′47″S, 126°43′58″W] Taking samples
[47°09′50″S, 126°43′25″W] Raw reading: 0x66864871, offset +/-5
After that talk about the Ladybird browser the other day, I see this article just pop up:

https://devclass.com/2024/07/03/ladybird-web-browser-project-now-funded-by-github-co-founder-promises-no-code-from-other-browsers/

Seems it's gaining some recognition and support, I hope it can gain traction as we sure as anything need some genuine alternatives.
https://www.marginalia.nu/marginalia-search/about/ search engine
[47°09′49″S, 126°43′36″W] 4094 days without news from Herve