# I am the Watcher. I am your guide through this vast new twtiverse.
# 
# Usage:
#     https://watcher.sour.is/api/plain/users              View list of users and latest twt date.
#     https://watcher.sour.is/api/plain/twt                View all twts.
#     https://watcher.sour.is/api/plain/mentions?uri=:uri  View all mentions for uri.
#     https://watcher.sour.is/api/plain/conv/:hash         View all twts for a conversation subject.
# 
# Options:
#     uri     Filter to show a specific user's twts.
#     offset  Start index for query.
#     limit   Count of items to return (going back in time).
# 
# twt range = 1 57
# self = https://watcher.sour.is/conv/37xr3ra
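A minimal sketch of calling the plain endpoints above from Go; the example feed URL and the assumption that the response is newline-separated plain text are mine, not part of the API docs above:

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/url"
)

func main() {
	// Ask for the last 10 twts of a single feed; uri, offset and limit are
	// the options documented above (the feed URL here is just an example).
	q := url.Values{}
	q.Set("uri", "https://twtxt.net/user/prologic/twtxt.txt")
	q.Set("offset", "0")
	q.Set("limit", "10")

	resp, err := http.Get("https://watcher.sour.is/api/plain/twt?" + q.Encode())
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	body, err := io.ReadAll(resp.Body)
	if err != nil {
		panic(err)
	}
	fmt.Print(string(body)) // plain-text response, presumably one twt per line
}
```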
I just built a poc search engine / crawler for Twtxt. I managed to crawl this pod (twtxt.net) and a couple of others (sorry @etux and @xuu I used your pods in the tests too!). So far so good. I _might_ keep going with this and see what happens 😀
@prologic @etux @xuu This is the result so far in this very quick piece of code:

$ ./twtxt-search-engine
...
All done!
Found 14909 twts in 344 feeds
@prologic @etux @xuu Now I want to remove the "domain" restriction, add a rate-limit and _try_ to crawl as much of the Twtxt wider network as I can and see how deep it goes 🤔
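Later in the thread @prologic mentions the crawler takes a shortcut using colly, so here is a minimal sketch of what dropping the domain restriction and adding a rate limit could look like with colly v2; the delay and parallelism values are arbitrary, and the seed feed is just an example:

```go
package main

import (
	"fmt"
	"time"

	"github.com/gocolly/colly/v2"
)

func main() {
	// No AllowedDomains option, so the collector will follow feeds on any host.
	c := colly.NewCollector()

	// Be polite to small self-hosted pods: cap concurrency per host and add
	// a delay between requests (values here are arbitrary).
	_ = c.Limit(&colly.LimitRule{
		DomainGlob:  "*",
		Parallelism: 2,
		Delay:       500 * time.Millisecond,
	})

	c.OnResponse(func(r *colly.Response) {
		fmt.Println("fetched", r.Request.URL, "-", len(r.Body), "bytes")
		// parse twts here and c.Visit(...) any newly discovered feeds
	})

	// Seed feed to start from; any twtxt feed URL works.
	_ = c.Visit("https://twtxt.net/user/prologic/twtxt.txt")
}
```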
@prologic Cool!
@lyse @prologic very curious... i worked on a very similar track. i built a spider that will trace off any follows, comments and mentions from other users and came up with:

twters:  744
total:  52073
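A minimal sketch of the mention-scanning step described above, assuming yarn-style `@<nick url>` mention syntax; the example twt line and feed URL are made up:

```go
package main

import (
	"fmt"
	"regexp"
)

// Yarn-style twtxt mentions look like "@<nick url>"; capture the URL part so
// a spider can queue newly discovered feeds.
var mentionRe = regexp.MustCompile(`@<(\S+) (\S+)>`)

// feedsMentionedIn returns every feed URL mentioned in a single twt line.
func feedsMentionedIn(twt string) []string {
	var urls []string
	for _, m := range mentionRe.FindAllStringSubmatch(twt, -1) {
		urls = append(urls, m[2])
	}
	return urls
}

func main() {
	// Hypothetical twt line; the feed URL is a placeholder.
	twt := "2021-06-01T00:00:00Z\t@<etux https://example.com/user/etux/twtxt.txt> nice work!"
	fmt.Println(feedsMentionedIn(twt))
}
```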
@lyse @xuu Hmmm very interesting! Let me put my code up somewhere
@prologic It is pretty basic, and depends on some local changes i am still working out on my branch.. https://gist.github.com/JonLundy/dc19028ec81eb4ad6af74c50255e7cee
@xuu I _think_ what I have put together last night is a little different... 🤔 https://gist.github.com/prologic/c64a00affbf14eb3a508ce43ffce1cbb. -- What you've got is a lot more code and looks way more polished 🤗 At a high-level what does yours do?
Ahh I don't think your code actually _crawls_ the Twtxt space right? Just parses URLs given to it and adds them to a database file?
It _might_ be worthwhile combining the two approaches and _actually_ building an honest-to-goodness search engine and crawler for twtxt? 🤔 🤣
@prologic yeah it reads a seed file. I'm using mine. it scans for any mention links and then scans them recursively. it reads from http/s or gopher. i don't have much of a db yet.. it just writes the feed to disk and checks modified dates.. but I will add a db that has hashes/mentions/subjects and such.
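Not the code in @xuu's gist, just a minimal HTTP-only sketch of the approach described above: read a seed file, fetch each feed, count twts, and recurse into any feeds it mentions. The `seeds.txt` path and the mention regexp are assumptions; gopher support and modified-date checks are left out:

```go
package main

import (
	"bufio"
	"fmt"
	"io"
	"net/http"
	"os"
	"regexp"
	"strings"
)

// Assumed yarn-style "@<nick url>" mention syntax.
var mentionRe = regexp.MustCompile(`@<\S+ (\S+)>`)

// crawl fetches a feed, counts its twts, and recurses into any feed URLs it
// mentions. "seen" stops loops between feeds that mention each other.
func crawl(feedURL string, seen map[string]bool, twts *int) {
	if seen[feedURL] || !strings.HasPrefix(feedURL, "http") {
		return // already visited, or not http/s (gopher left out here)
	}
	seen[feedURL] = true

	resp, err := http.Get(feedURL)
	if err != nil || resp.StatusCode != http.StatusOK {
		return // dead feed, skip it
	}
	defer resp.Body.Close()
	body, _ := io.ReadAll(resp.Body)

	for _, line := range strings.Split(string(body), "\n") {
		if line == "" || strings.HasPrefix(line, "#") {
			continue // skip blanks and metadata comments
		}
		*twts++
		for _, m := range mentionRe.FindAllStringSubmatch(line, -1) {
			crawl(m[1], seen, twts)
		}
	}
}

func main() {
	// Seed file: one feed URL per line (path is hypothetical).
	f, err := os.Open("seeds.txt")
	if err != nil {
		panic(err)
	}
	defer f.Close()

	seen := map[string]bool{}
	twts := 0
	s := bufio.NewScanner(f)
	for s.Scan() {
		crawl(strings.TrimSpace(s.Text()), seen, &twts)
	}
	fmt.Printf("feeds: %d\ntwts: %d\n", len(seen), twts)
}
```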
@prologic the add function just scans everything recursively.. but the idea is to just add any new mentions then have a cron to update all known feeds
Wait... So you actually wrote a more elaborate crawler without taking a shortcut like I did using colly (_not that it really helps much_) Hmmm? 🤔 Can we take it a bit further, make a daemon/server out of it, a web interface to search what it crawls using bleve and some tools (_API, Web UI_) to let people add more "feeds" to crawl? 🤔
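A minimal sketch of the bleve side of that idea, using the bleve v2 API; the document shape, IDs and feed URLs below are illustrative only:

```go
package main

import (
	"fmt"

	"github.com/blevesearch/bleve/v2"
)

// Twt is the document we index: the raw text plus the feed it came from.
type Twt struct {
	Feed string `json:"feed"`
	Text string `json:"text"`
}

func main() {
	// On-disk index; the path is arbitrary (New fails if it already exists).
	index, err := bleve.New("twts.bleve", bleve.NewIndexMapping())
	if err != nil {
		panic(err)
	}
	defer index.Close()

	// In the real thing these would come from the crawler; IDs and feed
	// URLs here are made up.
	_ = index.Index("twt-1", Twt{Feed: "https://example.com/user/prologic/twtxt.txt",
		Text: "I just built a poc search engine / crawler for Twtxt."})
	_ = index.Index("twt-2", Twt{Feed: "https://example.com/user/xuu/twtxt.txt",
		Text: "i built a spider that will trace off any follows and mentions"})

	// Full-text query, e.g. what a /search?q=... endpoint would run.
	req := bleve.NewSearchRequest(bleve.NewQueryStringQuery("crawler"))
	res, err := index.Search(req)
	if err != nil {
		panic(err)
	}
	for _, hit := range res.Hits {
		fmt.Println(hit.ID, hit.Score)
	}
}
```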
@prologic sounds about right. I tend to try to build my own before pulling in libs. learn more that way. I was looking at using it as a way to build my twt mirroring idea. and testing the lex parser with a wide ranging corpus to find edge cases. (the pgp signed feeds for one)
@prologic in theory shouldn't need to let users add feeds.. if they get mentioned by a tracked feed they will get added automagically. on a pod it would just need to scan the twtxt feed to know about everyone.
@xuu This is true!
As a quick experiment, I modified my code to remove the domain restrictions and lo and behold:

All done!
Crawled 516 feeds
Found 52464 twts
Found 736 feeds

The Twtxt network is larger than I thought. A significant number of feeds no longer work, obviously, but that's okay, we can prune dead feeds out.
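A minimal sketch of that pruning step: probe each known feed and drop anything that errors out or returns a non-2xx status. The feed list here is a placeholder, and some servers may not answer HEAD requests:

```go
package main

import (
	"fmt"
	"net/http"
	"time"
)

// alive reports whether a feed still answers; anything that errors out or
// returns a non-2xx status counts as dead.
func alive(client *http.Client, feedURL string) bool {
	resp, err := client.Head(feedURL)
	if err != nil {
		return false
	}
	resp.Body.Close()
	return resp.StatusCode >= 200 && resp.StatusCode < 300
}

func main() {
	client := &http.Client{Timeout: 10 * time.Second}

	// In practice this list would come from the crawler's database; these
	// URLs are placeholders.
	feeds := []string{
		"https://twtxt.net/user/prologic/twtxt.txt",
		"https://example.com/gone/twtxt.txt",
	}

	var live []string
	for _, f := range feeds {
		if alive(client, f) {
			live = append(live, f)
		}
	}
	fmt.Printf("kept %d of %d feeds\n", len(live), len(feeds))
}
```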