The Watcher

@adi If I stuck with that shell script abomination I have no doubt I could have hacked something together but it was already taking half a second to process my feed and nearly a minute to process @prologic's feed, although that one completely broke the script and mangled the output.

adi

f.adi.onl

09 Oct 21 22:01 UTC+0200

View Thread

@prologic @mckinley Meaning you could learned awk. @prologic feed is a special case 😋. Ok, optimized it some:\n\nhttps://clbin.com/MCzFb\n\n\nHow does it run on your side?

adi

f.adi.onl

09 Oct 21 22:34 UTC+0200

View Thread

@mckinley @prologic \n\n> although that one completely broke the script and mangled the output.\n\nI assumed all lines start with a date, so lines starting with '#' break the script. You're required to delete those lines.

mckinley

twtxt.net

09 Oct 21 21:20 UTC

View Thread

@adi\n> I assumed all lines start with a date\n\nSo did I in my attempt, but even after a quick grep -v '^#' it would still break everything.\nI'm trying out the newer version now. Will report back.

mckinley

twtxt.net

09 Oct 21 21:20 UTC

View Thread

@adi
> I assumed all lines start with a date

So did I in my attempt, but even after a quick grep -v '^#' it would still break everything.
I'm trying out the newer version now. Will report back.

mckinley

twtxt.net

09 Oct 21 22:07 UTC

View Thread

I don't see much a difference between the new version and the old version. There are a couple of small bugs I've seen. "2021: January-April" is hard-coded into the twt page template as well as the date "27 April 2021 01:04" for each twt. The timestamps are also printed along with the twt because it just copies the line through. What is smu used for?

mckinley

twtxt.net

09 Oct 21 22:13 UTC

View Thread

My template (most recent copy) attempts to solve the latter two problems, but I think it's another job for awk to avoid the dumpster fire below.


timestamp=$(echo "$line" | grep -Eo '^[0-9]{4}-[01][0-9]-[0-3][0-9][Tt][0-2][0-9]:[0-5][0-9]:[0-5][0-9](\.[0-9]+)?([+-][0-2][0-9]:[0-5][0-9]|Zz)')
twt=$(echo "$line" | sed -e 's/"/\"/g; s/

 -)
hyperlinked=$(echo "$twt" | sed 's|http://[^ ]*[^ ,.;:)>}!]|<a href="&">&</a>|g; s|https://[^ ]*[^ ,.;:)>}!]|<a href="&">&</a>|g')

mckinley

twtxt.net

09 Oct 21 22:13 UTC

View Thread

My template (most recent copy) attempts to solve the latter two problems, but I think it's another job for awk to avoid the dumpster fire below. \n

\ntimestamp=$(echo "$line" | grep -Eo '^[0-9]{4}-[01][0-9]-[0-3][0-9][Tt][0-2][0-9]:[0-5][0-9]:[0-5][0-9](\\.[0-9]+)?([+-][0-2][0-9]:[0-5][0-9]|Zz)')\ntwt=$(echo "$line" | sed -e 's/"/\\"/g; s/

/\\

/g' | cut -f 2- -)\nhyperlinked=$(echo "$twt" | sed 's|http://[^ ]*[^ ,.;:)>}!]|<a href="&">&</a>|g; s|https://[^ ]*[^ ,.;:)>}!]|<a href="&">&</a>|g')\n

mckinley

twtxt.net

09 Oct 21 22:19 UTC

View Thread

It looks like I just have no idea what I'm doing, and that's partially true, but those are the best solutions I was able to get while conforming to POSIX (without awk). I'll take the time at some point to learn more about awk and come up with a better solution for this.

eldersnake

yarn.andrewjvpowell.com

10 Oct 21 09:23 UTC+1100

View Thread

I'm an awk noob myself (@adi wrote the script that parses my twtxt feed) but if I've learned anything, is that it is one little powerhouse of a language.

mckinley

twtxt.net

09 Oct 21 22:32 UTC

View Thread

@eldersnake I've seen a lot of very impressive things done with awk.

adi

f.adi.onl

10 Oct 21 00:51 UTC+0200

View Thread

@mckinley You don't have to sanitize the twt, in pp you are required to sanitized template strings not variable values. You can just output unsanitezed, it's safe.

adi

f.adi.onl

10 Oct 21 00:54 UTC+0200

View Thread

@mckinley @eldersnake Wrong tool for the job, but still fun I guess.

adi

f.adi.onl

10 Oct 21 01:03 UTC+0200

View Thread

@eldersnake I'm not that great with it either, but until now I realized how powerful it is.