# I am the Watcher. I am your guide through this vast new twtiverse.
# 
# Usage:
#     https://watcher.sour.is/api/plain/users              View list of users and latest twt date.
#     https://watcher.sour.is/api/plain/twt                View all twts.
#     https://watcher.sour.is/api/plain/mentions?uri=:uri  View all mentions for uri.
#     https://watcher.sour.is/api/plain/conv/:hash         View all twts for a conversation subject.
# 
# Options:
#     uri     Filter to show a specific users twts.
#     offset  Start index for quey.
#     limit   Count of items to return (going back in time).
# 
# twt range = 1 17
# self = https://watcher.sour.is/conv/pe5ymoa
@quark Maybe at the weekend, but I’m rather busy these days with birthdays coming up (and/or no energy left after work to debug software 😳). But don’t worry, we’ll get this sorted out. 😊
@quark Maybe at the weekend, but I’m rather busy these days with birthdays coming up (and/or no energy left after work to debug software 😳). But don’t worry, we’ll get this sorted out. 😊
@quark Maybe at the weekend, but I’m rather busy these days with birthdays coming up (and/or no energy left after work to debug software 😳). But don’t worry, we’ll get this sorted out. 😊
@movq Sure thing, mate! I do not discount that it could be something stupid I am doing. My list of those is large, and rich. 🤣
@quark Alright, I can’t tell when I’ll be able to do a screen sharing thingy. So let’s try this the old-fashioned way first. Please try to reproduce the issue with the branch quark-trace that I pushed recently. It’ll create a /tmp/jenny.log file (it will get *large*). When you see duplicate twts, try to find them in that log.

Reasons, in theory, why we could see dups:

1) jenny doesn’t detect your feed’s URL correctly.

2) python-dateutil doesn’t parse your twt’s timestamp correctly. Or rather, it parses it differently depending on some env vars? Cronjobs often have this pitfall where some env var is different than your normal environment.

Actually … I can’t think of anything else. 🤔 You don’t see dups *all the time*, it only happens for your *own* twts, and you said that the twt hash *mismatches*. That already narrows it down to something in make_twt_hash(). 🤔

Let’s see if that trace file helps. If it doesn’t, we can add more trace() calls.
@quark Alright, I can’t tell when I’ll be able to do a screen sharing thingy. So let’s try this the old-fashioned way first. Please try to reproduce the issue with the branch quark-trace that I pushed recently. It’ll create a /tmp/jenny.log file (it will get *large*). When you see duplicate twts, try to find them in that log.

Reasons, in theory, why we could see dups:

1) jenny doesn’t detect your feed’s URL correctly.

2) python-dateutil doesn’t parse your twt’s timestamp correctly. Or rather, it parses it differently depending on some env vars? Cronjobs often have this pitfall where some env var is different than your normal environment.

Actually … I can’t think of anything else. 🤔 You don’t see dups *all the time*, it only happens for your *own* twts, and you said that the twt hash *mismatches*. That already narrows it down to something in make_twt_hash(). 🤔

Let’s see if that trace file helps. If it doesn’t, we can add more trace() calls.
@quark Alright, I can’t tell when I’ll be able to do a screen sharing thingy. So let’s try this the old-fashioned way first. Please try to reproduce the issue with the branch quark-trace that I pushed recently. It’ll create a /tmp/jenny.log file (it will get *large*). When you see duplicate twts, try to find them in that log.\n\nReasons, in theory, why we could see dups:\n\n1) jenny doesn’t detect your feed’s URL correctly.\n\n2) python-dateutil doesn’t parse your twt’s timestamp correctly. Or rather, it parses it differently depending on some env vars? Cronjobs often have this pitfall where some env var is different than your normal environment.\n\nActually … I can’t think of anything else. 🤔 You don’t see dups *all the time*, it only happens for your *own* twts, and you said that the twt hash *mismatches*. That already narrows it down to something in make_twt_hash(). 🤔\n\nLet’s see if that trace file helps. If it doesn’t, we can add more trace() calls.
@quark Alright, I can’t tell when I’ll be able to do a screen sharing thingy. So let’s try this the old-fashioned way first. Please try to reproduce the issue with the branch quark-trace that I pushed recently. It’ll create a /tmp/jenny.log file (it will get *large*). When you see duplicate twts, try to find them in that log.

Reasons, in theory, why we could see dups:

1) jenny doesn’t detect your feed’s URL correctly.

2) python-dateutil doesn’t parse your twt’s timestamp correctly. Or rather, it parses it differently depending on some env vars? Cronjobs often have this pitfall where some env var is different than your normal environment.

Actually … I can’t think of anything else. 🤔 You don’t see dups *all the time*, it only happens for your *own* twts, and you said that the twt hash *mismatches*. That already narrows it down to something in make_twt_hash(). 🤔

Let’s see if that trace file helps. If it doesn’t, we can add more trace() calls.
@movq It might be timestamps which are missing a timezone. In this case the spec says UTC must be assumed: if twt_date.tzinfo is None: twt_date = twt_date.replace(tzinfo=datetime.timezone.utc). I had to fix this in my implementation recently, too. Let me quickly provide a fix for the Python reference implementation in the spec.
@lyse @movq What would an time.RFC3339 timestamp look like with and without a TZ? Maybe this is what's causing @maya 's Twts to persist at the top of my feed for yarnd pod owners and users? 🤔
@lyse @movq What would an time.RFC3339 timestamp look like with and without a TZ? Maybe this is what's causing @maya 's Twts to persist at the top of my feed for yarnd pod owners and users? 🤔
@movq @prologic Ok, there you go: https://git.mills.io/yarnsocial/yarn/pulls/459 However, I mixed something up. The bug I found two weeks ago in my implementation was something different regarding seconds precision. Go will always invent UTC for time.Time objects (if I'm not mistaken it's impossible to have a timestamp without a timezone), Python does not.
@prologic Maybe her -00:00 timezones throws off a parser? I think I tried this edge case when coming up with the twt hash spec, but can't tell anymore for sure. At least I've written this (2020-12-13T07:45:23-00:00 → 2020-12-13T07:45:23Z) explicitly down. The thing is, I just documented Go's behavior to make sure I don't break anything. Maybe add a test similar to this one and see what's happening. :-?
@lyse Hmmmm, good idea, but it already does that: https://www.uninformativ.de/git/jenny/file/jenny.html#l219 🤔
@lyse Hmmmm, good idea, but it already does that: https://www.uninformativ.de/git/jenny/file/jenny.html#l219 🤔
@lyse Hmmmm, good idea, but it already does that: https://www.uninformativ.de/git/jenny/file/jenny.html#l219 🤔
@movq Ah! Didn't look at the parsing stage. :-)