cat videos.jsonl | grep '\"author\":\"[a-z0-9]*\"' -o | sed 's/"author":"//;s/"//' | sort | uniq -c | sort -n | tac > users.txtIsto é útil porque há vários "parceiros" (ex. noticiasaominuto) cujos vídeos não vão ser removidos e por isso não temos interesse em preservar
Vejo tb que o canal com mais vídeos (rtc) tem 43000 no dataset mas 91000 no site, por isso é uma boa medida de quão completo o nosso dataset está :-)
O top10 dos já descobertos:
43115 rtc
26422 tpa1
19110 canalq
18020 entretenimentoporto
14628 stv
14452 fantunes11
12712 portocanalonline
8641 alticemeokanal
8498 sapodesporto
7972 economicotv
cc @hugopeixoto @bossatossa@bossatossa
#sapovideos
cat videos.jsonl | grep '\"author\":\"[a-z0-9]*\"' -o | sed 's/"author":"//;s/"//' | sort | uniq -c | sort -n | tac > users.txtIsto é útil porque há vários "parceiros" (ex. noticiasaominuto) cujos vídeos não vão ser removidos e por isso não temos interesse em preservar
Vejo tb que o canal com mais vídeos (rtc) tem 43000 no dataset mas 91000 no site, por isso é uma boa medida de quão completo o nosso dataset está :-)
O top10 dos já descobertos:
43115 rtc
26422 tpa1
19110 canalq
18020 entretenimentoporto
14628 stv
14452 fantunes11
12712 portocanalonline
8641 alticemeokanal
8498 sapodesporto
7972 economicotv
cc @hugopeixoto @bossatossa@bossatossa
#sapovideos
*If you don’t think you need to read this post because you’re always giving Good, Helpful Advice as a Good, Helpful Citizen, this one is for you. I’m sure you probably mean well, but it is with a heavy heart that I must inform you that you’ve likely annoyed the hell out of someone at some point or another. Probably more than once. Maybe it’s a regular pattern of behaviour. This post is for you.*
https://anotherangrywoman.com/2023/01/18/how-to-give-advice-on-the-internet-without-being-an-utter-menace/
Shaddow of a large hatSo I had to take a photo. Just too funny. Going up the mountain we smelled freshly baked bread. What an overpowering scent. The bread baking boys must have been firing the wood-burning oven at the summit shortly before we arrived. On the way down we ran into extremely cute lambs. Super adorable. If only I could have petted them.
Very pretty sheepEven further down the mountain we came across a caterpillar. It totally looked like a flute wiper. Unfortunately, it walked too quickly for my camera's shutter speed and the fading light.
Some of the blackberries were already good to eat. Most of them, however, were still sour. Even though they heavily wacked them down last year for the forest liming, these blackberry bushes are three meters tall and about 40 meters in length. In a few weeks this strip in the forest will be heaven, I reckon.
Finally, we saw two deer on a meadow. Plenty of good encounters today. Just too much wind noise on the videos, sorry!
Shaddow of a large hatSo I had to take a photo. Just too funny. Going up the mountain we smelled freshly baked bread. What an overpowering scent. The bread baking boys must have been firing the wood-burning oven at the summit shortly before we arrived. On the way down we ran into extremely cute lambs. Super adorable. If only I could have petted them.
Very pretty sheepEven further down the mountain we came across a caterpillar. It totally looked like a flute wiper. Unfortunately, it walked too quickly for my camera's shutter speed and the fading light.
Some of the blackberries were already good to eat. Most of them, however, were still sour. Even though they heavily wacked them down last year for the forest liming, these blackberry bushes are three meters tall and about 40 meters in length. In a few weeks this strip in the forest will be heaven, I reckon.
Finally, we saw two deer on a meadow. Plenty of good encounters today. Just too much wind noise on the videos, sorry!
Mejor que la tele. ⌘ Read more****
"A Entidade Reguladora para a Comunicação Social (ERC), recomendou a eventual criação do estatuto de rádio comunitária, pedindo ainda à Autoridade Nacional de Comunicações (Anacom) que considere a disponibilização de micro-frequências nesse sentido."
https://24.sapo.pt/amp/atualidade/artigos/radios-locais-erc-propoe-criacao-do-estatuto-de-radio-comunitaria-e-pede-a-anacom-micro-frequencias
#rádio
Aber was ich bisher so gesehen habe: *Eigentlich* ist das Regelwerk der ISO27001 gar nicht so schlecht, also die Gedanken darin/dahinter. (Natürlich kostet der Kram was, kannste also nicht mal einfach so reingucken.) Man könnte das schon als Grundlage nehmen, um sich mal die Firma anzugucken, was die so tut und wie sie das tut und dann tatsächlich Dinge verbessern.
Anders formuliert: Wenn du den Willen hast, dein Unternehmen zu verbessern, dann schau’ in die ISO27001 rein. Die gibt Ansatzpunkte und Ideen, an die du vielleicht gar nicht dachtest.
Wenn man das wirklich gewissenhaft macht und mehr als 5 Mitarbeitende hat, dann ist das aber ein *unfassbar* aufwändiger und schmerzhafter Vorgang. Ich kann mir beim besten Willen nicht vorstellen, dass die ganzen Unternehmen/Konzerne, die hübsch mit ISO27001-Zertifizierung werben, das auch so durchgezogen haben. No way. Und spätestens da fängt’s dann an, albern zu werden.
Aber was ich bisher so gesehen habe: *Eigentlich* ist das Regelwerk der ISO27001 gar nicht so schlecht, also die Gedanken darin/dahinter. (Natürlich kostet der Kram was, kannste also nicht mal einfach so reingucken.) Man könnte das schon als Grundlage nehmen, um sich mal die Firma anzugucken, was die so tut und wie sie das tut und dann tatsächlich Dinge verbessern.
Anders formuliert: Wenn du den Willen hast, dein Unternehmen zu verbessern, dann schau’ in die ISO27001 rein. Die gibt Ansatzpunkte und Ideen, an die du vielleicht gar nicht dachtest.
Wenn man das wirklich gewissenhaft macht und mehr als 5 Mitarbeitende hat, dann ist das aber ein *unfassbar* aufwändiger und schmerzhafter Vorgang. Ich kann mir beim besten Willen nicht vorstellen, dass die ganzen Unternehmen/Konzerne, die hübsch mit ISO27001-Zertifizierung werben, das auch so durchgezogen haben. No way. Und spätestens da fängt’s dann an, albern zu werden.
Aber was ich bisher so gesehen habe: *Eigentlich* ist das Regelwerk der ISO27001 gar nicht so schlecht, also die Gedanken darin/dahinter. (Natürlich kostet der Kram was, kannste also nicht mal einfach so reingucken.) Man könnte das schon als Grundlage nehmen, um sich mal die Firma anzugucken, was die so tut und wie sie das tut und dann tatsächlich Dinge verbessern.
Anders formuliert: Wenn du den Willen hast, dein Unternehmen zu verbessern, dann schau’ in die ISO27001 rein. Die gibt Ansatzpunkte und Ideen, an die du vielleicht gar nicht dachtest.
Wenn man das wirklich gewissenhaft macht und mehr als 5 Mitarbeitende hat, dann ist das aber ein *unfassbar* aufwändiger und schmerzhafter Vorgang. Ich kann mir beim besten Willen nicht vorstellen, dass die ganzen Unternehmen/Konzerne, die hübsch mit ISO27001-Zertifizierung werben, das auch so durchgezogen haben. No way. Und spätestens da fängt’s dann an, albern zu werden.
#swimming
#swimming
#swimming
#swimming
1. Abre as developer tools do teu browser (tecla F12) e abre o tab "Network"
2. Abre a página de um user
3. Faz scroll para baixo, vão aparecer linhas novas na tab network
4. Quando chegares ao fim da lista, clica em cada uma das linhas e à direita faz copy do campo "data" (v. img), e cola o resultado num editor de texto
5. Repete pra todas e grava o texto como videos.txt
Agora no terminal, vai ao dir onde tens o videos.txt para podermos extrair o identificador de cada vídeo encontrado
cat videos.txt | grep randname | cut -c 17-36 > randnames.txte agora podemos descarregar tudo de uma vez com o yt-dlp:
while read p; do yt-dlp "http://videos.sapo.pt/$p"; done < randnames.txtE deve dar, ou não. Apontei isto a correr para não perder a referência, e porque não tenho tempo de escrever um scraper. Talvez dê jeito a alguém.
Obrigado @brunomiguel e @JD557@JD557 pelas pistas!
developer tools do firefox a mostrar uma response e o campo JSON a copiar
#swimming
#swimming
#swimming
#swimming
* https://www.youtube.com/@StefanGotteswinter/videos Machining tiny parts
* https://www.youtube.com/@ThisOldTony/videos Machining and engineering
* https://www.youtube.com/@urituchmanpigeon/videos Building cool stuff
* https://www.youtube.com/@torbjornahman/videos Blacksmithing
* https://www.youtube.com/@Matthiaswandel/videos Woodworking and engineering
* https://www.youtube.com/@matthiasrandomstuff2221/videos All sorts of engineering
* https://www.youtube.com/@SVSeeker/videos Building a steel sailing vessel and sailing
* https://www.youtube.com/@ProjectBrupeg/videos Restoring a steel trawler
* https://www.youtube.com/@SampsonBoatCo/videos Rebuilding a wooden sailing yacht*
#swimming
#swimming
#swimming
#swimming
Yeah, listening to all these owl calls on YouTube I was surprised that they were so short. All those years I thought hoots are much, much longer. Learning something new very day. :-)
Conhecem alguém no arquivo.pt? Era fundamental ser feito o arquivamento disto, já se viu que se dependemos das empresas, tudo eventualmente desaparece.