Tube Archivist is quite heavyweight, as it's meant to do full archiving of YouTube channels and search through positively huge libraries. I'm getting the sense that it's a data hoarding tool, not a casual web video watching tool. I found that I just want to add a few channels to my media library, for which I already use Jellyfin.<p>For people looking for a more lightweight option of that kind, I run the following script hourly [1]. This script uses yt-dlp to go through a text file full of YouTube RSS URLs (either a channel RSS or a playlist RSS works, for channels where you're only interested in a subset of videos) [2] and downloads the latest 5 videos, organized in folders by channel name. I watch these files by adding the output folder to a Jellyfin "Movies" type library sorted by most recent. The script contains a bunch of flags to make sure Jellyfin can display video metadata and thumbnails without any further plugins, and repackages videos into a format that is 1080p yet plays efficiently even in web browsers on devices released within the last 10 years or so.<p>It uses yt-dlp's "archive" functionality to keep track of videos it's already downloaded, so it only downloads each video once, and I use a separate script to clean out files older than two weeks once in a while. Running the script depends on ffmpeg (just used for repackaging videos, not transcoding!), xq (usually comes packaged with jq or yq) and yt-dlp being installed. You sometimes will need to update yt-dlp if a YouTube-side change breaks it.<p>For my personal usage it's been honed for a little while and now runs reliably, for my purposes at least. Hope it's useful to more people.<p>[1]: <a href="https://pastebin.com/s6kSzXrL" rel="nofollow noreferrer">https://pastebin.com/s6kSzXrL</a><p>[2]: E.g. <a href="https://danielmiessler.com/p/rss-feed-youtube-channel/" rel="nofollow noreferrer">https://danielmiessler.com/p/rss-feed-youtube-channel/</a>
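For anyone who'd rather not open the pastebin first, the overall shape is roughly this. To be clear, this is my minimal sketch of the approach described above, not the author's actual script: the file names, output template, xq query, and format selector are all my assumptions.

```shell
#!/bin/sh
# Sketch: pull the newest 5 videos from each RSS feed listed in feeds.txt.
# Assumed layout (not from the original script): feeds.txt, library/, archive.txt
FEEDS="$HOME/yt/feeds.txt"       # one YouTube RSS URL per line
OUT="$HOME/yt/library"
ARCHIVE="$HOME/yt/archive.txt"   # yt-dlp's "already downloaded" ledger

while read -r feed; do
  # xq parses the XML feed into JSON; each entry's link is the watch URL
  curl -s "$feed" \
    | xq -r '.feed.entry[].link."@href"' \
    | head -n 5 \
    | yt-dlp \
        --batch-file - \
        --download-archive "$ARCHIVE" \
        -f 'bv*[height<=1080][vcodec^=avc1]+ba[acodec^=mp4a]/b[height<=1080]' \
        --merge-output-format mp4 \
        --write-thumbnail --convert-thumbnails jpg \
        --embed-metadata \
        -o "$OUT/%(channel)s/%(title)s [%(id)s].%(ext)s"
done < "$FEEDS"
```

The avc1/mp4a format selector is one way to get the "plays in any browser without transcoding" property; the real script may pick formats differently.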
I saw this was a Django app so I dug around to look at their models. As far as I can tell this is all they have: <a href="https://github.com/tubearchivist/tubearchivist/blob/master/tubearchivist/home/models.py">https://github.com/tubearchivist/tubearchivist/blob/master/t...</a> - just an `Account` model.<p>It looks like Django + SQLite is used for user accounts, but all other data storage happens in Elasticsearch.<p>It's an interesting design decision. I would have gone all-in on the database, and used SQLite FTS in place of Elasticsearch for simplicity, but that's my own personal favourite stack. Not saying their design is bad, just different.
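For the curious: this is not Tube Archivist's code, just a minimal sketch of what the "SQLite FTS instead of Elasticsearch" alternative could look like, using the sqlite3 CLI and a made-up `video_fts` table.

```shell
# FTS5 is compiled into most sqlite3 builds; table/column names are hypothetical
sqlite3 videos.db <<'SQL'
CREATE VIRTUAL TABLE IF NOT EXISTS video_fts
  USING fts5(video_id UNINDEXED, title, description);
INSERT INTO video_fts (video_id, title, description)
  VALUES ('abc123', 'Example title', 'Example description text');
-- full-text query, best matches first
SELECT video_id, title FROM video_fts
  WHERE video_fts MATCH 'example' ORDER BY rank;
SQL
```

You lose Elasticsearch's analyzers and faceting, but for a personal archive the zero-ops tradeoff is arguably worth it.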
Does it save the video thumbnail as well? Video description? Comments? Channel name? Channel avatar? etc<p>Currently I use yt-dlp to manually download individual videos that I want to keep. At the moment I only save the video itself. And most of the time I then also paste the URL of the video into archive.is save page and web.archive.org/save so that there is a snapshot of what the video page itself looked like at the time. But this is still incomplete, and relies on those services continuing to exist. Locally saving a snapshot of the page like that, and then also saving the thumbnail and perhaps more of the comments would be nice.
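For the manual yt-dlp workflow, most of that extra material can be saved locally with existing flags. A sketch (the URL is a placeholder; all of these options are real yt-dlp flags):

```shell
yt-dlp \
  --write-thumbnail \
  --write-description \
  --write-info-json \
  --write-comments \
  --write-subs --write-auto-subs \
  'https://www.youtube.com/watch?v=VIDEO_ID'
# --write-info-json captures title, description, channel name/id, etc. as JSON;
# --write-comments pulls the comment thread into that JSON (can be very slow
# on popular videos). The channel avatar isn't saved by default, as far as I know.
```

That doesn't replace a visual snapshot of the page, but it does make the archive independent of archive.is / web.archive.org staying up.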
I've had significant problems running this for extended periods.<p>It will crash, and then restoring it fails internally with corruption errors, requiring digging through Docker logs or just starting over from scratch completely.
Ha, I have also written something similar<p><a href="https://github.com/rumca-js/Django-link-archive">https://github.com/rumca-js/Django-link-archive</a><p>It supports not only YouTube, but also any RSS source.<p>It functions as link aggregation software. It can also fetch metadata for all videos in a channel, and download videos and audio.<p>I am using the standard Django auth module.<p>It still lacks polish and is under development. I am not a webdev, so I am still struggling with the overall architecture.
Interesting idea.<p>I always dream of writing a proxy server where all videos, irrespective of device, get stored in a local cache and served without going outside on subsequent requests.<p>Gonna try this one, and maybe take it in that direction.
As a less sophisticated alternative there's a metadata plugin for jellyfin <a href="https://github.com/ankenyr/jellyfin-youtube-metadata-plugin">https://github.com/ankenyr/jellyfin-youtube-metadata-plugin</a>
I couldn't find docs for installing from source. Is Docker really mandatory?<p>Also, "Tube Archivist depends on Elasticsearch 8." Wow, why?
Looks great, I will try it, since YouTube broke my scripts a while ago...<p>The way I was using them was to create a playlist named "save" and pull from it once a day. It worked for a while, but YT somehow started to ban my script. Tube Archivist looks like it would be ideal for that.<p>Thanks for sharing this!
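In case it helps someone else with the same setup, the "save" playlist approach can be done with plain yt-dlp from cron. This is a hedged sketch with a placeholder playlist ID; `--cookies-from-browser` (a real yt-dlp flag) is needed to read a private playlist and sometimes helps with the throttling/banning issue, though it's no guarantee:

```shell
# run daily from cron; the archive file means each video is fetched only once
yt-dlp \
  --download-archive "$HOME/yt/save-archive.txt" \
  --cookies-from-browser firefox \
  'https://www.youtube.com/playlist?list=PLAYLIST_ID'
```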