Maybe I‘m completely stuck but can somebody please explain the usecase of self hosting TubeArchivist?
https://github.com/tubearchivist/tubearchivist
Why would I rather download the content and watch it locally instead of watching it directly on YT?
I saw a person post a really good comment in regards to this a few days ago. They said they don’t want their kids going on YouTube. But there are videos they want to watch on YouTube. So they download the videos and put them on their media server. So the kids have their own playlist of videos to watch.
That’s actually really smart. Gotta remember that for when/if the condom ever breaks. Thanks!
Niche reason: I’m in China and YouTube is blocked. My server has VPN access, but my Kodi system on my TVs doesn’t.
That said, I hadn’t heard of this project, but I’ll probably install it now instead of manually using yt-dlp.
I just use it to be prepared in case youtube wins the ongoing adblocking war.
YT videos get taken down for any reason these days - fake copyright claims, hacking or just the creator getting fed up with YT’s policies. Entire channels vanish with no warning. Valuable videos that generate income suddenly become private only. It is not an open platform, it’s a monetised platform first and foremost.
If you have these videos under your control, then if they’re no longer watchable online, you still have them. That’s exactly what TA is for and does a superb job of. Basically every YT video I watch that I think is useful, I hit the Save button. Some of them are indeed no longer available. I have entire channels downloading so if the creator does close up shop, at least I’ve got their latest.
Obviously you need a lot of storage space - mine is over 5TB and growing. But it’s worth it.
Also, it avoids the YT before, mid and after ads.
Why self host? So you can have an independent instance that can’t be paywalled, deleted, or modified by a third party company like Google…
Hi, i use tubearchivist a lot. My usecases:
- I follow several video essasist types whose content has been removed for various reasons- reviews of media Claimed, hit with ‘inappropriate content’ (spuriously, usually). In two cases, there are creators who removed a lot of their older content for reasons that I completely understand, but also want to be able to reference because they made good points.
-Another creator i follow had their channel hacked, all content replaced with bitcoinshilling. The older videos were eventually restored, but there were a couple days where the content was completely unavailible, and the restoration was not exactly guarenteed.
-i use youtube for background noise a lot. This might be one of the reasons i rewatch a lot of essays and reviews or music videos. While yes, i can use regular youtube for this, tracking, ads, etc are an issue (yes, adblockers exist, but youtube is escalating the war with them). It’s easier to just pull up and watch on my own server instead.
Didbt set it up rn, but will tomorrow. I think its perfect if u got childs. So you can define what they can see.
Hmmm. My docker container for tubearchivist has suddenly stopped working. Sad.
Maybe take a look at the setup guide, Maybe there were some changes.
Years ago there was this funny clip of Ben Affleck doing various different Boston accents on the Jimmy Kimmel show. Super funny and became one of those clips that I would seek out a few times a year to rewatch and have a laugh.
It got pulled for for copyright or something which was weird because its really like a 3 minute clip. In any event its gone, forever. Cant find it anywhere.
Tubearchivist allows us to grab things we plan on rewatching over and over.
Imagine your favorite youtuber decide to just delete their channel… how would you re-watch their content or watch the content you never watched originally…
Essentially it’s the same use case as DVD’s and Blurays… they exist so you can watch the content when you want to regardless of whether they are still available on the original platform.
Good point. Thanks.
came here to say basically the same thing: preservation
What if that content isn’t there tomorrow?
Google knows exactly what you’re watching and when you’re watching it. Not so much if you just archive everything.
last i looked it was a bloated mess
Can you explain more?
it took up like a gig of memory before i even had even logged into it. i think it’s because the author uses elasticsearch which just felt like overkill for what i thought it aimed to accomplish
There’s definitely some overhead. ES is used as a DB to integrate search functionality. I know you can do close or similar with other DBs, but that’s what they went with. It works pretty good on my archive (as far as the team is aware, I’m the heaviest user). I’ve got comments, subtitles in ES as well, all searchable.
You can limit the RAM ES has available to it with Java arguments if you care.
Content disappearing is a big reason, but it’s also great for channels that post b-roll or green screen videos/effects/etc if you like video editing.
Sure, you can grab them manually, or you can just have your folders of source materials available at the speeds of your local network, already stacked up and ready to work with.
Also, as someone in the infosec industry, a lot of conference talks and guides and useful videos get blown away whenever YouTube goes on a “hacking tutorial purge”. This has already killed multiple channels full of useful guides.
Plus, YouTube has reencoded old videos and won’t let you access the originals pre automated upscaling. YT is not a backup or an archival site. So, if that’s something you want, you have to DIY it.
Thanks 👍🏻
It’s also quite easy to use. I used it for example to download yt video on my phone before taking off on plane.