Is there an instance that has migrated all Reddit content?

InternetPirate@lemmy.fmhy.ml · 1 year ago

Is there an instance that has migrated all Reddit content?

qprimed@lemmy.ml · 1 year ago

I can actually see some merit to a lemmy API accessible reddit corpus. it would be interesting to reference old reddit info in a lemmy compatible way with zero reference to reddit itself.

doing so for the entire corpus properly (link fixups, etc) would be… challenging, but doable.
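For a rough idea of what the link-fixup part might involve, here is a minimal Python sketch. It assumes archived posts are plain text and rewrites reddit thread links to a made-up archive URL scheme. `ARCHIVE_BASE`, the `/c/<community>/post/<id>` path layout, and `fix_links` are all hypothetical names for illustration, not part of Lemmy's API.

```python
import re

# Hypothetical base URL of the archive instance (placeholder, not a real site).
ARCHIVE_BASE = "https://archive.example"

# Matches links to reddit comment threads, with or without a trailing title slug,
# e.g. https://www.reddit.com/r/python/comments/abc123/some_title/
REDDIT_LINK = re.compile(
    r"https?://(?:www\.|old\.)?reddit\.com"
    r"/r/(?P<sub>\w+)/comments/(?P<post_id>\w+)(?:/[^\s)]*)?"
)


def fix_links(text: str) -> str:
    """Rewrite reddit thread links to the assumed archive URL layout."""
    def repl(match: re.Match) -> str:
        # The target path layout is an assumption made for this sketch.
        return f"{ARCHIVE_BASE}/c/{match.group('sub')}/post/{match.group('post_id')}"

    return REDDIT_LINK.sub(repl, text)
```

This only covers thread links. User mentions, image hosts, and cross-posts would each need their own handling, which is where the "challenging" part comes in.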

Andy@lemmy.world · 1 year ago

Letting the LLMs source from there for free, completely invalidating the proposed licensing model at Reddit.

graphito@beehaw.org · 1 year ago

lemmit.online ?

SwingingKoala@discuss.tchncs.de · edit-2 5 months ago

deleted by creator

vamp07@lemm.ee · 1 year ago

I’m at a loss as to why anybody would want this in the first place.

RagingNerdoholic@lemmy.ca · edit-2 1 year ago

Am I really going to buy a 2TB drive to hold all of reddit…

Actually, I’m pretty surprised that it’s only 2TB.

InternetPirate@lemmy.fmhy.ml · 1 year ago

It would be helpful if there were an instance that migrated all of this to Lemmy so that we could access it from any other instance, instead of having to download it for local browsing.

RagingNerdoholic@lemmy.ca · 1 year ago

I haven’t downloaded it. Looks like a collection of compressed files, but I don’t know exactly what’s inside of them. Do you know what format they’re in?

taladar@sh.itjust.works · 1 year ago

I don’t really see how this would be useful. Having purely archived data available in software designed to show you new posts feels like a format/content mismatch.

InternetPirate@lemmy.fmhy.ml · 1 year ago

I find Reddit more useful because of all the data it has than because of the new posts. I’m sure I’m not the only one.