Is there a collection of all human knowledge ever created?

saltynuts420@lemm.ee · 2 years ago

Is there a collection of all human knowledge ever created?

jeff@lemmy.ca · 2 years ago

Wikipedia is a great start. You can download it in its entirety, roughly 100 GB. It covers most basic and advanced human knowledge.

Check out Kiwix to get it offline.

Random Dent@lemmy.ml · 2 years ago

You can do all of Project Gutenberg too. It’s only about 75 GB, surprisingly.

ozebb@lemmy.world · 2 years ago

Yes!

https://libraryofbabel.info/About.html

But there’s a catch.

funnystuff97@lemmy.world · 2 years ago

Quite a lot to read in that library.

Zahille7@lemmy.world · 2 years ago

You son of a bitch

GrabtharsHammer@lemmy.world · 2 years ago

Such an insightful commentary on the importance of the social contract and the irreplaceability of the individual. The only way forward is to share our personal experiences and strive for understanding. Once we know each other’s value, we will never surrender our common bonds, disappoint one another, go behind each other’s backs, nor do each other harm.

the_q@lemmy.world · 2 years ago

You’re using it right now.

schnurrito@discuss.tchncs.de · 2 years ago

That is pretty much exactly the goal of the Wikimedia Foundation which runs Wikipedia and its sister projects.

But by now we have figured out what wikis can and cannot do well. Wikis are suitable for crowdsourcing objective facts about the world (all it takes is one person to add any given fact). They are not a universal remedy for everything, especially not for contentious issues or useful instructional materials.

I have made more than 100,000 edits to their projects. I don’t participate there anymore. The time when they were a force for good in the world is long past.

TheControlled@lemmy.world · 2 years ago

Why aren’t they a force for good anymore?

schnurrito@discuss.tchncs.de · 2 years ago

Many reasons, most of which you’ll only understand if you pay some attention to what’s going on behind the scenes.

There are reasons why, nowadays, more content is created pretty much everywhere else on the Internet than on the Wikimedia projects.

The Wikipedias’ “neutral point of view” policy used to mean “we try to treat all sides fairly”; now it means “we are writing an unconditional propaganda organ for the status quo”. The mainstream media accepted as “reliable” sources on Wikipedia just isn’t that credible anymore.

Also, when I started editing there, the individual projects were mostly left alone by the WMF. Nowadays the WMF issues opaque sanctions, up to lifetime bans from all projects, left and right.

I wish someone would start an organization with the same goals as the WMF, with an actually working system where people could actually enjoy participating.

Doubletwist@lemmy.world · 2 years ago

Be the change you want to see in the world…

Heck, most of the hard work is already done for you, since the software that runs Wikipedia is open source.

schnurrito@discuss.tchncs.de · 2 years ago

Many people have tried that before. Wikis just aren’t that appealing anymore. Today’s internet is all about social media.

Doubletwist@lemmy.world · 2 years ago

If there’s anything that is absolutely atrocious as a searchable repository of knowledge, it’s social media.

foggy@lemmy.world · 2 years ago

All of Wikipedia is <256 GB.

All of Wikipedia in English is <64 GB.

Then there’s archive.org for multimedia, ~10 petabytes. Yipes.

krimsonbun@lemmy.blahaj.zone · 2 years ago

Well, archive.org has many more videos.

Doubletwist@lemmy.world · 2 years ago

Which are part of human knowledge.

krimsonbun@lemmy.blahaj.zone · 2 years ago

True, but I still don’t really find it a fair comparison; both are great at what they do.

spitz@lemmy.ml · 2 years ago

According to my ex, her.

alphacyberranger@sh.itjust.works · 2 years ago

Can confirm.

spitz@lemmy.ml · 2 years ago

Haha you can have her. Good luck!

alphacyberranger@sh.itjust.works · 2 years ago

Please take her back. I’ll even pay you.

haris@mander.xyz · 2 years ago

Check out this book: https://en.m.wikipedia.org/wiki/The_Knowledge:_How_to_Rebuild_Our_World_from_Scratch. It analyses that precise question in the first chapter. The author argues that even though Wikipedia is probably the closest thing there is, it clearly lacks the practical knowledge that would be essential in the situation you are describing. Scientific progress relies heavily on industrial progress, and even if you know how to build something, that doesn’t mean you can do it, as other things are required first.

DarkMatterStyx@lemmy.world · 2 years ago

I think the internet as a whole is going to be the closest we’ll ever come. Capitalism will make sure it’s never even close to complete so it always has something to monetize.

saltynuts420@lemm.ee · 2 years ago

You can read pretty much any book written by humans since 2500 BCE (except lost media, such as works lost in library burnings, film destruction, and wars), watch any movie ever released, and hear any music made since recording was invented. For example, the Rig Veda, the first Veda of Hinduism, was composed even before 2500 BCE and is said to be 98% in its original state today, thanks to Indian gurus and saints who passed it on orally; it was made into a book only after the 8th century.

Of course, there is a catch: these media are not freely and publicly available, and most of them you have to pirate in order to consume. So we need a centralised database of these things, kept safely somewhere, so that we don’t have to reinvent the wheel in case of a catastrophic event.

Decoy321@lemmy.world · 2 years ago

I wouldn’t say “complete” can even be sufficiently defined in this case. Every functional definition I can think of has a limiting factor.

Let’s try to define knowledge. What kind of information qualifies? We can usually think of important, useful info like physics and medicine. But what about other data, like sports game stats, atmospheric sensor readings, or even something more esoteric, like the location data of every object on Earth?

And even if we could have the information of every single thing at any particular time, what about when things change in the next second? And the one afterwards?

Essentially, nothing will ever be “complete”. Thanks for listening to my rant on semantics.

DarkMatterStyx@lemmy.world · 2 years ago

That was a lovely rant on semantics. I thoroughly enjoyed reading it!

Cyclohexane@lemmy.ml · 2 years ago

I’m surprised no one mentioned projects like LibGen and Sci-Hub. They are much better than Wikipedia imo.

saltynuts420@lemm.ee · 2 years ago

Imo Z-Library is much better, but they keep changing their domain… Also, Sci-Hub is only for research papers, which most people can’t understand.

windtorn@beehaw.org · 2 years ago

Well, technically, the Library of Babel, though that probably isn’t really what you’re looking for.

starman@programming.dev · 2 years ago

Besides what other commenters already said, archive.org does a great job.

Call me Lenny/Leni@lemm.ee · 2 years ago

Yes, though not one you might like.

saltynuts420@lemm.ee · 2 years ago

Thanks

Jordan117@lemmy.world · 2 years ago

https://en.wikipedia.org/wiki/Long_Now_Foundation

https://en.wikipedia.org/wiki/Memory_of_Mankind

kuneho@lemmy.world · 2 years ago

https://www.wikidata.org/

JohnDClay@sh.itjust.works · 2 years ago

It’s never going to be all knowledge, since a lot of stuff is just lost or was never recorded. A ton of stuff (like this thread) is probably low on the priority list for recording as well. But the closest you’d probably get to a full catalog of human knowledge (at least text-based) is the huge data sets of nearly all text data on the internet used for training LLMs. I wouldn’t be surprised if there are soon ones that include video and pictures as well, since newer AI models are starting to be able to interpret those too.

I believe this is one of those data sets: https://github.com/yaodongC/awesome-instruction-dataset

Edit: here’s a big data set used for a lot of GPT-3’s training: https://commoncrawl.org/