Eleuther AI releases 8TB collection of licensed and open training data – Computerworld
AI research organization Eleuther AI has launched Common Pile v0.1, a massive text dataset that can be used to train AI systems, according to TechCrunch. The 8TB dataset consists exclusively of openly licensed or public-domain texts. Common Pile v0.1 was developed over two years in collaboration with Poolside, Hugging Face, the US Library of…