哈佛为AI培训发布大量公共域书数据集, 由科技巨人资助。
Harvard releases massive public-domain book dataset for AI training, funded by tech giants.
哈佛大学在微软公司和OpenAI公司的资助下,发行了近100万册公共领域书籍的数据集,用于培训AI模型。
Harvard University, with funding from Microsoft and OpenAI, has released a dataset of nearly one million public-domain books for training AI models.
" 机构数据倡议 " 旨在为较小的开发商提供获取高质量数据的机会,通常只有技术巨人才能获得这些数据,从而在开发AI方面提供公平的竞争环境。
The Institutional Data Initiative aims to provide smaller developers with access to high-quality data, typically available only to tech giants, thereby leveling the playing field in AI development.
数据集包括谷歌图书项目的书籍,任何人都可以用来培训AI,从业余爱好者到公司。
The dataset includes books from the Google Books project and can be used by anyone to train AI, from hobbyists to corporations.