Home Technology Harvard and Google to launch 1 million public-domain books as AI coaching...

Technology

Harvard and Google to launch 1 million public-domain books as AI coaching dataset | TechCrunch

December 12, 2024

AI coaching knowledge has an enormous price ticket, one best-suited for deep-pocketed tech companies. This is the reason Harvard College plans to release a dataset that features within the area of 1 million public-domain books, spanning genres, languages, and authors together with Dickens, Dante, and Shakespeare, that are not copyright-protected on account of their age.

The brand new dataset isn’t out there but, and it’s not clear when or how will probably be launched. Nonetheless, it comprises books derived from Google’s longstanding book-scanning undertaking, Google Books, and thus Google shall be concerned in releasing “this treasure trove far and broad.”

Harvard first teased the Institutional Data Initiative (IDI) back in March, outlining its plans to create a “trusted conduit for authorized knowledge for AI.” Nonetheless, not a lot has been heard from it till its formal launch today, which got here with affirmation that the IDI consists of monetary backing from Microsoft and OpenAI.

The IDI’s government director Greg Leppert says the dataset’s designed to “stage the taking part in subject” by opening up such an enormous dataset to anybody — from analysis labs to AI startups — that wish to practice their giant language fashions (LLMs).

Harvard and Google to launch 1 million public-domain books as AI coaching dataset | TechCrunch

MOST POPULAR

Rivian elects Cohere’s CEO to its board in newest sign the...

Netflix hops aboard Sifu film adaptation, assigns a screenwriter

Blow Out the Candles, Not Your Finances – NordVPN 72% Off...

Humane’s AI Pin is useless, as HP buys startup for $116M...

The very best laptop computer energy banks for 2025

Stunning VPN Deal: Simply Over $2/Month for Final Safety!

TRENDING

Rivian elects Cohere’s CEO to its board in newest sign the...

Netflix hops aboard Sifu film adaptation, assigns a screenwriter

Blow Out the Candles, Not Your Finances – NordVPN 72% Off...

Humane’s AI Pin is useless, as HP buys startup for $116M...

The very best laptop computer energy banks for 2025

Stunning VPN Deal: Simply Over $2/Month for Final Safety!

Rivian elects Cohere’s CEO to its board in newest sign the...

Netflix hops aboard Sifu film adaptation, assigns a screenwriter