Pro@programming.devM to AI - Artificial intelligence@programming.devEnglish · 16 days agoHarvard releases Institutional Books 1.0, a dataset for AI researchers with 242B tokens, from 394M scanned pages and 983K public domain books in 254 languageswww.institutionaldatainitiative.orgexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkHarvard releases Institutional Books 1.0, a dataset for AI researchers with 242B tokens, from 394M scanned pages and 983K public domain books in 254 languageswww.institutionaldatainitiative.orgPro@programming.devM to AI - Artificial intelligence@programming.devEnglish · 16 days agomessage-square0linkfedilink