Harvard Releases Big Data for Books

NYTimes – “Harvard’s library is making public the information on more than 12 million books, videos, audio recordings, images, manuscripts, maps, and more things inside Harvard’s 73 libraries. Harvard can’t put the actual content of much of this material online, owing to intellectual property laws, but this so-called metadata of things like titles, publication or recording dates, book sizes, or descriptions of what is in the video is also considered highly valuable material. Frequently descriptors of things like audio recordings are more valuable for search engines than the material itself. Search engines frequently rely on metadata over content, particularly when it can’t easily be scanned and understood. Harvard is hoping other libraries allow access to the metadata on their volumes, which could be the start of a large and unique repository of intellectual information.”

Comments are closed.

© Copyright 2014, Information Today, Inc., All rights reserved.