Libraries & Archivists Are Digitizing 480,000 Books Published in 20th Century That Are Secretly in the Public Domain

in Archives, Books, Internet Archive | September 27th, 2019

Image by Jason “Textfiles” Scott, via Wikimedia Commons

All books in the public domain are free. Most books in the public domain are, by definition, on the old side, and a great many aren’t easy to find in any case. But the books now being scanned and uploaded by libraries aren’t quite so old, and they’ll soon get much easier to find. They’ve fallen through a loophole because their copyright-holders never renewed their copyright, but until recently the technology wasn’t quite in place to reliably identify and digitally store them.

Now, though, as Vice’s Karl Bode writes, “a coalition of archivists, activists, and libraries are working overtime to make it easier to identify the many books that are secretly in the public domain, digitize them, and make them freely available online to everyone.” These were published between 1923 and 1964, and the goal of this digitization project is to upload all of these surprisingly out-of-copyright books to the Internet Archive, a glimpse of whose book-scanning operation appears above.

“Historically, it’s been fairly easy to tell whether a book published between 1923 and 1964 had its copyright renewed, because the renewal records were already digitized,” writes Bode. “But proving that a book hadn’t had its copyright renewed has historically been more difficult.” You can learn more about what it takes to do that from this blog post by New York Public Library Senior Product Manager Sean Redmond, who first crunched the numbers and estimated that 70 percent of the titles published over those 41 years may now be out of copyright: “around 480,000 public domain books, in other words.”

The first important stage is the conversion of copyright records into the XML format, a large part of which the New York Public Library has recently completed. Bode also mentions a software developer and science fiction author named Leonard Richardson who has written Python scripts to expedite the process (including a matching script to identify potentially non-renewed copyrights in the Internet Archive collection) and a bot that identifies newly discovered secretly public-domain books daily. Richardson himself underscores the necessity of volunteers to take on tasks like seeking out a copy of each such book, “scanning it, proofing it, then putting out HTML and plain-text editions.”

This work is now happening at American libraries and among volunteers from organizations like Project Gutenberg. The Internet Archive’s Jason Scott has also pitched in with his own resources, recently putting out a call for more help on the “very boring, VERY BORING (did I mention boring)” project of determining “which books are actually in the public domain to either surface them on @internetarchive or help make a hitlist.” Of course, many more obviously stimulating tasks exist even in the realm of digital archiving. But then, each secretly public-domain book identified, found, scanned, and uploaded brings humanity’s print and digital civilizations one step closer together. Whatever comes out of that union, it certainly won’t be boring.

via Vice

11,000 Digitized Books From 1923 Are Now Available Online at the Internet Archive

British Library to Offer 65,000 Free eBooks

Download for Free 2.6 Million Images from Books Published Over Last 500 Years on Flickr

Free: You Can Now Read Classic Books by MIT Press on Archive.org

The Library of Congress Launches the National Screening Room, Putting Online Hundreds of Historic Films

Based in Seoul, Colin Marshall writes and broadcasts on cities, language, and culture. His projects include the book The Stateless City: a Walk through 21st-Century Los Angeles and the video series The City in Cinema. Follow him on Twitter at @colinmarshall or on Faceboo k.

by Colin Marshall | Permalink | Make a Comment ( None ) |

1,100 Classic Arcade Machines Added to the Internet Arcade: Play Them Free Online

in Internet Archive, Video Games | September 27th, 2018

Once we could hardly imagine such things as video games. Then, all of a sudden, they appeared, though for years we had to go out to bars — and later, purpose-built “arcades” filled with video game machines — in order to play them, and we paid money to do so. When they came into our homes in the form of consoles we could hook up to our television sets, we at first felt only disappointment: these versions of Space Invaders, Donkey Kong, and Defender neither looked nor felt much like the originals into which we’d pumped so many coins. But only now that the technology in our homes has long since surpassed most of the technology outside them can we play faithful reproductions of all our old favorite games without going out to the arcade.

Not that many arcades still stand, although the Internet Archive has made up for that absence by building the Internet Arcade, which we previously featured here on Open Culture a few years ago. Having made it possible for us to play an enormous variety of classic arcade games free in our web browsers, the Internet Archive looks on its way to creating not just the largest arcade in existence but an infinite arcade, the kind that Borges would have imagined had he grown up in the video-game age. Just last week, developments in the software that powers it allowed Internet Archive to add more than a thousand new machines to the Internet Arcade, from games for which we had to wait in line back in the day to obscurities on which few of us have ever even laid eyes, let alone hands, before.

“The majority of these newly-available games date to the 1990s and early 2000s, as arcade machines both became significantly more complicated and graphically rich,” writes the Internet Archive’s Jason Scott, “while also suffering from the ever-present and home-based video game consoles that would come to dominate gaming to the present day. Even fervent gamers might have missed some of these arcade machines when they were in the physical world, due to lower distribution numbers and shorter times on the floor.” You can explore the new wing of the Internet Arcade here, some of whose most popular games include Puzzle Bobble (better known in the West as Bust-a-Move), X‑Men, Metal Slug 5, Teenage Mutant Ninja Turtles: Turtles in Time, and Street Fighter Alpha 2. Maybe their sound and graphics no longer wow us as once they did, but the years have done nothing to diminish their fun factor — and for many of us, not having to spend our quarters will always be a feeling to savor.

Free: Play 2,400 Vintage Computer Games in Your Web Browser

Play a Collection of Classic Handheld Video Games at the Internet Archive: Pac-Man, Donkey Kong, Tron and MC Hammer

Based in Seoul, Colin Marshall writes and broadcasts on cities and culture. His projects include the book The Stateless City: a Walk through 21st-Century Los Angeles and the video series The City in Cinema. Follow him on Twitter at @colinmarshall or on Faceboo k.

by Colin Marshall | Permalink | Make a Comment ( 2 ) |

200,000+ Vintage Records Being Digitized & Put Online by the Boston Public Library

in Archives, History, Internet Archive, Music | October 17th, 2017

It may be a great irony that our age of cultural destruction and—many would argue—decline also happens to be a golden age of preservation, thanks to the very new media and big data forces credited with dumbing things down. We spend ample time contemplating the losses; archival initiatives like The Great 78 Project, like so many others we regularly feature here, should give us reasons to celebrate.

In a post this past August, we outlined the goals and methods of the project. Centralized at the Internet Archive—that magnanimous citizens’ repository of digitized texts, recordings, films, etc.—the project contains several thousand carefully preserved 78rpm recordings, which document the distinctive sounds of the early 20th century from 1898 to the late-1950s.

Thanks to partners like preservation company George Blood, L.P. and the ARChive of Contemporary Music, we can hear many thousands of records from artists both famous and obscure in the original sound of the first mass-produced consumer audio format.

Just a few days ago, the Internet Archive announced that they would be joined in the endeavor by the Boston Public Library, who, writes Wendy Hanamura, “will digitize, preserve” and make available to the public “hundreds of thousands of audio recordings in a variety of historical formats,” including not only 78s, but also LP’s and Thomas Edison’s first recording medium, the wax cylinder. “These recordings have never been circulated and were in storage for several decades, uncatalogued and inaccessible to the public.”

The process, notes WBUR, “could take a few years,” given the sizable bulk of the collection and the meticulous methods of the Internet Archive’s technicians, who labor to preserve the condition of the often fragile materials, and to produce a number of different versions, “from remastered to raw.” The object, says Boston Public Library president David Leonard, is to “produce recordings in a way that’s interesting to the casual listener as well as to the hard-core music listener in the research business.”

Thus far, only two recordings from BPL’s extensive collections have become available—a 1938 recording called “Please Pass the Biscuits, Pappy (I Like Mountain Music)” by W. Lee O’Daniel and His Hillbilly Boys and Edvard Grieg’s only piano concerto, recorded by Freddy Martin and His Orchestra in 1947. Even in this tiny sampling, you can see the range of material the archive will feature, consistent with the tremendous variety the Great 78 Project already contains.

While we can count it as a great gain to have free and open access to this historic vault of recorded audio, it is also the case that digital archiving has become an urgent bulwark against total loss. Current recording formats instantly spawn innumerable copies of themselves. The physical media of the past existed in finite numbers and are subject to total erasure with time. “The simple fact of the matter,” archivist George Blood tells the BPL, “is most audiovisual recordings will be lost. These 78s are disappearing left and right. It is important that we do a good job preserving what we can get to, because there won’t be a second chance.”

via WBUR

The British Library’s “Sounds” Archive Presents 80,000 Free Audio Recordings: World & Classical Music, Interviews, Nature Sounds & More

BBC Launches World Music Archive

Josh Jones is a writer and musician based in Durham, NC. Follow him at @jdmagness

by Josh Jones | Permalink | Make a Comment ( None ) |

Apple’s Hypercard Software, the Innovative 1980s Precursor to Hypertext, Now Made Available by Archive.org

in Apple, Internet Archive, Software | August 14th, 2017

Archive.org is on a bit of a roll lately. After recently making available 25,000+ digitized 78rpm records from the early 20th century, they’ve turned around and put online Apple Hypercard software. When Hypercard was released in 1987, The New York Times published an article entitled “Apple to Introduce Unusual Software,” which began:

Apple Computer Inc. will introduce an unusual database and management information program Tuesday that the company hopes will help it maintain its lead in technology for making computers easy to use.

The new software, known as Hypercard, will enable users of Apple’s Macintosh computers to organize information on computerized file cards that can be linked to other file cards in intricate ways. The program will be included for no charge with each Macintosh sold, starting this month.

Hypercard made its appearance precisely when Apple also released “a communications device, known as a modem, that will enable the Macintosh to send documents to and from facsimile machines.” Some of us still use modems today. Hypercard, not so much. At least not directly.

As Hypercard’s creator Bill Atkinson indicates above, Hypercard started working with the hypertext concept that’s now prevalent on the web today. Think those links you find in HTML. On Archive.org, you can find and play with Hypercard software, or what they call emulated Hypercard stacks. (They also host a library of emulated software for the early Macintosh computer). Read more about Archive.org’s Hypercard project on their blog here.

If you would like to sign up for Open Culture’s free email newsletter, please find it here. It’s a great way to see our new posts, all bundled in one email, each day.

If you would like to support the mission of Open Culture, consider making a donation to our site. It’s hard to rely 100% on ads, and your contributions will help us continue providing the best free cultural and educational materials to learners everywhere. You can contribute through PayPal, Patreon, and Venmo (@openculture). Thanks!

Related Content:

Free Online Computer Science Courses

Free: You Can Now Read Classic Books by MIT Press on Archive.org

How Brewster Kahle and the Internet Archive Will Preserve the Infinite Information on the Web

Run Vintage Video Games (From Pac-Man to E.T.) and Software in Your Web Browser, Thanks to Archive.org

The Internet Arcade Lets You Play 900 Vintage Video Games in Your Web Browser (Free)

by OC | Permalink | Make a Comment ( None ) |

25,000+ 78RPM Records Now Professionally Digitized & Streaming Online: A Treasure Trove of Early 20th Century Music

in History, Internet Archive, Music | August 9th, 2017

Every recording medium works as a metonym for its era: the term “LP” conjures up associations with a broad musical period of classic rock ‘n’ roll, soul, doo-wop, R&B, funk, jazz, disco etc.; we talk of the “CD era,” dominated by dance music and hip-hop; the 45 makes us think of jukeboxes, diners, and sock-hops; and the cassette, well… at least one subgenre of music, what John Peel called “shambling,” jangly, lo-fi pop, came to be known by the name “C86,” the title of an NME compilation, short for “Cassette, 1986.” (Readers of the magazine had to clip coupons and send money by postal mail to receive a copy of the tape.)

Soon, however, fewer and fewer people will remember the age of the 78rpm record, the preferred vehicle for the music of the early 20th century. From classical and opera to blues, bluegrass, swing, ragtime, gospel, Hawaiian, and holiday novelties the 78 epitomizes the sounds of its heyday as much as any of the media mentioned above.

While cassettes recently made a nostalgic comeback, and turntables are found in every big box store, we’re generally not equipped to play back 78s. These are brittle records made from shellac, a resin secreted by beetles. They were often played on appliances that doubled as quality parlor furniture.

Thanks now to the Internet Archive, that stalwart of digital cataloguing and curation, we can play twenty five thousand 78s and immerse ourselves in the early 20th century, whether for research purposes or pure enjoyment. Previous efforts at preservation have “restored or remastered… commercially viable recordings” on LP or CD, writes The Great 78 Project, the archive’s volunteer program to digitize musical history. The current effort seeks to go beyond popularity and collect everything, from the rarest and strangest to the already historic. “I want to know what the early 20th century sounded like,” writes Internet Archive founder Brewster Kahle, “Midwest, different countries, different social classes, different immigrant communities and their loves and fears.”

You can hear several selections here, and thousands more at this archive of 78s uploaded by audio-visual preservation company, George Blood, L.P. Other 78rpm archives from volunteer collectors and the ARChive of Contemporary Music are being digitized and uploaded as well. You’ll note the recordings are often submerged in crackle and hiss, and generally lack bass and treble (most playback systems of the time could not reproduce the lower and higher ends of the audible spectrum). “We have preserved the often very prominent surface noise and imperfections,” the Archive writes, “and included files generated by different sizes and shapes of stylus to facilitate different kinds of analysis.” Different playback systems could produce markedly different sounds, and the recordings were not always strictly 78rpm.

These conditions of the transfer ensure that we roughly hear what the first audiences heard, though the records’ age and our penchant for 7 speaker audio systems introduce some new variables. None of these recordings were even made in stereo. The 78 period, notes Yale Library, lasted between 1898 and the late 1950s, when the 33 1/2 rpm long-playing record fully edged out the older model. For approximately fifty years, these records carried recorded music, sound, and speech into homes around the world. “What is this?” Kahle asks of this formidable digitization project. “A reference collection? A collector’s dream? A discovery radio station? The soundtrack of the early 20th century?” All of the above. To learn more about The Great 78 Project, including the technical details of the transfer and how you can carefully package up and mail in your own 78rpm records, visit their Preservation page.

h/t @Ferdinand77

Related Content:

BBC Launches World Music Archive

The British Library’s “Sounds” Archive Presents 80,000 Free Audio Recordings: World & Classical Music, Interviews, Nature Sounds & More

DC’s Legendary Punk Label Dischord Records Makes Its Entire Music Catalog Free to Stream Online

Josh Jones is a writer and musician based in Durham, NC. Follow him at @jdmagness

by Josh Jones | Permalink | Make a Comment ( 2 ) |

Free: You Can Now Read Classic Books by MIT Press on Archive.org

in Books, Internet Archive, MIT | August 4th, 2017

FYI. At the end of May, Archive.org announced this on its blog:

For more than eighty years, MIT Press has been publishing acclaimed titles in science, technology, art and architecture. Now, thanks to a new partnership between the Internet Archive and MIT Press, readers will be able to borrow these classics online for the first time. With generous support from Arcadia, a charitable fund of Peter Baldwin and Lisbet Rausing, this partnership represents an important advance in providing free, long-term public access to knowledge.

“These books represent some of the finest scholarship ever produced, but right now they are very hard to find,” said Brewster Kahle, founder and Digital Librarian of the Internet Archive. “Together with MIT Press, we will enable the patrons of every library that owns one of these books to borrow it online–one copy at a time.”

This joint initiative is a crucial early step in Internet Archive’s ambitious plans to digitize, preserve and provide public access to four million books, by partnering widely with university presses and other publishers, authors, and libraries.…

We will be scanning an initial group of 1,500 MIT Press titles at Internet Archive’s Boston Public Library facility, including Cyril Stanley Smith’s 1980 book, From Art to Science: Seventy-Two Objects Illustrating the Nature of Discovery, and Frederick Law Olmsted and Theodora Kimball’s Forty Years of Landscape Architecture: Central Park, which was published in 1973. The oldest title in the group is Arthur C. Hardy’s 1936 Handbook of Colorimetry.

Throughout the summer, we’ve been checking in, waiting for the first MIT Press books to hit Archive.org’s virtual shelves. They’re now starting to arrive. Click here to find the beginnings of what promises to be a much larger collection.

As Brewster Kahle (founder of Internet Archive) explained it to Library Journal, his organization is “basically trying to wave a wand over everyone’s physical collections and say, Blink! You now have an electronic version that you can use” in whatever way desired, assuming its permitted by copyright. In the case of MIT Press, it looks like you can log into Archive.org and digitally borrow their electronic texts for 14 days.

Archive.org hopes to digitize 1,500 MIT Press classics by the end of 2017. Digital collections from other publishing houses seem sure to follow.

If you would like to sign up for Open Culture’s free email newsletter, please find it here. It’s a great way to see our new posts, all bundled in one email, each day.

If you would like to support the mission of Open Culture, consider making a donation to our site. It’s hard to rely 100% on ads, and your contributions will help us continue providing the best free cultural and educational materials to learners everywhere. You can contribute through PayPal, Patreon, and Venmo (@openculture). Thanks!

An Archive of 3,000 Vintage Cookbooks Lets You Travel Back Through Culinary Time

Enter a Huge Archive of Amazing Stories, the World’s First Science Fiction Magazine, Launched in 1926

by OC | Permalink | Make a Comment ( 4 ) |

The Metropolitan Museum of Art Makes 140,000+ Artistic Images from Its Collections Available on Archive.org

in Art, Internet Archive, Museums | June 28th, 2017

As an Open Culture reader, you might already know the Internet Archive, often simply called “Archive.org,” as an ever expanding trove of wonders, freely offering everything from political TV ads to vintage cookbooks to Grateful Dead concert recordings to the history of the internet itself. You might also know the Metropolitan Museum of Art as not just a building on Fifth Avenue, but a leading digital cultural institution, one willing and able to make hundreds of art books available to download and hundreds of thousands of fine-art images usable and remixable under a Creative Commons license.

Now, the Internet Archive and the Metropolitan Museum of Art have teamed up to bring you a collection of over 140,000 art images gathered by the latter and organized and hosted by the former.

Most every digital vault in the Internet Archive offers a cultural and historical journey within, but the collaboration with the Metropolitan Museum of Art offers an especially deep one, ranging historically from early 19th-century India (The Pleasures of the Hunt at the top of the post) to midcentury New York (the photo of the mighty locomotive before the entrance to the 1939 World’s Fair above) and, in either direction, well beyond.

Culturally speaking, you can also find in the Met’s collection in the Internet Archive everything from from Japanese interpretations of French photography (the woodblock print French Photographer above) to the Belgian interpretation of Anglo-American cinema (the poster design for Charlie Chaplin’s Play Day below). You can dial in on your zone of interest by using the “Topics & Subjects,” whose hundreds of filterable options include, to name just a few, such categories as Asia, wood, fragments, London, folios, and underwear.

The collection also contains works of the masters, such as Vincent van Gogh’s 1887 Self-Portrait with Straw Hat (as well as its obverse, 1885’s The Potato Peeler), and some of the world’s great vistas, including Francesco Guardi’s 1765 rendering of Venice from the Bacino di San Marco. If you’d like to see what in the collection has drawn the attention of most of its browsers so far, sort it by view count: those at work should beware that nudes and other erotically charged artworks predictably dominate the rankings, but they do it alongside Naruto Whirlpool, the Philosopher’s Stone, and Albert Einstein. Human interest, like human creativity, always has a surprise or two in store.

Download 464 Free Art Books from The Metropolitan Museum of Art

1.8 Million Free Works of Art from World-Class Museums: A Meta List of Great Art Available Online

Based in Seoul, Colin Marshall writes and broadcasts on cities and culture. He’s at work on the book The Stateless City: a Walk through 21st-Century Los Angeles, the video series The City in Cinema, the crowdfunded journalism project Where Is the City of the Future?, and the Los Angeles Review of Books’ Korea Blog. Follow him on Twitter at @colinmarshall or on Faceboo k.

by Colin Marshall | Permalink | Make a Comment ( None ) |

Download 200+ Free Modern Art Books from the Guggenheim Museum

in Art, Books, Internet Archive | April 26th, 2017

For at least half a decade now, New York’s Solomon R. Guggenheim Museum has been digitizing its exhibition catalogs and other art books. Now you can find all of the publications made available so far — not just to read, but to download in PDF and ePub formats — at the Internet Archive. If you’ve visited the Guggenheim’s non-digital location on Fifth Avenue even once, you know how much effort the institution puts toward the preservation and presentation of modern art, and that comes through as much in its printed material as it does in its shows.

Among the more than 200 Guggenheim art books available on the Internet Archive, you’ll find one on a 1977 retrospective of Color Field painter Kenneth Noland, one on the ever-vivid icon-making pop artist Roy Lichtenstein, and one on the existential slogans — “MONEY CREATES TASTE,” “PROTECT ME FROM WHAT I WANT,” “LACK OF CHARISMA CAN BE FATAL” — slyly, digitally inserted into the lives of thousands by Jenny Holzer. Other titles, like Expressionism, a German Intuition 1905–1920, From van Gogh to Picasso, from Kandinsky to Pollock, and painter Wassily Kandinsky’s own Point and Line to Plane, go deeper into art history.

Where to start amid all these books of modern (and even some of pre-modern) art? You might consider first having a look at the books in the Internet Archive’s Guggenheim collection about the Guggenheim itself: the handbook to its collection up through 1980, for instance, or 1991’s Masterpieces from the Guggenheim Collection: From Picasso to Pollock, or the following year’s Guggenheim Museum A to Z, or Art of this Century: The Guggenheim Museum and its Collection from the year after that. But just as when you pay a visit to the Guggenheim itself, you shouldn’t worry too much about what order you see everything in; the important thing is to look with interest.

Explore the collection of 200+ art books and catalogues here.

If you would like to sign up for Open Culture’s free email newsletter, please find it here. It’s a great way to see our new posts, all bundled in one email, each day.

If you would like to support the mission of Open Culture, consider making a donation to our site. It’s hard to rely 100% on ads, and your contributions will help us continue providing the best free cultural and educational materials to learners everywhere. You can contribute through PayPal, Patreon, and Venmo (@openculture). Thanks!

Download Over 300+ Free Art Books From the Getty Museum

The Guggenheim Puts Online 1600 Great Works of Modern Art from 575 Artists

Based in Seoul, Colin Marshall writes and broadcasts on cities and culture. He’s at work on a book about Los Angeles, A Los Angeles Primer, the video series The City in Cinema, the crowdfunded journalism project Where Is the City of the Future?, and the Los Angeles Review of Books’ Korea Blog. Follow him on Twitter at @colinmarshall or on Faceboo k.

by Colin Marshall | Permalink | Make a Comment ( 1 ) |

Libraries & Archivists Are Digitizing 480,000 Books Published in 20th Century That Are Secretly in the Public Domain

1,100 Classic Arcade Machines Added to the Internet Arcade: Play Them Free Online

200,000+ Vintage Records Being Digitized & Put Online by the Boston Public Library

Apple’s Hypercard Software, the Innovative 1980s Precursor to Hypertext, Now Made Available by Archive.org

25,000+ 78RPM Records Now Professionally Digitized & Streaming Online: A Treasure Trove of Early 20th Century Music

Free: You Can Now Read Classic Books by MIT Press on Archive.org

The Metropolitan Museum of Art Makes 140,000+ Artistic Images from Its Collections Available on Archive.org

Download 200+ Free Modern Art Books from the Guggenheim Museum

Essentials

Support Us

Free Courses

Receive our Daily Email

FREE UPDATES!

GET OUR DAILY EMAIL

Free Movies

Free Language Lessons

Free eBooks

Free Audio Books

Free Textbooks

K-12 Resources

Free Art & Images

Free Music

Writing Tips

Archive

Personal Finance

Categories

Great Lectures

Sign up for Newsletter

About Us

Great Recordings

Book Lists By

Syllabi

Favorite Movies

Archives

Search