Author: Pedro Gonzalez-Fernandez

Reflecting On a Year of Selected Datasets

Introduction The Selected Datasets Collection was publicly launched June 2020 as part of the Library’s ongoing efforts to support emerging data-driven styles of research. Since then, our initial offering of twenty datasets has grown to nearly 200 unique items, and we’ve continued to refine the technical workflows by which content is prepared and delivered to […]

Supporting the Acquisition of Openly-Available e-Serials from the Duplicate Materials Exchange Program: An Interview with Junior Fellow Alex Reese

For thirty years the Library of Congress has offered undergraduate and graduate students from across the country the opportunity to work on projects focused on expanding access to and use of the Library’s collections. As a result of the COVID-19 pandemic, the Junior Fellows program continued to be entirely virtual in 2021. The Digital Content […]

Exploring Openly-Licensed e-Serials from the Directory of Open Access Journals: An Interview with Junior Fellow Emmeline Kaser

For thirty years the Library of Congress has offered undergraduate and graduate students from across the country the opportunity to work on projects focused on expanding access to and use of the Library’s collections. As a result of the COVID-19 pandemic, the Junior Fellows program continued to be entirely virtual in 2021. The Digital Content […]

Passing the Mic to Our Audience: User Personas and Strategic Planning for the Sustainability of Digital Formats Website

This is a guest post written by Hilary Szu Yin Shiue and Jacob Kowall, 2021 Junior Fellows in the Digital Content Management & Services (DCMS) Division under the mentorship of Kate Murray, Digital Projects Coordinator. Hilary and Jacob assisted in updating and expanding the Sustainability of Digital Formats website, which provides information and analysis on […]

All Hyped Up for HyperCard: Further Adventures with an Apple Legacy Format

This is a guest post written by Jacob Kowall and Hilary Szu Yin Shiue, 2021 Junior Fellows in the Digital Collections Management & Services Division (DCMS) under the mentorship of Kate Murray, Digital Projects Coordinator. Jacob and Hilary assisted in updating and expanding the Sustainability of Digital Formats website, which provides information and analysis on […]

RFS 2.0 – A Year On

Today’s guest post is from Kate Murray (Digital Projects Coordinator, Digital Collections Management & Services Division), Marcus Nappier (Digital Collections Specialist, Digital Content Management Section), and Ted Westervelt (Chief, US/Anglo Division) at the Library of Congress. Introduction As the Library of Congress expands its digital collecting activities, the Recommended Formats Statement (RFS) has revised its […]

Selected Datasets: A New Library of Congress Collection

Friends, data wranglers, lend me your ears; The Library of Congress’ Selected Datasets Collection is now live! You can now download datasets of the Simple English Wikipedia, the Atlas of Historical County Boundaries, sports economic data, half a million emails from Enron, and urban soil lead abatement from this online collection. This initial set of […]

Earth Day 2020 Has Gone Digital

This is a guest post by Jennifer “JJ” Harbster, Head of the Science Reference Section in the Library’s Science, Technology and Business Division. She had her first taste of web archiving with the Internet Archive’s collaborative project documenting Hurricane Katrina and went on to lead the Science Blogs Web Archive. On April 22, 2020 we […]

More Open eBooks: Routinizing Open Access eBook Workflows

This is a guest post by Kristy Darby, a Digital Collections Specialist in the Digital Content Management Section in Library Services. We are excited to share that anyone anywhere can now access a growing online collection of contemporary open access eBooks from the Library of Congress website. For example, you can now directly access books […]

The Magnificent Seven: Looking Back on a Year of Exploring the Web Archives Datasets

It has been just over a year since we kicked off a deep dive into the Library of Congress Web Archives on the Signal! Now at over 2 petabytes, the web archives are a complex aggregation of interrelated web objects that make up the internet as we know it (images, text, code, audio, video, etc.). […]