Additional 5,000 articles deposited to CDR

Last month, we added an additional 5000 UNC-authored articles to the CDR. These articles were published from 1980-2015 and are primarily focused on science, technology and mathematics. These articles are made available via three different sources: publisher permission, PubMed Europe or through an open license.

As with the previous batches of articles, this batch was also identified through a report from 1Science, which the Libraries purchased in May 2018. We used the lessons that we learned from the previous batches to load the content quickly, but we once again had to do a lot of work in order to load the articles into the CDR, including:

  • Rewriting portions of the download script to prevent overwriting of file names
  • Identifying and obtaining missing or incorrect metadata
  • Normalizing metadata, including mapping author affiliations to the CDR’s internal department list
  • Identifying embargo periods
  • Writing a script to ingest the articles into the CDR
  • Multiple quality assurance checks

This list represents work from the Repository Services, Software Development, and Infrastructure Management departments. Next, we will need to look at individual permissions for the remaining articles and load them via small batches, which will be a much slower process.

These articles are only one component of our program to increase the number of scholarly articles in the Carolina Digital Repository and support the Libraries Sustainable Scholarship initiative. Read more about our Content Liberation initiatives! If you’d like to deposit your work in the CDR, please contact us!

Leave a Reply

Your email address will not be published. Required fields are marked *

*