Using MarcEdit and OpenRefine to regularize MARC 510 fields

Revision as of 21:15, 28 November 2022 by ErinBlake (talk | contribs) (saving work in progress)

This article describes how to use MarcEdit and OpenRefine to regularize citation forms extracted MARC records. For more on using OpenRefine with library data, see the Library Carpentry: OpenRefine online lesson at https://librarycarpentry.org/lc-open-refine/

  1. Open MarcEdit and select OpenRefine Data Transfer
  2. Use "Export to OpenRefine" to convert a .mrc file into a .tsv file (if the .mrc file is very large, you may need to split it before continuing, otherwise or OpenRefine will run out of memory when attempting to create the project)
  3. Open OpenRefine and select "Create project"
  4. Import the .tsv file with these settings
    1. Character encoding: UTF-8,
    2. Columns are separated by: tabs (TSV)
    3. Parse next 1 line(s) as column headers
    4. Store blank rows