Using MarcEdit and OpenRefine to regularize MARC 510 fields: Difference between revisions

(Saving work in progress)
 
m (saving work in progress)
Line 2: Line 2:


# Open MarcEdit and select OpenRefine Data Transfer
# Open MarcEdit and select OpenRefine Data Transfer
# Use "Export to OpenRefine" to convert a .mrc file into a .tsv file (Folger practice: Save File filename is same as Source File except with "-OpenRefine.tsv" instead of ".mrc")
# Use "Export to OpenRefine" to convert a .mrc file into a .tsv file (if the .mrc file is very large, you may need to split it before continuing, otherwise or  OpenRefine will run out of memory when attempting to create the project)
# Open OpenRefine and select "Create project"
# Open OpenRefine and select "Create project"
# Import the .tsv file with these settings
# Import the .tsv file with these settings

Revision as of 21:15, 28 November 2022

This article describes how to use MarcEdit and OpenRefine to regularize citation forms extracted MARC records. For more on using OpenRefine with library data, see the Library Carpentry: OpenRefine online lesson at https://librarycarpentry.org/lc-open-refine/

  1. Open MarcEdit and select OpenRefine Data Transfer
  2. Use "Export to OpenRefine" to convert a .mrc file into a .tsv file (if the .mrc file is very large, you may need to split it before continuing, otherwise or OpenRefine will run out of memory when attempting to create the project)
  3. Open OpenRefine and select "Create project"
  4. Import the .tsv file with these settings
    1. Character encoding: UTF-8,
    2. Columns are separated by: tabs (TSV)
    3. Parse next 1 line(s) as column headers
    4. Store blank rows