Using MarcEdit and OpenRefine to regularize MARC 510 fields: Difference between revisions
(Saving work in progress) |
m (saving work in progress) |
||
Line 2: | Line 2: | ||
# Open MarcEdit and select OpenRefine Data Transfer | # Open MarcEdit and select OpenRefine Data Transfer | ||
# Use "Export to OpenRefine" to convert a .mrc file into a .tsv file ( | # Use "Export to OpenRefine" to convert a .mrc file into a .tsv file (if the .mrc file is very large, you may need to split it before continuing, otherwise or OpenRefine will run out of memory when attempting to create the project) | ||
# Open OpenRefine and select "Create project" | # Open OpenRefine and select "Create project" | ||
# Import the .tsv file with these settings | # Import the .tsv file with these settings |
Revision as of 21:15, 28 November 2022
This article describes how to use MarcEdit and OpenRefine to regularize citation forms extracted MARC records. For more on using OpenRefine with library data, see the Library Carpentry: OpenRefine online lesson at https://librarycarpentry.org/lc-open-refine/
- Open MarcEdit and select OpenRefine Data Transfer
- Use "Export to OpenRefine" to convert a .mrc file into a .tsv file (if the .mrc file is very large, you may need to split it before continuing, otherwise or OpenRefine will run out of memory when attempting to create the project)
- Open OpenRefine and select "Create project"
- Import the .tsv file with these settings
- Character encoding: UTF-8,
- Columns are separated by: tabs (TSV)
- Parse next 1 line(s) as column headers
- Store blank rows