FromThePage transcription conventions: Difference between revisions
No edit summary |
|||
(16 intermediate revisions by the same user not shown) | |||
Line 7: | Line 7: | ||
<li><i>Blank pages: </i>If the page is truly blank, check the '''Mark as blank''' checkbox above the transcription window, save, and move on. If the image includes any text, including, for instance, titles or dates tooled onto the binding, text on a bookplate, or a call number penciled in by a librarian, transcribe the text and do not mark as blank.</li> | <li><i>Blank pages: </i>If the page is truly blank, check the '''Mark as blank''' checkbox above the transcription window, save, and move on. If the image includes any text, including, for instance, titles or dates tooled onto the binding, text on a bookplate, or a call number penciled in by a librarian, transcribe the text and do not mark as blank.</li> | ||
<li><i>Line Breaks: </i>Hit '''Return''' once after each line ends. Two returns indicate a new entry/paragraph break. If a word breaks across a line, place the hyphen, equal sign, or other word splitting character in a soft break tag: <lb break="no"></lb></li> | <li><i>Line Breaks: </i>Hit '''Return''' once after each line ends. Two returns indicate a new entry/paragraph break. If a word breaks across a line, place the hyphen, equal sign, or other word splitting character in a soft break tag: <lb break="no"></lb></li> | ||
<li><i>Recipe titles and headings: </i>Recipe titles and other headings: | <li><i>Page Breaks: </i>If two pages (an opening) is shown, transcribe the first page, then use a page break tag: <pb/> and transcribe the second.</li> | ||
<li><i>Recipe titles and headings: </i>Recipe titles and other headings: Wrap each heading, including recipe titles, in a headings tag (<head>To make bread</head>). Add an extra line break before and after each heading.</li> | |||
<li><i>Line fillers and dividers: </i>Transcribe text only. Do not include lines that sometimes appear to divide sections, or characters repeated over and over to justify the line.</li> | <li><i>Line fillers and dividers: </i>Transcribe text only. Do not include lines that sometimes appear to divide sections, or characters repeated over and over to justify the line.</li> | ||
</ul> | </ul> | ||
==FromThePage vetting instructions== | |||
Pages will be designated as ready for review after a transcriber has marked that page as done. To access the pages in need of vetting, click the "Pages That Need Review" button on the Read tab. | |||
Once you've selected a page to review, finish the transcription if it's not already complete, review the transcription to make sure it's correct, adjust the formatting as necessary to match our conventions, and add or correct all necessary tags. | |||
When finished, click save and then approve. | |||
==Tags available as buttons on FromThePage editor== | ==Tags available as buttons on FromThePage editor== | ||
Line 43: | Line 52: | ||
:<pre>shipp <gap>...</gap></pre> | :<pre>shipp <gap>...</gap></pre> | ||
:[[File:Gap.PNG|thumb|left]] | :[[File:Gap.PNG|thumb|left]] | ||
<br clear="all"> | |||
;Heading | |||
:Used for obvious headings. | |||
:<pre><head></head></pre> | |||
:<pre><head>A Ham.</head></pre> | |||
:[[File:Heading.PNG|thumb|left]] | |||
<br clear="all"> | <br clear="all"> | ||
Line 129: | Line 146: | ||
;Heading | ;Heading | ||
:Used for obvious headings. | :Used for obvious headings. | ||
:<pre> | :<pre><head></head></pre> | ||
:<pre><head>A Ham.</head></pre> | |||
:<pre> | |||
:[[File:Heading.PNG|thumb|left]] | :[[File:Heading.PNG|thumb|left]] | ||
Line 191: | Line 207: | ||
==Common abbreviations and expansions== | ==Common abbreviations and expansions== | ||
:'''lb''' = pound | |||
:'''℥''' = ounce | |||
:'''f℥''' = fluid ounce | |||
:'''ʒ''' = dram, drachm | |||
:'''℈''' = scruple | |||
:'''-ā/-an''' | :'''-ā/-an''' | ||
Line 197: | Line 223: | ||
</pre> | </pre> | ||
:'''-<sup>r</sup>/-er''', as in | :'''-<sup>r</sup>/-er''', as in rememb<sup>r</sup>/remember: expand "e," silently lower "r" | ||
<pre style="min-height:38px; margin-left:2em"> | <pre style="min-height:38px; margin-left:2em"> | ||
<expan>e</expan>r | <expan>e</expan>r | ||
Line 267: | Line 293: | ||
</pre> | </pre> | ||
== | ==Table-based transcription in FromThePage== | ||
See documentation here: https://content.fromthepage.com/project-owner-documentation/table-encoding/ | |||
If there are column headers, transcribe them first, with each header separated by a space-pipe-space (" | "). The row should begin and end with a pipe character. The column heading separator comes after the header row, and consists of pipes separated by hyphens (" | --- | --- | ") (Using more hyphens may be visually helpful to the transcriber so that the columns line up, but is not necessary.) | |||
The remainder of the table is transcribed the same way as the column header, with each column separated by a space-pipe-space. For an empty cell, just leave a space in between two pipes. | |||
A small table (with a blank cell in row 2, column D) would be encoded as follows: | |||
<pre> | |||
| Heading column A | Heading column B | Heading column C | Heading column D | | |||
| --------------- | --------------- | --------------- | --------------- | | |||
| Row 1, column A | Row 1, column B | Row 1, column C | Row 1, column D | | |||
| Row 2, column A | Row 2, column B | Row 2, column C | | | |||
| Row 3, column A | Row 3, column B | Row 3, column C | Row 3, column D | | |||
</pre> | |||
==FromThePage/Dromio transcription differences== | |||
When we transcribe in Dromio, we capture enough information to generate both a diplomatic and a semi-diplomatic transcription. When we transcribe in FromThePage, we're only concerned with semi-diplomatic transcriptions, which means that there's less that we need to tag. The major differences are as follows (see also: the tag crosswalk below): | |||
* Do not tag macrons in FTP; just add in the expanded letters and mark then with an expan tag. | |||
* Do not tag or transcribe metamarks in FTP. | |||
* Do not tag any silently lowered text in FTP; just silently lower it without a tag. | |||
==FromThePage/Dromio crosswalk== | |||
{| class="wikitable" style="margin-left: auto;" | {| class="wikitable" style="margin-left: auto;" | ||
|- | |- | ||
! Dromio tag | !Dromio tag | ||
! FromThePage tag | !FromThePage tag | ||
! Used for | !Used for | ||
|- | |||
|add | |||
|supplied | |||
|text added later, at the custodial stage | |||
|- | |- | ||
| | |brev-es | ||
| | |<expan>e</expan>s | ||
| | | -es graph | ||
|- | |- | ||
| brev-p | |brev-p | ||
| p<expan>re</expan> (etc.) | |p<expan>re</expan> (etc.) | ||
| special p | |special p | ||
|- | |- | ||
| brev-y | |brev-y | ||
| <expan>th</expan> | |<expan>th</expan> | ||
| thorn letter | |thorn letter | ||
|- | |- | ||
| catchword | |catchword | ||
| catchword | |catchword | ||
| catchwords | |catchwords | ||
|- | |- | ||
| del | |del | ||
| del | |del | ||
| cancelled or crossed out text | |cancelled or crossed out text | ||
|- | |- | ||
| ex | |ex | ||
| expan | |expan | ||
| expansions | |expansions | ||
|- | |- | ||
| gap | |gap | ||
| gap | |gap | ||
| gap in text due to damage, etc. | |gap in text due to damage, etc. | ||
|- | |- | ||
| graphic | |graphic | ||
| figure | |figure | ||
| illustrations, etc. | |illustrations, etc. | ||
|- | |- | ||
| heading | |heading | ||
| | |head | ||
| headings | |headings | ||
|- | |- | ||
| ins | |ins | ||
| add | |add | ||
| intralineal insertions | |intralineal insertions | ||
|- | |- | ||
| label | |label | ||
| marginalia | |marginalia | ||
| marginal notes | |marginal notes | ||
|- | |- | ||
| macron | |macron | ||
| do not tag (expand added character(s) with expan tag) | |do not tag (expand added character(s) with expan tag) | ||
| macrons or tildes above characters | |macrons or tildes above characters | ||
|- | |- | ||
| metamark | |metamark | ||
| do not transcribe | |do not transcribe | ||
| metamarks | |metamarks | ||
|- | |- | ||
| note | |note | ||
| use notes box below the transcription field | |use notes box below the transcription field | ||
| transcribers' notes | |transcribers' notes | ||
|- | |- | ||
| pb | |pb | ||
| pb | |pb | ||
| page break | |page break | ||
|- | |- | ||
| sl | |sl | ||
| do not tag | |do not tag | ||
| superscript text to be silently lowered | |superscript text to be silently lowered | ||
|- | |- | ||
| sup | |sup | ||
| sup | |sup | ||
| superscript text not to be lowered | |superscript text not to be lowered | ||
|- | |- | ||
| unclear | |unclear | ||
| unclear | |unclear | ||
| unclear or uncertain text | |unclear or uncertain text | ||
|} | |} |
Latest revision as of 11:05, 31 March 2022
FromThePage transcription conventions
- Tagging: If you are not comfortable using tags, it's okay to skip them! A reviewer will add them for you.
- Spelling, punctuation, and capitalization: Use original spelling, punctuation, and capitalization.
- Blank pages: If the page is truly blank, check the Mark as blank checkbox above the transcription window, save, and move on. If the image includes any text, including, for instance, titles or dates tooled onto the binding, text on a bookplate, or a call number penciled in by a librarian, transcribe the text and do not mark as blank.
- Line Breaks: Hit Return once after each line ends. Two returns indicate a new entry/paragraph break. If a word breaks across a line, place the hyphen, equal sign, or other word splitting character in a soft break tag: <lb break="no"></lb>
- Page Breaks: If two pages (an opening) is shown, transcribe the first page, then use a page break tag: <pb/> and transcribe the second.
- Recipe titles and headings: Recipe titles and other headings: Wrap each heading, including recipe titles, in a headings tag (<head>To make bread</head>). Add an extra line break before and after each heading.
- Line fillers and dividers: Transcribe text only. Do not include lines that sometimes appear to divide sections, or characters repeated over and over to justify the line.
FromThePage vetting instructions
Pages will be designated as ready for review after a transcriber has marked that page as done. To access the pages in need of vetting, click the "Pages That Need Review" button on the Read tab.
Once you've selected a page to review, finish the transcription if it's not already complete, review the transcription to make sure it's correct, adjust the formatting as necessary to match our conventions, and add or correct all necessary tags.
When finished, click save and then approve.
Tags available as buttons on FromThePage editor
- Addition
- Used for intralineally inserted text, often written above crossed out text.
<add></add>
in a stew pan ^ <add>over a stove</add>
- Deletion
- Used for text that has been cancelled or crossed out. Illegible letters can be replaced by periods.
<del></del>
<del>his</del>
- Expansion
- Used for abbreviated text. Place the tags only around the expanded characters, not around the entire word. Note: when using the shortcut button, FromThePage will include an "abbr" attribute. Users should leave the attribute blank, or may delete it.
<expan></expan>
<expan abbr=""></expan>
Ma<expan>ies</expan>tie
- Gap
- Used for a gap in the transcription for reasons other than deletion (e.g. due to damage to the page). Illegible letters can be replaced by periods.
<gap></gap>
shipp <gap>...</gap>
- Heading
- Used for obvious headings.
<head></head>
<head>A Ham.</head>
- Marginalia
- Used for any kind of marginal note, including footnotes, glosses, or summaries.
<marginalia></marginalia>
<marginalia>to Kensington</marginalia>
- Superscript
- Used for superscript text not to be lowered: only for ordinal numbers and for money (l,s,d).
<sup></sup>
xl<sup>s</sup>
- Unclear
- Used for uncertain text. Transcriber should enter their best guess, with the option to replace individual letters with periods.
<unclear></unclear>
<unclear>a chicin P</unclear>
- Underline
- Used for underlined text.
<u></u>
<u>This text is underlined</u>
All available tags
- Addition
- Used for intralineally inserted text, often written above crossed out text.
<add></add>
in a stew pan ^ <add>over a stove</add>
- Catchword
- Used for catchwords in the text.
<catchword></catchword>
<catchword>being</catchword>
- Deletion
- Used for text that has been cancelled or crossed out. Illegible letters can be replaced by periods.
<del></del>
<del>his</del>
- Expansion
- Used for abbreviated text. Place the tags only around the expanded characters, not around the entire word. Note: when using the shortcut button, FromThePage will include an "abbr" attribute. Users should leave the attribute blank, or may delete it.
<expan></expan>
<expan abbr=""></expan>
Ma<expan>ies</expan>tie
- Figure
- Used for illustrations.
<fig></fig>
<fig>Three small horizontal stems with flowers and buds; left is red and blue, middle is red, right is red and blue.</fig>
- Gap
- Used for a gap in the transcription for reasons other than deletion (e.g. due to damage to the page). Illegible letters can be replaced by periods.
<gap></gap>
shipp <gap>...</gap>
- Heading
- Used for obvious headings.
<head></head>
<head>A Ham.</head>
- Marginalia
- Used for any kind of marginal note, including footnotes, glosses, or summaries.
<marginalia></marginalia>
<marginalia>to Kensington</marginalia>
- Page break
- Used to indicate a page break when transcribing an opening rather than a single page.
<pb/>
- Soft break
- Used for words that break across a line.
<lb break="no"></lb>
to your tast; put into euery Bott<lb break="no">=</lb> ell a peece of Lemmon Pill
- Superscript
- Used for superscript text not to be lowered: only for ordinal numbers and for money (l,s,d).
<sup></sup>
xl<sup>s</sup>
- Supplied
- Used for text added at the custodial stage, by booksellers, librarians, etc. This tag is not used for text added later by the author(s) or by contemporary readers.
<supplied></supplied>
<supplied>Received 12-31-53 order no.B529</supplied>
- Unclear
- Used for uncertain text. Transcriber should enter their best guess, with the option to replace individual letters with periods.
<unclear></unclear>
<unclear>a chicin P</unclear>
- Underline
- Used for underlined text.
<u></u>
<u>This text is underlined</u>
Common abbreviations and expansions
- lb = pound
- ℥ = ounce
- f℥ = fluid ounce
- ʒ = dram, drachm
- ℈ = scruple
- -ā/-an
a<expan>n</expan>
- -r/-er, as in remembr/remember: expand "e," silently lower "r"
<expan>e</expan>r
- frō/from
fro<expan>m</expan>
- Matie/Maiestie
Ma<expan>ies</expan>tie
- -ō/-on
o<expan>n</expan>
- Sr/Sir
S<expan>i</expan>r
- yt/that
<expan>tha</expan>t
- ye/the
<expan>th</expan>e
- ym/them
<expan>the</expan>m
- yn/then
<expan>the</expan>n
- -ū/-um
u<expan>m</expan>
- wch/which
w<expan>hi</expan>ch
- wth/with
w<expan>i</expan>th
- yr/your
y<expan>ou</expan>r
- yor/your
yo<expan>u</expan>r
Table-based transcription in FromThePage
See documentation here: https://content.fromthepage.com/project-owner-documentation/table-encoding/
If there are column headers, transcribe them first, with each header separated by a space-pipe-space (" | "). The row should begin and end with a pipe character. The column heading separator comes after the header row, and consists of pipes separated by hyphens (" | --- | --- | ") (Using more hyphens may be visually helpful to the transcriber so that the columns line up, but is not necessary.)
The remainder of the table is transcribed the same way as the column header, with each column separated by a space-pipe-space. For an empty cell, just leave a space in between two pipes.
A small table (with a blank cell in row 2, column D) would be encoded as follows:
| Heading column A | Heading column B | Heading column C | Heading column D | | --------------- | --------------- | --------------- | --------------- | | Row 1, column A | Row 1, column B | Row 1, column C | Row 1, column D | | Row 2, column A | Row 2, column B | Row 2, column C | | | Row 3, column A | Row 3, column B | Row 3, column C | Row 3, column D |
FromThePage/Dromio transcription differences
When we transcribe in Dromio, we capture enough information to generate both a diplomatic and a semi-diplomatic transcription. When we transcribe in FromThePage, we're only concerned with semi-diplomatic transcriptions, which means that there's less that we need to tag. The major differences are as follows (see also: the tag crosswalk below):
- Do not tag macrons in FTP; just add in the expanded letters and mark then with an expan tag.
- Do not tag or transcribe metamarks in FTP.
- Do not tag any silently lowered text in FTP; just silently lower it without a tag.
FromThePage/Dromio crosswalk
Dromio tag | FromThePage tag | Used for |
---|---|---|
add | supplied | text added later, at the custodial stage |
brev-es | <expan>e</expan>s | -es graph |
brev-p | p<expan>re</expan> (etc.) | special p |
brev-y | <expan>th</expan> | thorn letter |
catchword | catchword | catchwords |
del | del | cancelled or crossed out text |
ex | expan | expansions |
gap | gap | gap in text due to damage, etc. |
graphic | figure | illustrations, etc. |
heading | head | headings |
ins | add | intralineal insertions |
label | marginalia | marginal notes |
macron | do not tag (expand added character(s) with expan tag) | macrons or tildes above characters |
metamark | do not transcribe | metamarks |
note | use notes box below the transcription field | transcribers' notes |
pb | pb | page break |
sl | do not tag | superscript text to be silently lowered |
sup | sup | superscript text not to be lowered |
unclear | unclear | unclear or uncertain text |