'$RCSfile: eml-text.xsd,v $'
Copyright: 1997-2002 Regents of the University of California,
University of New Mexico, and
Arizona State University
Sponsors: National Center for Ecological Analysis and Synthesis and
Partnership for Interdisciplinary Studies of Coastal Oceans,
University of California Santa Barbara
Long-Term Ecological Research Network Office,
University of New Mexico
Center for Environmental Studies, Arizona State University
Other funding: National Science Foundation (see README for details)
The David and Lucile Packard Foundation
For Details: http://knb.ecoinformatics.org/
This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation; either version 2 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
eml-text
The eml-text module - Text field formatting
any module
no
Text
A simple text desription.
The "text" element allows for both formatted and unformatted text blocks to be included in EML. It can contain a number of relevant subsections that allow the use of titles, sections, and paragraphs in the text block. The structured elements can be specified using a subset of DocBook so the predefined DocBook stylesheets can be used to style EML fields that implement this module, or alternatively can be specified using Markdown text blocks.
Text
A simple text desription.
The "text" element allows for both formatted and unformatted text blocks to be included in EML. It can contain a number of relevant subsections that allow the use of titles, sections, and paragraphs in the text block. This markup is a subset of DocBook, or alternatively can be specified using Markdown text blocks.
Section
A section of related text.
The "section" element allows for grouping related paragraphs of text together, with an optional title. This markup is a subset of DocBook.
Paragraph
A simple paragraph of text.
The "paragraph" element allows for both formatted and unformatted text blocks to be included in EML. It can be plain text or text with a limited set of markup tags, including emphasis, subscript, superscript, and lists. This markup is a subset of DocBook.
Markdown
A block of text formatted with Markdown directives.
Markdown is a family of text-based formatting directives that can be used to
structure and format a block of text. A single markdown element in EML can contain multiple
formatting directives that support creation of sections and subsections with headings, a
wide variety of text formatting directives, the ability to include inline links to external
content, and the ability to embed inline citations, figures, and tables. EML's markdown element
follows the GitHub Flavored Markdown (GFM) extensions to the CommonMark specification. Clients that
display EML should use a markdown preprocessor to convert the Markdown formatting into an
appropriate display format such as HTML or PDF as appropriate. When a Markdown block is interleaved
with other blocks of text such as section and paragraph elements from Docbook, the Markdown section
should be interleaved as a block-level element in the flow of the document. This allows authors to
specify, for example, an initial section in DocBook, followed by a Markdown section, and then possibly
other sections in DocBook. This will likely be uncommon because Markdown and Docbook have similar
formatting capabilities, but it may be helpful when converting legacy documents that use DocBook.
Because markdown uses special characters that might be reserved by XML processors, one
must be careful to escape such characters, which is typically done by embedding the
text in a CDATA block, or other XML escaping measures. These escape sequences must be
unescaped before parsing the text with a Markdown processor.
Within a Markdown block, one can use embedded images to specify the location where an inline image
should be displayed within the document. For example, the syntax for an embedded image uses the
markdown reference syntax, for example ![Figure 1][fig.1.ab567w]
, where "fig.1.ab567w"
is the unique id
attribute for the entity containing the reference to the image. When
client tools process such image links, they should inline the image data from that entity at the
location specified, which may involve, for example, resolving the image url from an
otherEntity
section. This means that there is an implied link in all Markdown
documents of the form [id]: url/to/image "Optional title attribute"
, which is derived
from the metadata for each entity within a document. Users do not need to insert these links in
order to use them, but client software that might be generating HTML from the markdown will likely
need to generate them from the entity metadata if they are using an external Markdown pre-processor
to handle conversion to HTML and other languages.
Inline citations can also be used to cite scholarly works in the text of an EML
document. This follows the Pandoc syntax for citation keys, in which the citation keys
are in inside square brackets and separated by semicolons. Each citation is identified
by a key, which consists of an ‘@’ symbol and the citation identifier from the entry for
that citation. Citation keys may optionally have a prefix, a locator, and a suffix to
further qualify what is being cited. For example, a simple citation would be
constructed as '[@fegraus_2005]', and a list would be '[@jones_2001; @fegraus_2005]'.
The keys must be present in either the 'id' field of a
citation element in the EML document, or as the BibTex key in a 'bibtex' entry in the
EML document. It is a validation error to cite an entry for which the corresponding
citation key is not present in the EML document, and it is a validation error for the
same citation key to be reused across citation and bibtex elements in the document (each
citation identifier must be unique within the document). Clients that parse and display
EML documents should first gather up all citation and bibtex elements in the document to
create a citation database in bibtext format, and then pass that database along with the
text in markdown sections to pandoc or an equivalent tool to convert the citations into
properly formatted, human readable citations. Pandoc supports the use of Citation Style
Language (CSL) files to specify the formatting of citations upon conversion. See
http://citationstyles.org/ for more details.
Because bulleted lists and other structures within Markdown are dependent on indenting the raw
markdown text, authors and processors should pay close attention to formatting within the
markdown block. In particular, if the XML document within which the markdown block is embedded
is in an indented hierarchy, then the first non-whitespace character of the markdown block defines
the column for the leftmost column of the markdown, and all subsequent markdown should be indented
relative to that column. For example, if the first character of the markdown is in column 16
of the document, then all subsequent markdown lines in that block should also start on column 16.
A bulleted list would start on column 16, and its sublist would be indented four space to column 20.
Paragraph
A simple paragraph of text.
The "paragraph" element allows for both formatted and unformatted text blocks to be included in EML. It can be plain text or text with a limited set of markup tags, including emphasis, subscript, superscript, lists and links. This
markup is a subset of DocBook.
Localized string for the paragraph
Language translation for the paragraph
Language translation as specified by the xml:lang attribute
Itemized List
A list of items in a text paragraph.
A list of items in a text paragraph. The list is generally displayed as a bulleted list. This markup is a subset of DocBook.
Ordered List
An ordered list of items in a text paragraph.
An ordered list of items in a text paragraph. The list is generally displayed as a numbered list. This markup is a subset of DocBook.
Emphasis
A span of emphasized text in a paragraph.
A span of emphasized text in a paragraph. Emphasized text is generally rendered as boldfaced or otherwise distinct from the surrounding text. This markup is a subset of DocBook.
Subscript
A subscript in a text paragraph.
A subscript in a text paragraph. This markup is a subset of DocBook.
Superscript
A superscript in a text paragraph.
A superscript in a text paragraph. This markup is a subset of DocBook.
literalLayout
The equivalent to <pre> in html.
This element specifies that the structure of the text within the tag, specifically the whitespace, should not be altered.
ulink
a link that addresses its target by means of a URL
this element and its children allow paragraphs to contain urls and titles for anchor tags. This markup is a subset of DocBook.
citetitle
the title of the cited work
the citetitle element contains a text title for the url. It can be displayed in an anchor tag. This markup is a subset of DocBook.
The Dublin Core Metadata Initiative
url
the Uniform Resource Locator for the cited work
the url attribute contains the location of the work for a link. This markup is a subset of DocBook.
url="http://dublincore.org/documents/usageguide/"
Section
A section of related text.
The "section" element allows for grouping related paragraphs (or other sections) of text together, with an optional title. This markup is a subset of DocBook.
Title
The optional title of the section.
The optional title for a section. This markup is a subset of DocBook.
Paragraph
A simple paragraph of text.
The "paragraph" element allows for both formatted and unformatted text blocks to be included in EML. It can be plain text or text with a limited set of markup tags, including emphasis, subscript, superscript, and lists. This markup
is a subset of DocBook.
Section
A section of related text.
The "section" element allows for grouping related paragraphs of text together, with an optional title. This markup is a subset of DocBook.
ListType
A list of items in a text paragraph.
A list of items in a text paragraph. The ListType is used by both orderedlist elements and itemizedlist elements. This markup is a subset of DocBook.
List Item
An item in a list of items.
An item in a list of items. Each list item is formatted as a bulleted or numbered item depending on the list type in which it resides. List items contain paragraphs which in turn can be plain text or text with a limited set of markup
tags, including emphasis, subscript, superscript, and lists. This markup is a subset of DocBook.
Paragraph
A simple paragraph of text.
The "paragraph" element allows for both formatted and unformatted text blocks to be included in EML. It can be plain text or text with a limited set of markup tags, including emphasis, subscript, superscript, and lists. This markup
is a subset of DocBook.
Itemized List
A list of items in a text paragraph.
A list of items in a text paragraph. The list is generally displayed as a bulleted list. This markup is a subset of DocBook.
Ordered List
An ordered list of items in a text paragraph.
An ordered list of items in a text paragraph. The list is generally displayed as a numbered list. This markup is a subset of DocBook.
Subscript/Superscript Type
A subscript or a superscript in a text paragraph.
A subscript or a superscript in a text paragraph. This type is used by both subscript and superscript elements to define their recursive content. This markup is a subset of DocBook.
Localized string for the sub/superscript
Language translation for the sub/superscript
Language translation as specified by the xml:lang attribute
Subscript
A subscript in a text paragraph.
A subscript in a text paragraph. This markup is a subset of DocBook.
Superscript
A superscript in a text paragraph.
A superscript in a text paragraph. This markup is a subset of DocBook.
Localized string
Language translation for the text
Language translation as specified by the xml:lang attribute
xml:lang
language code for element content
Specifies the language
xml:lang="en-US"