This specification describes the standard, required atomic XProc steps. A machine-readable description of these steps may be found in steps.xpl.
Many atomic steps are available for [XProc 3.03.1]. They are described in several specifications. This specification describes the general background common to all steps. A conformant processor must implement all of the steps in this specification. Additional steps may also be implemented.
The types given for options should be understood as follows:
Types in the XML Schema namespace, identified as QNames with the
xs:
prefix, as per the XML Schema specification with one exception. Anywhere anxs:QName
is specified, an EQName is allowed.XPathExpression
: As a string per [W3C XML Schema: Part 2], including whitespace normalization, and the further requirement to be a conformant Expression per [XPath 3.1].XSLTSelectionPattern
: As a string per [XSLT 3.0] conforming to an XSLT selection pattern.XPathSequenceType
: An XPath sequence type.ContentType
: A media type as defined in [RFC 2046].ContentTypes
: As a whitespace separated list of media types as defined in [RFC 2046].
Option values are often expressed using the shortcut syntax. In these cases, the option shortcuts are generally treated as value templates. However, for options of type map()
or array()
, an expression is required (there is no non-expression string which can ever be a legal value for a map or array). Given that every value entered this way will have to be a value template, and consequently every curly brace contained within the expression will have to be escaped, values of type map or array are defined to be expressions directly.
Some aspects of documents are generally unchanged by steps:
When a step in this library produces an output document, the base URI of the output is the base URI of the step's primary input document unless the step's process explicitly sets an
xml:base
attribute or the step's description explicitly states how the base URI is constructed.Steps are responsible for describing how document properties are transformed as documents flow through them. Many steps claim that the specified properties are preserved. Generally, it is the responsibility of the pipeline author to determine when this is inapropriate and take corrective action. However, it is the responsibility of the pipeline processor to assure that the
content-type
property is correct. If a step transforms a document in a manner that is inconsistent with thecontent-type
property (accepting an XML document on the source port but producing a text document on the result, for example), the processor must assure that thecontent-type
property is appropriate. If a step changes thecontent-type
in this way, it must also remove theserialization
property
Also, in this specification, several steps use this element for result information:
<c:result>
string
</c:result>
When a step uses an XPath to compute an option value, the XPath context is as defined in [XProc 3.03.1].
When a step specifies a particular version of a technology, implementations must implement that version or a subsequent version that is backwards compatible with that version. At user-option, they may implement other non-backwards compatible versions.
In this specification the words must, must not, should, should not, may and recommended are to be interpreted as described in [RFC 2119].
As described in PSVIs in XProc in XProc 3.0: An XML Pipeline Language, steps may not produce PSVI output unless that behavior is explicitly described. In this specification, the steps that may produce PSVI output are the “identity” steps: p:identity
, p:store
, and p:split-sequence
(which must preserve PSVI properties that appear on their inputs). In addition, the p:xslt
and p:xquery
steps may return documents with PSVI annotations.
A conformant processor must support all of these steps.
The p:add-attribute
step adds a single attribute to a set of matching elements. The input document specified on the source
is processed for matches specified by the selection pattern in the match
option. For each of these matches, the attribute whose name is specified by the attribute-name
option is set to the attribute value specified by the attribute-value
option. The p:add-attribute
step adds a single attribute to a set of matching elements. The input document specified on the source
is processed for matches specified by the selection pattern in the match
option. For each of these matches, the attribute whose name is specified by the attribute-name
option is set to the attribute value specified by the attribute-value
option.
The resulting document is produced on the result
output port and consists of a exact copy of the input with the exception of the matched elements. Each of the matched elements is copied to the output with the addition of the specified attribute with the specified value.
<p:declare-step
type
="
p:add-attribute
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
attribute-name
"
required
="
true
"
as
="
xs:QName
"
/>
<p:option
name
="
attribute-value
"
required
="
true
"
as
="
xs:string
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Default value | Required |
---|---|---|---|
attribute-name | xs:QName | ✔ | |
attribute-value | xs:string | ✔ | |
match | XSLTSelectionPattern | '/*' |
Errors
Error code | Description |
---|---|
err:XC0023 | It is a dynamic error if the selection pattern matches a node which is not an element. |
err:XC0059 | It is a dynamic error if the QName value in the attribute-name option is “xmlns” or uses the prefix “xmlns” or any other prefix that resolves to the namespace name http://www.w3.org/2000/xmlns/. |
Declaration
<p:declare-step
type
="
p:add-attribute
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
attribute-name
"
required
="
true
"
as
="
xs:QName
"
/>
<p:option
name
="
attribute-value
"
required
="
true
"
as
="
xs:string
"
/>
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if the selection pattern matches a node which is not an element.The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if the selection pattern matches a node which is not an element.
The value of the attribute-value
option must be a legal attribute value according to XML.The value of the attribute-value
option must be a legal attribute value according to XML.
If an attribute with the same name as the expanded name from the attribute-name
option exists on the matched element, the value specified in the attribute-value
option is used to set the value of that existing attribute. That is, the value of the existing attribute is changed to the attribute-value
value. If an attribute with the same name as the expanded name from the attribute-name
option exists on the matched element, the value specified in the attribute-value
option is used to set the value of that existing attribute. That is, the value of the existing attribute is changed to the attribute-value
value.
Note
If multiple attributes need to be set on the same element(s), the p:set-attributes
step can be used to set them all at once.
This step cannot be used to add namespace declarations. It is a dynamic error () if the QName value in the attribute-name
option is “xmlns
” or uses the prefix “xmlns
” or any other prefix that resolves to the namespace name http://www.w3.org/2000/xmlns/
. Note, however, that while namespace declarations cannot be added explicitly by this step, adding an attribute whose name is in a namespace for which there is no namespace declaration in scope on the matched element may result in a namespace binding being added by namespace fixup.This step cannot be used to add namespace declarations. It is a dynamic error () if the QName value in the attribute-name
option is “xmlns
” or uses the prefix “xmlns
” or any other prefix that resolves to the namespace name http://www.w3.org/2000/xmlns/
. Note, however, that while namespace declarations cannot be added explicitly by this step, adding an attribute whose name is in a namespace for which there is no namespace declaration in scope on the matched element may result in a namespace binding being added by namespace fixup.
If an attribute named xml:base
is added or changed, the base URI of the element must also be amended accordingly.
Document properties
All document properties are preserved.
The p:add-xml-base
step exposes the base URI via explicit xml:base
attributes. The input document from the source
port is replicated to the result
port with xml:base
attributes added to or corrected on each element as specified by the options on this step.The p:add-xml-base
step exposes the base URI via explicit xml:base
attributes. The input document from the source
port is replicated to the result
port with xml:base
attributes added to or corrected on each element as specified by the options on this step.
<p:declare-step
type
="
p:add-xml-base
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
all
"
as
="
xs:boolean
"
select
="
false()
"
/>
<p:option
name
="
relative
"
as
="
xs:boolean
"
select
="
true()
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Default value |
---|---|---|
all | xs:boolean | false() |
relative | xs:boolean | true() |
Errors
Error code | Description |
---|---|
err:XC0058 | It is a dynamic error if the all and relative options are both true. |
Declaration
<p:declare-step
type
="
p:add-xml-base
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
all
"
as
="
xs:boolean
"
select
="
false()
"
/>
<p:option
name
="
relative
"
as
="
xs:boolean
"
select
="
true()
"
/>
</p:declare-step>
The value of the all
option must be a boolean.The value of the all
option must be a boolean.
The value of the relative
option must be a boolean.The value of the relative
option must be a boolean.
It is a dynamic error () if the all
and relative
options are bothtrue
.It is a dynamic error () if the all
and relative
options are bothtrue
.
The p:add-xml-base
step modifies its input as follows:
For every element that is a child of the document node: force the element to have an
xml:base
attribute with the document's [base URI] property's value as its value.For other elements:
If the
all
option has the valuetrue
, force the element to have anxml:base
attribute with the element's [base URI] value as its value.If theall
option has the valuetrue
, force the element to have anxml:base
attribute with the element's [base URI] value as its value.If the element's [base URI] is different from the its parent's [base URI], force the element to have an
xml:base
attribute with the following value: if the value of therelative
option istrue
, a string which, when resolved against the parent's [base URI], will give the element's [base URI], otherwise the element's [base URI].If the element's [base URI] is different from the its parent's [base URI], force the element to have anxml:base
attribute with the following value: if the value of therelative
option istrue
, a string which, when resolved against the parent's [base URI], will give the element's [base URI], otherwise the element's [base URI].Otherwise, if there is an
xml:base
attribute present, remove it.
Document properties
All document properties are preserved.
The p:archive
step outputs on its result
port an archive (usually binary) document, for instance a ZIP file. A specification of the contents of the archive may be specified in a manifest XML document on the manifest
port. The step produces a report on the report
port, which contains the manifest, amended with additional information about the archiving.The p:archive
step outputs on its result
port an archive (usually binary) document, for instance a ZIP file. A specification of the contents of the archive may be specified in a manifest XML document on the manifest
port. The step produces a report on the report
port, which contains the manifest, amended with additional information about the archiving.
<p:declare-step
type
="
p:archive
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
sequence
="
true
"
/>
<p:input
port
="
manifest
"
content-types
="
xml
"
sequence
="
true
"
>
<p:empty/>
</p:input>
<p:input
port
="
archive
"
content-types
="
any
"
sequence
="
true
"
>
<p:empty/>
</p:input>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:output
port
="
report
"
content-types
="
application/xml
"
sequence
="
false
"
/>
<p:option
name
="
format
"
as
="
xs:QName
"
select
="
'zip'
"
/>
<p:option
name
="
relative-to
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types | Default binding |
---|---|---|---|---|
source | ✔ | ✔ | any | |
manifest | ✔ | xml | p:empty | |
archive | ✔ | any | p:empty |
Output port | Primary | Sequence | Content types | Default binding |
---|---|---|---|---|
result | ✔ | any | ||
report | application/xml |
Option name | Type | Default value |
---|---|---|
format | xs:QName | 'zip' |
parameters | map(xs:QName, item()*)? | () |
relative-to | xs:anyURI? | () |
Errors
Error code | Description |
---|---|
err:XC0079 | It is a dynamic error if the map parameters contains an entry whose key is defined by the implementation and whose value is not valid for that key. |
err:XC0080 | It is a dynamic error if the number of documents on the archive does not match the expected number of archive input documents for the given format and command. |
err:XC0081 | It is a dynamic error if the format of the archive does not match the format as specified in the format option. |
err:XC0084 | It is a dynamic error if two or more documents appear on the p:archive step's source port that have the same base URI or if any document that appears on the source port has no base URI. |
err:XC0085 | It is a dynamic error if the format of the archive cannot be understood, determined and/or processed. |
err:XC0100 | It is a dynamic error if the document on port manifest does not conform to the given schema. |
err:XC0112 | It is a dynamic error if more than one document appears on the port manifest. |
err:XD0011 | It is a dynamic error if the resource referenced by the href option does not exist, cannot be accessed or is not a file. |
err:XD0064 | It is a dynamic error if the base URI is not both absolute and valid according to . |
Implementation details
Implementation | Description |
---|---|
Defined | The list of formats supported by the p:archive step is implementation-defined. |
Defined | The list of archive formats that can be modified by p:archive is implementation-defined. |
Defined | The semantics of any additional attributes, elements and their values are implementation-defined. |
Defined | It is implementation-defined what other formats are supported. |
Defined | It is implementation-defined what other formats are supported. |
Defined | It is implementation-defined how the step determines the archive's format. |
Defined | The c:archive root element may contain additional implementation-defined attributes. |
Defined | The default compression method is implementation-defined. |
Defined | It is implementation-defined what other compression methods are supported. |
Defined | The default compression method is implementation-defined. |
Defined | It is implementation-defined what compression levels are supported. |
Defined | The c:entry elements may contain additional implementation-defined attributes. |
Defined | The p:archive step may support additional, implementation-defined commands for ZIP files. |
Defined | The actual parameter names supported by p:archive for a particular format are implementation-defined. |
Declaration
<p:declare-step
type
="
p:archive
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
sequence
="
true
"
/>
<p:input
port
="
manifest
"
content-types
="
xml
"
sequence
="
true
"
>
<p:empty/>
</p:input>
<p:input
port
="
archive
"
content-types
="
any
"
sequence
="
true
"
>
<p:empty/>
</p:input>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:output
port
="
report
"
content-types
="
application/xml
"
sequence
="
false
"
/>
<p:option
name
="
format
"
as
="
xs:QName
"
select
="
'zip'
"
/>
<p:option
name
="
relative-to
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
</p:declare-step>
The p:archive
step can perform several different operations on archives. The most common one will likely be creating an archive, but it could also, depending on the archive format, provide services like update, freshen or even merge. The only format implementations must support is [ZIP]. The list of formats supported by the p:archive
step is implementation-defined.
The p:archive
step has the following input ports:
source
The (primary)
source
port is used to provide documents to be archived (for instance constructed by other steps). How and which of these documents are processed is governed by the document(s) appearing on the other input ports and the combination of options and parameters. See below for details. It is a dynamic error () if two or more documents appear on thep:archive
step'ssource
port that have the same base URI or if any document that appears on thesource
port has no base URI.The (primary)source
port is used to provide documents to be archived (for instance constructed by other steps). How and which of these documents are processed is governed by the document(s) appearing on the other input ports and the combination of options and parameters. See below for details. It is a dynamic error () if two or more documents appear on thep:archive
step'ssource
port that have the same base URI or if any document that appears on thesource
port has no base URI.manifest
The
manifest
port can receive a manifest document that tells the step how to construct the archive. If no manifest document is provided on this port, a default manifest is constructed automatically. See . It is a dynamic error () if the document on portmanifest
does not conform to the given schema. Themanifest
port can receive a manifest document that tells the step how to construct the archive. If no manifest document is provided on this port, a default manifest is constructed automatically. See . It is a dynamic error () if the document on portmanifest
does not conform to the given schema.It is a dynamic error () if more than one document appears on the port
manifest
.It is a dynamic error () if more than one document appears on the portmanifest
.The default input for this port is the empty sequence.
archive
The
archive
port is used to provide the step with existing archive(s) for operations like update, freshen or merge. Handling of ZIP files supports modifying archives appearing on thearchive
port (). The list of archive formats that can be modified byp:archive
is implementation-defined. For instance an implementation that supports archive merging may accept more than one document on thearchive
port.Thearchive
port is used to provide the step with existing archive(s) for operations like update, freshen or merge. Handling of ZIP files supports modifying archives appearing on thearchive
port (). The list of archive formats that can be modified byp:archive
is implementation-defined. For instance an implementation that supports archive merging may accept more than one document on thearchive
port.The default input for this port is the empty sequence.
The p:archive
step has the following output ports:
result
The (primary)
result
port will output the resulting archive.The (primary)result
port will output the resulting archive.report
The
report
port will output a report about the archiving operation. This will be the same as the manifest (as provided on themanifest
port or automatically created if there was no manifest provided), optionally amended with additional attributes and/or elements. The semantics of any additional attributes, elements and their values are implementation-defined.Thereport
port will output a report about the archiving operation. This will be the same as the manifest (as provided on themanifest
port or automatically created if there was no manifest provided), optionally amended with additional attributes and/or elements. The semantics of any additional attributes, elements and their values are implementation-defined.
The p:archive
step has the following options:
format
The format of the archive can be specified using the
format
option. Implementations must support the [] format, specified with the valuezip
. It is implementation-defined what other formats are supported.The format of the archive can be specified using theformat
option. Implementations must support the [] format, specified with the valuezip
. It is implementation-defined what other formats are supported.parameters
parameters
The
parameters
option can be used to supply parameters to control the archiving. The semantics of the keys and the allowed values for these keysSeveral parameters are defined for processing , but implementations are free to use additional parameters and to use parameters for controlling how other archive formats are processed. It is a dynamic error () if the mapparameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.Theparameters
option can be used to supply parameters to control the archiving. The semantics of the keys and the allowed values for these keysSeveral parameters are defined for processing , but implementations are free to use additional parameters and to use parameters for controlling how other archive formats are processed. It is a dynamic error () if the mapparameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.relative-to
The
relative-to
option is used in creating a manifest when no manifest is provided on themanifest
port. If a manifest is present this option is not used. If the option’s value is a relative URI, it is made absolute against the base URI of the element on which it is specified (p:with-option
or the step in case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].Therelative-to
option is used in creating a manifest when no manifest is provided on themanifest
port. If a manifest is present this option is not used. If the option’s value is a relative URI, it is made absolute against the base URI of the element on which it is specified (p:with-option
or the step in case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].
The format of the archive is determined as follows:
If the
format
option is specified, this determines the format of the archive. Implementations must support the [] format, specified with the valuezip
. It is implementation-defined what other formats are supported.It is a dynamic error () if the format of the archive does not match the format as specified in theformat
option.If theformat
option is specified, this determines the format of the archive. Implementations must support the [] format, specified with the valuezip
. It is implementation-defined what other formats are supported.It is a dynamic error () if the format of the archive does not match the format as specified in theformat
option.If no
format
option is specified or if its value is the empty sequence, the archive's format will be determined by the step, using thecontent-type
document-property of the document on thearchive
port and/or by inspecting its contents. It is implementation-defined how the step determines the archive's format. Implementations should recognize archives in [] format. If noformat
option is specified or if its value is the empty sequence, the archive's format will be determined by the step, using thecontent-type
document-property of the document on thearchive
port and/or by inspecting its contents. It is implementation-defined how the step determines the archive's format. Implementations should recognize archives in [] format.
It is a dynamic error () if the format of the archive does not match the specified format, cannot be understood, determined and/or processed.It is a dynamic error () if the format of the archive does not match the specified format, cannot be understood, determined and/or processed.
An archive manifest specifies which documents will be considered in processing the archive. Every entry in the archive must have a corresponding entry in the manifest; if no such entry is provided, one will be constructed automatically (see below). If manifest entries are provided for documents that are not in the archive, how those are processed depends on the archive type and the parameters passed to the step.
A manifest is represented by a c:archive
root element:
<c:archive>
(c:entry* &
anyNonXProcElement*)
</c:archive>
The c:archive
root element may contain additional implementation-defined attributes.
All entries in the archive must be present as c:entry
child elements:
<c:entry
name = string
href = anyURI
comment? = string
method? = string
level? = string
content-type? = ContentType>
anyElement*
</c:entry>
The
name
attribute specifies the name of the entry in the archive.The
href
attribute must be a valid URI according to [RFC 3986]. If its value is relative, it is made absolute against the base URI of the manifest. There are two possible cases:If the (absolute)
href
value is exactly the same as the base URI of a document appearing on thesource
port, that document is associated with this entry. If this entry is to be added to the archive, the associated document will be used. (Theserialization
document property can be used to provide serialization properties.) If the (absolute)href
value is exactly the same as the base URI of a document appearing on thesource
port, that document is associated with this entry. If this entry is to be added to the archive, the associated document will be used. (Theserialization
document property can be used to provide serialization properties.)If no document on the
source
port has a base URI that is exactly the same as the (absolute)href
value, the document at the specified URI is associated with this entry. These documents are stored in the archive “as is”; they must not be parsed and re-serialized.If no document on thesource
port has a base URI that is exactly the same as the (absolute)href
value, the document at the specified URI is associated with this entry. These documents are stored in the archive “as is”; they must not be parsed and re-serialized.
The
method
attribute specifies how the entry should be compressed. The default compression method is implementation-defined. Implementations must support no compression, specified with the valuenone
. It is implementation-defined what other compression methods are supported.The
level
attribute specifies the level of compression. The default compression method is implementation-defined. It is implementation-defined what compression levels are supported.The
content-type
attribute specifies the content-type of the entry as detected by the processor. It will be set byp:archive-manifest
in constructing the manifest. It will be ignored byp:archive
.
The p:archive
step should strive to retain the order of the c:entry
elements when constructing the archive. For instance, an e-book in EPUB format has a non-compressed entry that must be first in the archive. It should be possible to construct such an archive using p:archive
.
The c:entry
elements may contain additional implementation-defined attributes.
If no manifest entry is provided for a document appearing on the source
port, the step will create a manifest entry for the document. (If no document arrives on the manifest
port at all, a complete manifest document will be created.)If no manifest entry is provided for a document appearing on the source
port, the step will create a manifest entry for the document. (If no document arrives on the manifest
port at all, a complete manifest document will be created.)
In a constructed manifest entry:
The entry’s
href
value is the base URI of the document.The entry’s
name
value is derived from the base URI of the document and therelative-to
option.The entry’sname
value is derived from the base URI of the document and therelative-to
option.First, the value of the
relative-to
option is made absolute. If the initial substring of the base URI is exactly the same as the resulting absolute value, then thename
is the portion of the base URI that follows that initial substring. First, the value of therelative-to
option is made absolute. If the initial substring of the base URI is exactly the same as the resulting absolute value, then thename
is the portion of the base URI that follows that initial substring.If there is no
relative-to
option or if its value is not the initial substring of the base URI of the document, the name is the path portion of the URI (per []). If the path portion begins with an initial slash, that slash is removed. If there is norelative-to
option or if its value is not the initial substring of the base URI of the document, the name is the path portion of the URI (per []). If the path portion begins with an initial slash, that slash is removed.
It is a dynamic error () if an archive manifest is invalid according to the specification.It is a dynamic error () if an archive manifest is invalid according to the specification.
The format of the archive can be specified using the format
option. Implementations must support the [] format, specified with the value zip
. The format of the archive can be specified using the format
option. Implementations must support the [] format, specified with the value zip
.
When ZIP archives are processed, every name
in the manifest must be a relative path without a leading slash.
The parameters
option can be used to supply parameters to control the archiving. For the zip
format, the following parameters must be supported:The parameters
option can be used to supply parameters to control the archiving. For the zip
format, the following parameters must be supported:
command
Specifies what operation to perform. If not specified, its default value is
update
. Implementations must support the valuesupdate
,create
,freshen
, anddelete
. Thep:archive
step may support additional, implementation-defined commands for ZIP files. Unless otherwise specified, exactly zero or one ZIP archive can appear on thearchive
port for the commands described below. If no archive appears, a new archive will be created. Specifies what operation to perform. If not specified, its default value isupdate
. Implementations must support the valuesupdate
,create
,freshen
, anddelete
. Thep:archive
step may support additional, implementation-defined commands for ZIP files. Unless otherwise specified, exactly zero or one ZIP archive can appear on thearchive
port for the commands described below. If no archive appears, a new archive will be created.update
When the
command
parameter is set toupdate
, the ZIP archive will be updated:For every entry in the ZIP file:
If the manifest contains a
c:entry
with a matchingname
, the entry in the ZIP file is updated with the document identified by thec:entry
in the manifest.If the manifest does not contain a matching
c:entry
, the ZIP entry name is resolved against the base URI of the ZIP file.If a document exists at that URI and either has no timestamp or has a timestamp more than the timestamp in the ZIP file, the entry in the ZIP file will be updated with the document at the resolved URI.
If no document exists at that URI, or the document cannot be accessed, or the document has a timestamp and the timestamp in the ZIP archive is more recent than the timestamp of the document, then the ZIP entry is unchanged.
For every
c:entry
in the manifest that does not have a matching entry in the ZIP file, the ZIP file will be updated by adding the document identified by thec:entry
to the ZIP file.
create
When the
command
parameter is set tocreate
, the ZIP archive will be created. Creating a ZIP archive behaves exactly likeupdate
except that any timestamps are ignored; every ZIP entry will be updated or created if there is a or matching document for it. The document must be obtained by dereferencing the URI inhref
. It is a dynamic error () if the resource referenced by thehref
option does not exist, cannot be accessed or is not a file.When thecommand
parameter is set tocreate
, the ZIP archive will be created. Creating a ZIP archive behaves exactly likeupdate
except that any timestamps are ignored; every ZIP entry will be updated or created if there is a or matching document for it. The document must be obtained by dereferencing the URI inhref
. It is a dynamic error () if the resource referenced by thehref
option does not exist, cannot be accessed or is not a file.freshen
When the
command
parameter is set tofreshen
, existing files in the ZIP archive may be updated, but no new files will be added. Freshing a ZIP archive behaves exactly likeupdate
except that only entries that already exist in the ZIP archive are considered.delete
When the
command
parameter is set todelete
, exactly one document in ZIP format must appear on thearchive
port. For every entry in the ZIP file:When thecommand
parameter is set todelete
, exactly one document in ZIP format must appear on thearchive
port. For every entry in the ZIP file:If the manifest contains a
c:entry
with a matchingname
, the entry in the ZIP file is removed from the ZIP archive.
If the manifest contains
c:entry
elements which do not have a matching entry in the ZIP archive, they are simply ignored.
level
Specifies the default compression level for files added to or updated in the archive. If the
level
attribute is specified on ac:entry
, its value takes precedence for that entry. Values that must be supported for ZIP files are: “smallest
”, “fastest
”, “default
”, “huffman
”, and “none
”.method
Specifies the default compression method for files added to or updated in the archive. If the
method
attribute is specified on ac:entry
, its value takes precedence for that entry. Values that must be supported for ZIP files are: “none
” and “deflated
”.
It is a dynamic error () if the number of documents on the archive
does not match the expected number of archive input documents for the given format
and command
.It is a dynamic error () if the number of documents on the archive
does not match the expected number of archive input documents for the given format
and command
.
Implementations of other archive formats should use the same parameter names if applicable. The value spaces for these parameters may be format-specific though. The actual parameter names supported by p:archive
for a particular format are implementation-defined.
Document properties
No document properties are preserved. The archive has no base-uri
.
The p:archive-manifest
creates an XML manifest file describing the contents of the archive appearing on its source
port.The p:archive-manifest
creates an XML manifest file describing the contents of the archive appearing on its source
port.
<p:declare-step
type
="
p:archive-manifest
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
application/xml
"
sequence
="
false
"
/>
<p:option
name
="
format
"
as
="
xs:QName?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
<p:option
name
="
relative-to
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
override-content-types
"
as
="
array(array(xs:string))?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | application/xml |
Option name | Type | Default value |
---|---|---|
format | xs:QName? | () |
override-content-types | array(array(xs:string))? | () |
parameters | map(xs:QName, item()*)? | () |
relative-to | xs:anyURI? | () |
Errors
Error code | Description |
---|---|
err:XC0079 | It is a dynamic error if the map parameters contains an entry whose key is defined by the implementation and whose value is not valid for that key. |
err:XC0081 | It is a dynamic error if the format of the archive does not match the format as specified in the format option. |
err:XC0085 | It is a dynamic error if the format of the archive cannot be understood, determined and/or processed. |
err:XC0120 | It is a dynamic error if the relative-to option is not present and the document on the source port does not have a base URI. |
err:XC0146 | It is a dynamic error if the specified value for the override-content-types option is not an array of arrays, where the inner arrays have exactly two members of type xs:string. |
err:XC0147 | It is a dynamic error if the specified value is not a valid XPath regular expression. |
err:XD0064 | It is a dynamic error if the base URI is not both absolute and valid according to . |
err:XD0079 | It is a dynamic error if a supplied content-type is not a valid media type of the form “type/subtype+ext” or “type/subtype”. |
Implementation details
Implementation | Description |
---|---|
Defined | It is implementation-defined what other formats are supported. |
Defined | It is implementation-defined how the step determines the archive's format. |
Defined | The semantics of the keys and the allowed values for these keys are implementation-defined. |
Defined | Additional information provided for entries in p:archive-manifest is implementation-defined. |
Declaration
<p:declare-step
type
="
p:archive-manifest
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
application/xml
"
sequence
="
false
"
/>
<p:option
name
="
format
"
as
="
xs:QName?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
<p:option
name
="
relative-to
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
override-content-types
"
as
="
array(array(xs:string))?
"
/>
</p:declare-step>
The p:archive-manifest
step inspects the archive appearing on its source
port and outputs a manifest describing the contents of the archive on its result
port. The p:archive-manifest
step inspects the archive appearing on its source
port and outputs a manifest describing the contents of the archive on its result
port.
The format of the archive is determined as follows:
If the
format
option is specified, this determines the format of the archive. Implementations must support the [] format, specified with the valuezip
. It is implementation-defined what other formats are supported.It is a dynamic error () if the format of the archive does not match the format as specified in theformat
option. If theformat
option is specified, this determines the format of the archive. Implementations must support the [] format, specified with the valuezip
. It is implementation-defined what other formats are supported.It is a dynamic error () if the format of the archive does not match the format as specified in theformat
option.If no
format
option is specified or if its value is the empty sequence, the archive's format will be determined by the step, using thecontent-type
document-property of the document on thesource
port and/or by inspecting its contents. It is implementation-defined how the step determines the archive's format. Implementations should recognize archives in [] format. If noformat
option is specified or if its value is the empty sequence, the archive's format will be determined by the step, using thecontent-type
document-property of the document on thesource
port and/or by inspecting its contents. It is implementation-defined how the step determines the archive's format. Implementations should recognize archives in [] format.
It is a dynamic error () if the format of the archive does not match the specified format, cannot be understood, determined and/or processed.It is a dynamic error () if the format of the archive does not match the specified format, cannot be understood, determined and/or processed.
The parameters
option can be used to supply parameters to control the archive manifest generation. The semantics of the keys and the allowed values for these keys are implementation-defined.It is a dynamic error () if the map parameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.The parameters
option can be used to supply parameters to control the archive manifest generation. The semantics of the keys and the allowed values for these keys are implementation-defined.It is a dynamic error () if the map parameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.
The relative-to
option, when present, is used in creating the value of the manifest's c:entry/@href
attribute. If the option is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or the step in case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].The relative-to
option, when present, is used in creating the value of the manifest's c:entry/@href
attribute. If the option is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or the step in case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].
The generated manifest has the format as described in . Implementations must supply an element and its name
and content-type
attributes for every entry in the archive. The value of the generated manifest's c:entry/@href
attribute will be determined in the same way as a base URI of an unarchived document by . It is a dynamic error () if the relative-to
option is not present and the document on the source
port does not have a base URI. Additional information provided for entries in p:archive-manifest
is implementation-defined.The generated manifest has the format as described in . Implementations must supply an element and its name
and content-type
attributes for every entry in the archive. The value of the generated manifest's c:entry/@href
attribute will be determined in the same way as a base URI of an unarchived document by . It is a dynamic error () if the relative-to
option is not present and the document on the source
port does not have a base URI. Additional information provided for entries in p:archive-manifest
is implementation-defined.
The override-content-types
option can be used to partially override the content-type determination mechanism. If present, it must be an array of arrays, where the inner arrays consist of exactly two strings: The override-content-types
option can be used to partially override the content-type determination mechanism. If present, it must be an array of arrays, where the inner arrays consist of exactly two strings:
The first member in an inner array must be a regular expression as specified in [], section 7.61 “
Regular Expression Syntax
”. It is a dynamic error () if the specified value is not a valid XPath regular expression.The first member in an inner array must be a regular expression as specified in [], section 7.61 “Regular Expression Syntax
”. It is a dynamic error () if the specified value is not a valid XPath regular expression.The second member in an inner array must be a valid a MIME content-type. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”.The second member in an inner array must be a valid a MIME content-type. It is a dynamic error () if a supplied content-type is not a valid media type of the form “type
/subtype
” or “type
/subtype
+ext
”.type
/subtype
It is a dynamic error () if the specified value for the override-content-types
option is not an array of arrays, where the inner arrays have exactly two members of type xs:string
.It is a dynamic error () if the specified value for the override-content-types
option is not an array of arrays, where the inner arrays have exactly two members of type xs:string
.
Determining an archive entry's content-type is as follows:
The XPath regular expressions (the first members of the inner arrays) will be matched against the path of the entry in the archive. This will be done in the order of appearance in the outer array (so order is significant). The matching is done unanchored: it is a match if the regular expression matches part of the entry's path. Informally: matching behaves like applying the XPath
matches#2
function, like inmatches($path-in-archive, $regular-expression)
.Note
Depending on how archives are constructed, the path of an entry in an archive can be with or without a leading slash. Usually it will be without. For archives constructed by
p:archive
no leading slash will be present.If a match is found, the content-type (the second member of the inner array for which the match was found) is used as the entry's content-type.
If no match was found for all inner arrays, the normal (implementation-defined) mechanism for determining the content-type is used.
For example: setting the override-content-types
option to [ ['.rels$', 'application/xml'], ['^special/', 'application/octet-stream'] ]
means that all files ending with .rels
will get the content-type application/xml
. All files in the archive's special
directory (including sub-directories) will get the content-type application/octet-stream
.For example: setting the override-content-types
option to [ ['.rels$', 'application/xml'], ['^special/', 'application/octet-stream'] ]
means that all files ending with .rels
will get the content-type application/xml
. All files in the archive's special
directory (including sub-directories) will get the content-type application/octet-stream
.
Document properties
No document properties are preserved. The manifest has no base-uri
.
The p:cast-content-type
step creates a new document by changing the media type of its input. If the value of the content-type
option and the current media type of the document on source
port are the same, this document will appear unchanged on result
port.The p:cast-content-type
step creates a new document by changing the media type of its input. If the value of the content-type
option and the current media type of the document on source
port are the same, this document will appear unchanged on result
port.
<p:declare-step
type
="
p:cast-content-type
"
>
<p:input
port
="
source
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
any
"
/>
<p:option
name
="
content-type
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | any |
Option name | Type | Default value | Required |
---|---|---|---|
content-type | xs:string | ✔ | |
parameters | map(xs:QName,item()*)? | () |
Errors
Error code | Description |
---|---|
err:XC0071 | It is a dynamic error if the p:cast-content-type step cannot perform the requested cast. |
err:XC0072 | It is a dynamic error if the c:data contains content is not a valid base64 string. |
err:XC0073 | It is a dynamic error if the c:data element does not have a content-type attribute. |
err:XC0074 | It is a dynamic error if the content-type is supplied and is not the same as the content-type specified on the c:data element. |
err:XC0079 | It is a dynamic error if the map parameters contains an entry whose key is defined by the implementation and whose value is not valid for that key. |
err:XD0049 | It is a dynamic error if the text value is not a well-formed XML document |
err:XD0057 | It is a dynamic error if the text document does not conform to the JSON grammar, unless the parameter liberal is true and the processor chooses to accept the deviation. |
err:XD0058 | It is a dynamic error if the parameter duplicates is reject and the text document contains a JSON object with duplicate keys. |
err:XD0059 | It is a dynamic error if the parameter map contains an entry whose key is defined in the specification of fn:parse-json and whose value is not valid for that key, or if it contains an entry with the key fallback when the parameter escape with true() is also present. |
err:XD0060 | It is a dynamic error if the text document can not be converted into the XPath data model |
err:XD0079 | It is a dynamic error if a supplied content-type is not a valid media type of the form “type/subtype+ext” or “type/subtype”. |
Implementation details
Implementation | Description |
---|---|
Defined | The semantics of the keys and the allowed values for these keys are implementation-defined. |
Defined | The precise nature of the conversion from XML to JSON is implementation-defined. |
Defined | Casting from an XML media type to any other media type when the input document is not a c:data document is implementation-defined. |
Defined | Casting from an HTML media type to a JSON media type is implementation-defined. |
Defined | Casting from an HTML media type to any other media type is implementation-defined. |
Defined | It is implementation-defined whether other result formats are supported. |
Defined | Casting from a JSON media type to an HTML media type is implementation-defined. |
Defined | Casting from a JSON media type to any other media type is implementation-defined. |
Defined | The precise way in which text documents are parsed into the XPath data model is implementation-defined. |
Defined | Casting from a text media type to any other media type is implementation-defined. |
Defined | Casting from any other media type to a HTML media type, a JSON media type or a text document is implementation-defined. |
Defined | Casting from any other media type to any other media type is implementation-defined. |
Declaration
<p:declare-step
type
="
p:cast-content-type
"
>
<p:input
port
="
source
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
any
"
/>
<p:option
name
="
content-type
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
</p:declare-step>
The input document is transformed from one media type to another. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if the type
/subtype
p:cast-content-type
step cannot perform the requested cast. The input document is transformed from one media type to another. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if the type
/subtype
p:cast-content-type
step cannot perform the requested cast.
The parameters
can be used to supply parameters to control casting. The semantics of the keys and the allowed values for these keys are implementation-defined.It is a dynamic error () if the map parameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.The parameters
can be used to supply parameters to control casting. The semantics of the keys and the allowed values for these keys are implementation-defined.It is a dynamic error () if the map parameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.
Casting from one XML media type to another simply changes the “
content-type
” document property.Casting from an XML media type to an HTML media type changes the “
content-type
” document property and removes any serialization property.Casting from an XML media type to a JSON media type converts the XML into JSON. The precise nature of the conversion from XML to JSON is implementation-defined. If the input document is an XML representation of JSON as defined in [XPath and XQuery Functions and Operators 3.1], implementations must produce the same result as
fn:parse-json(fn:xml-to-json())
by default. If the input document has ac:param-set
document element, an instance ofmap(xs:QName, xs:string)
must be returned that represents the document'sc:param
elements. The serialization property is removed.Casting from an XML media type to a text media type serializes the XML document by calling
fn:serialize($doc, $param)
where$doc
is the document on thesource
port and$param
is the serialization property of this document. The resulting string is wrapped by a document node and returned on theresult
port. The serialization property is removed.Casting from an XML media type to a text media type serializes the XML document by callingfn:serialize($doc, $param)
where$doc
is the document on thesource
port and$param
is the serialization property of this document. The resulting string is wrapped by a document node and returned on theresult
port. The serialization property is removed.Casting from an XML media type to any other media type must support the case where the input document is a
c:data
document. The resulting document will have the specified media type and a representation that is the content of thec:data
element after decoding the base64 encoded content The serialization property is removed.It is a dynamic error () if the contains content is not a valid base64 string.It is a dynamic error () if the contains content is not a valid base64 string.
It is a dynamic error () if the element does not have a
content-type
attribute.It is a dynamic error () if the element does not have acontent-type
attribute.It is a dynamic error () if the
content-type
is supplied and is not the same as thecontent-type
specified on the element. It is a dynamic error () if thecontent-type
is supplied and is not the same as thecontent-type
specified on the element.Casting from an XML media type to any other media type when the input document is not a
c:data
document is implementation-defined.
Casting from an HTML media type to an XML media type changes “
content-type
” document property and removes any serialization property.Casting from an HTML media type to another HTML media type changes “
content-type
” document property.Casting from an HTML media type to a JSON media type is implementation-defined.
Casting an an HTML media type to a text media type serializes the HTML document by calling
fn:serialize($doc, $param)
where$doc
is the document on thesource
port and$param
is the serialization property of this document. The resulting string is wrapped by a document node and returned on theresult
port. The serialization property is removed.Casting an an HTML media type to a text media type serializes the HTML document by callingfn:serialize($doc, $param)
where$doc
is the document on thesource
port and$param
is the serialization property of this document. The resulting string is wrapped by a document node and returned on theresult
port. The serialization property is removed.Casting from an HTML media type to any other media type is implementation-defined.
Casting from a JSON media type to an XML media type converts the JSON into XML. An implementation must support the format specified in section “XML Representation of JSON” of [XPath and XQuery Functions and Operators 3.1] as default for the resulting XML. It is implementation-defined whether other result formats are supported. The serialization property is removed.
Casting from a JSON media type to an HTML media type is implementation-defined.
Casting from a JSON media type to another JSON media type changes “
content-type
” document property.Casting from a JSON media type to a text media type serializes the JSON document by calling
fn:serialize($doc, $param)
where$doc
is the document on thesource
port and$param
is the serialization property of this document. The resulting string is wrapped by a document node and returned on theresult
port. The serialization property is removed.Casting from a JSON media type to a text media type serializes the JSON document by callingfn:serialize($doc, $param)
where$doc
is the document on thesource
port and$param
is the serialization property of this document. The resulting string is wrapped by a document node and returned on theresult
port. The serialization property is removed.Casting from a JSON media type to any other media type is implementation-defined.
Casting from a text media type to an XML media type parses the text value of the document on
source
port by callingfn:parse-xml
. It is a dynamic error () if the text value is not a well-formed XML document. The serialization property is removed.Casting from a text media type to an XML media type parses the text value of the document onsource
port by callingfn:parse-xml
. It is a dynamic error () if the text value is not a well-formed XML document. The serialization property is removed.Casting from a text media type to an HTML media type parses the text value of the document on
source
port into an XPath data model document that contains a tree of elements, attributes, and other nodes. The precise way in which text documents are parsed into the XPath data model is implementation-defined.It is a dynamic error () if the text document can not be converted into the XPath data model. The serialization property is removed.Casting from a text media type to an HTML media type parses the text value of the document onsource
port into an XPath data model document that contains a tree of elements, attributes, and other nodes. The precise way in which text documents are parsed into the XPath data model is implementation-defined.It is a dynamic error () if the text document can not be converted into the XPath data model. The serialization property is removed.Casting from a text media type to a JSON media type parses the text value of the document on
source
port by callingfn:parse-json($doc, $par)
where$doc
is the text document and$par
is theparameter
option. It is a dynamic error () if the text document does not conform to the JSON grammar, unless the parameter liberal is true and the processor chooses to accept the deviation. It is a dynamic error () if the parameter duplicates is reject and the text document contains a JSON object with duplicate keys. It is a dynamic error () if the parameter map contains an entry whose key is defined in the specification offn:parse-json
and whose value is not valid for that key, or if it contains an entry with the key fallback when the parameterescape
withtrue()
is also present. The serialization property is removed. Casting from a text media type to a JSON media type parses the text value of the document onsource
port by callingfn:parse-json($doc, $par)
where$doc
is the text document and$par
is theparameter
option. It is a dynamic error () if the text document does not conform to the JSON grammar, unless the parameter liberal is true and the processor chooses to accept the deviation. It is a dynamic error () if the parameter duplicates is reject and the text document contains a JSON object with duplicate keys. It is a dynamic error () if the parameter map contains an entry whose key is defined in the specification offn:parse-json
and whose value is not valid for that key, or if it contains an entry with the key fallback when the parameterescape
withtrue()
is also present. The serialization property is removed.Casting from a text media type to another text media type changes “
content-type
” document property.Casting from a text media type to any other media type is implementation-defined.
Casting from a non-XML media type to an XML media type produces an XML document with a
c:data
document element. The original media type will be preserved in thecontent-type
attribute on thec:data
element.<c:data
content-type = ContentType
charset? = string
encoding? = string>
string
</c:data>The content of the
c:data
element is the base64 encoded representation of the non-XML content. The serialization property is removed.Casting from any other media type to a HTML media type, a JSON media type or a text document is implementation-defined.
Casting from any other media type to any other media type is implementation-defined.
Document properties
All document properties are preserved except the content-type
property which is updated accordingly and the serialization
property which is removed by some casting methods.
The p:compare
step compares two documents for equality.
<p:declare-step
type
="
p:compare
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
/>
<p:input
port
="
alternate
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
application/xml
"
/>
<p:output
port
="
differences
"
content-types
="
any
"
sequence
="
true
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
method
"
as
="
xs:QName?
"
/>
<p:option
name
="
fail-if-not-equal
"
as
="
xs:boolean
"
select
="
false()
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | any | |
alternate | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | application/xml | |
differences | ✔ | any |
Option name | Type | Default value |
---|---|---|
fail-if-not-equal | xs:boolean | false() |
method | xs:QName? | () |
parameters | map(xs:QName,item()*)? | () |
Errors
Error code | Description |
---|---|
err:XC0019 | It is a dynamic error if the documents are not equal according to the specified comparison method, and the value of the fail-if-not-equal option is true. |
err:XC0076 | It is a dynamic error if the comparison method specified in p:compare is not supported by the implementation. |
err:XC0077 | It is a dynamic error if the media types of the documents supplied are incompatible with the comparison method. |
Implementation details
Implementation | Description |
---|---|
Defined | Implementations of p:compare must support the deep-equal method; other supported methods are implementation-defined. |
Defined | If fail-if-not-equal is false, and the documents differ, an implementation-defined summary of the differences between the two documents may appear on the differences port. |
Declaration
<p:declare-step
type
="
p:compare
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
/>
<p:input
port
="
alternate
"
content-types
="
any
"
/>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
application/xml
"
/>
<p:output
port
="
differences
"
content-types
="
any
"
sequence
="
true
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
method
"
as
="
xs:QName?
"
/>
<p:option
name
="
fail-if-not-equal
"
as
="
xs:boolean
"
select
="
false()
"
/>
</p:declare-step>
This step takes single documents on each of two ports and compares them. If method
is not specified, or if deep-equal
is specified, the comparison uses fn:deep-equal
(as defined in []). Implementations of p:compare
must support the deep-equal
method
; other supported methods are implementation-defined.It is a dynamic error () if the comparison method
specified in p:compare
is not supported by the implementation. It is a dynamic error () if the media types of the documents supplied are incompatible with the comparison method
. This step takes single documents on each of two ports and compares them. If method
is not specified, or if deep-equal
is specified, the comparison uses fn:deep-equal
(as defined in []). Implementations of p:compare
must support the deep-equal
method
; other supported methods are implementation-defined.It is a dynamic error () if the comparison method
specified in p:compare
is not supported by the implementation. It is a dynamic error () if the media types of the documents supplied are incompatible with the comparison method
.
It is a dynamic error () if the documents are not equal according to the specified comparison method
, and the value of the fail-if-not-equal
option is true
. If the documents are equal, or if the value of the fail-if-not-equal
option is false
, a document is produced with contents true
if the documents are equal, otherwise false
.It is a dynamic error () if the documents are not equal according to the specified comparison method
, and the value of the fail-if-not-equal
option is true
. If the documents are equal, or if the value of the fail-if-not-equal
option is false
, a document is produced with contents true
if the documents are equal, otherwise false
.
If fail-if-not-equal
is false
, and the documents differ, an implementation-defined summary of the differences between the two documents may appear on the differences
port.
Document properties
No document properties are preserved. The comparison document has no base-uri
.
The p:compress
step serializes the document appearing on its source
port and outputs a compressed version of this on its result
port.The p:compress
step serializes the document appearing on its source
port and outputs a compressed version of this on its result
port.
<p:declare-step
type
="
p:compress
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:option
name
="
format
"
as
="
xs:QName
"
select
="
'gzip'
"
/>
<p:option
name
="
serialization
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | any |
Option name | Type | Default value |
---|---|---|
format | xs:QName | 'gzip' |
parameters | map(xs:QName, item()*)? | () |
serialization | map(xs:QName,item()*)? | () |
Errors
Error code | Description |
---|---|
err:XC0079 | It is a dynamic error if the map parameters contains an entry whose key is defined by the implementation and whose value is not valid for that key. |
err:XC0202 | It is a dynamic error if the compression format cannot be understood, determined and/or processed. |
Implementation details
Implementation | Description |
---|---|
Defined | It is implementation-defined what other formats are supported. |
Defined | The semantics of the keys and the allowed values for these keys are implementation-defined. |
Declaration
<p:declare-step
type
="
p:compress
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
any
"
sequence
="
false
"
/>
<p:option
name
="
format
"
as
="
xs:QName
"
select
="
'gzip'
"
/>
<p:option
name
="
serialization
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
</p:declare-step>
The p:compress
step first serializes the document appearing on its source
. It then compresses the outcome of this serialization and outputs the result on its result
port.The p:compress
step first serializes the document appearing on its source
. It then compresses the outcome of this serialization and outputs the result on its result
port.
The p:compress
step has the following options:
format
The format of the compression can be specified using the
format
option. Implementations must support the [] format, specified with the valuegzip
. It is implementation-defined what other formats are supported.It is a dynamic error () if the compression format cannot be understood, determined and/or processed. The format of the compression can be specified using theformat
option. Implementations must support the [] format, specified with the valuegzip
. It is implementation-defined what other formats are supported.It is a dynamic error () if the compression format cannot be understood, determined and/or processed.parameters
The
parameters
option can be used to supply parameters to control the compression. The semantics of the keys and the allowed values for these keys are implementation-defined.It is a dynamic error () if the mapparameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.Theparameters
option can be used to supply parameters to control the compression. The semantics of the keys and the allowed values for these keys are implementation-defined.It is a dynamic error () if the mapparameters
contains an entry whose key is defined by the implementation and whose value is not valid for that key.serialization
The
serialization
option is provided to control the serialization of content before compression takes place. If the document to be stored has aserialization
property, the serialization is controlled by the merger of the two maps where the entries in theserialization
property take precedence. Serialization is described in [].Theserialization
option is provided to control the serialization of content before compression takes place. If the document to be stored has aserialization
property, the serialization is controlled by the merger of the two maps where the entries in theserialization
property take precedence. Serialization is described in [].
Document properties
All document properties are preserved, except for the content-type
property which is updated accordingly and the serialization
property which is removed.
The p:count
step counts the number of documents in the source
input sequence and returns a single document on result
containing that number. The generated document contains a single c:result
element whose contents is the string representation of the number of documents in the sequence.
<p:declare-step
type
="
p:count
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
result
"
content-types
="
application/xml
"
/>
<p:option
name
="
limit
"
as
="
xs:integer
"
select
="
0
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | application/xml |
Option name | Type | Default value |
---|---|---|
limit | xs:integer | 0 |
Declaration
<p:declare-step
type
="
p:count
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
result
"
content-types
="
application/xml
"
/>
<p:option
name
="
limit
"
as
="
xs:integer
"
select
="
0
"
/>
</p:declare-step>
If the limit
option is specified and is greater than zero, the p:count
step will count at most that many documents. This provides a convenient mechanism to discover, for example, if a sequence consists of more than 1 document, without requiring every single document to be buffered before processing can continuecounted.
Document properties
No document properties are preserved. The count document has no base-uri
.
The p:delete
step deletes items specified by a selection pattern from the source
input document and produces the resulting document, with the deleted items removed, on the result
port.The p:delete
step deletes items specified by a selection pattern from the source
input document and produces the resulting document, with the deleted items removed, on the result
port.
<p:declare-step
type
="
p:delete
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text xml html |
Option name | Type | Required |
---|---|---|
match | XSLTSelectionPattern | ✔ |
Errors
Error code | Description |
---|---|
err:XC0023 | It is a dynamic error if the match option matches the document node. |
err:XC0062 | It is a dynamic error if the match option matches a namespace node. |
Declaration
<p:declare-step
type
="
p:delete
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern. A selection pattern may match multiple items to be deleted.The value of the match
option must be an XSLTSelectionPattern. A selection pattern may match multiple items to be deleted.
If an element is selected by the match
option, the entire subtree rooted at that element is deleted.If an element is selected by the match
option, the entire subtree rooted at that element is deleted.
It is a dynamic error () if the match
option matches the document node.It is a dynamic error () if the match
option matches the document node.
This step cannot be used to remove namespaces. It is a dynamic error () if the match
option matches a namespace node. Also, note that deleting an attribute named xml:base
does not change the base URI of the element on which it occurred.This step cannot be used to remove namespaces. It is a dynamic error () if the match
option matches a namespace node. Also, note that deleting an attribute named xml:base
does not change the base URI of the element on which it occurred.
Document properties
If the resulting document contains exactly one text node, the content-type
property is changed to text/plain
and the serialization
property is removed, while all other document properties are preserved. In all other cases, all document properties are preserved.
The p:error
step generates a dynamic error using the input provided to the step.
<p:declare-step
type
="
p:error
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
text xml
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
any
"
/>
<p:option
name
="
code
"
required
="
true
"
as
="
xs:QName
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | text xml |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | ✔ | any |
Option name | Type | Required |
---|---|---|
code | xs:QName | ✔ |
Declaration
<p:declare-step
type
="
p:error
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
text xml
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
any
"
/>
<p:option
name
="
code
"
required
="
true
"
as
="
xs:QName
"
/>
</p:declare-step>
This step uses the document provided on its input as the content of the error raised. An instance of the c:errors
element will be produced on the error output port, as is always the case for dynamic errors. The error generated can be caught by a p:try
just like any other dynamic error.
For authoring convenience, the p:error
step is declared with a single, primary output port. With respect to connections, this port behaves like any other output port even though nothing can ever appear on it since the step always fails.
For example, given the following invocation:
<p:error xmlns:my="http://www.example.org/error" name="bad-document" code="my:unk12"> <p:with-input port="source"> <message>The document element is unknown.</message> </p:with-input> </p:error>
The error vocabulary element (and document) generated on the error output port would be:
<c:errors xmlns:c="http://www.w3.org/ns/xproc-step" xmlns:p="http://www.w3.org/ns/xproc" xmlns:my="http://www.example.org/error"> <c:error name="bad-document" type="p:error" code="my:unk12"><message >The document element is unknown.</message> </c:error> </c:errors>
The href
, line
and column
, or offset
, might also be present on the c:error
to identify the location of the p:error
element in the pipeline.
Document properties
No document properties are preserved but that’s irrelevant as no document is ever produced.
The p:filter
step selects portions of the source document based on a (possibly dynamically constructed) XPath select expression.
<p:declare-step
type
="
p:filter
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
text xml html
"
/>
<p:option
name
="
select
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XPathExpression -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | ✔ | text xml html |
Option name | Type | Required |
---|---|---|
select | XPathExpression | ✔ |
Declaration
<p:declare-step
type
="
p:filter
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
text xml html
"
/>
<p:option
name
="
select
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XPathExpression -->
</p:declare-step>
This step behaves just like an p:input
with a select
expression except that the select expression is computed dynamically.
Document properties
No document properties are preserved. The base-uri
property of each document will reflect the base URI of the selected node(s).
The p:hash
step generates a hash, or digital “fingerprint”, for some value and injects it into the source
document.The p:hash
step generates a hash, or digital “fingerprint”, for some value and injects it into the source
document.
<p:declare-step
type
="
p:hash
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
value
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
algorithm
"
required
="
true
"
as
="
xs:QName
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*/node()'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
version
"
as
="
xs:string?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text xml html |
Option name | Type | Default value | Required |
---|---|---|---|
algorithm | xs:QName | ✔ | |
value | xs:string | ✔ | |
match | XSLTSelectionPattern | '/*/node()' | |
parameters | map(xs:QName,item()*)? | () | |
version | xs:string? | () |
Errors
Error code | Description |
---|---|
err:XC0036 | It is a dynamic error if the requested hash algorithm is not one that the processor understands or if the value or parameters are not appropriate for that algorithm. |
Implementation details
Implementation | Description |
---|---|
Defined | It is implementation-defined what other algorithms are supported. |
Declaration
<p:declare-step
type
="
p:hash
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
value
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
algorithm
"
required
="
true
"
as
="
xs:QName
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*/node()'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
version
"
as
="
xs:string?
"
/>
</p:declare-step>
The value of the algorithm
option must be a QName. If it does not have a prefix, then it must be one of the following values: “crc”, “md”, or “sha”.The value of the algorithm
option must be a QName. If it does not have a prefix, then it must be one of the following values: “crc”, “md”, or “sha”.
If a version
is not specified, the default version is algorithm-defined. For “crc
” it is 32, for “md
” it is 5, for “sha
” it is 1.
A hash is constructed from the string specified in the value
option using the specified algorithm and version. Implementations must support [], [], and [] hashes. It is implementation-defined what other algorithms are supported. The resulting hash should be returned as a string of hexadecimal characters. A hash is constructed from the string specified in the value
option using the specified algorithm and version. Implementations must support [], [], and [] hashes. It is implementation-defined what other algorithms are supported. The resulting hash should be returned as a string of hexadecimal characters.
The value of the match
option must be an XSLTSelectionPattern.The value of the match
option must be an XSLTSelectionPattern.
The hash of the specified value is computed using the algorithm and parameters specified. It is a dynamic error () if the requested hash algorithm is not one that the processor understands or if the value or parameters are not appropriate for that algorithm.The hash of the specified value is computed using the algorithm and parameters specified. It is a dynamic error () if the requested hash algorithm is not one that the processor understands or if the value or parameters are not appropriate for that algorithm.
The matched nodes are specified with the selection pattern in the match
option. For each matching node, the string value of the computed hash is used in the output (if more than one node matches, the same hash value is used in each match). Nodes that do not match are copied without change.The matched nodes are specified with the selection pattern in the match
option. For each matching node, the string value of the computed hash is used in the output (if more than one node matches, the same hash value is used in each match). Nodes that do not match are copied without change.
If the expression given in the match
option matches an attribute, the hash is used as the new value of the attribute in the output. If the attribute is named “xml:base
”, the base URI of the element must also be amended accordingly.If the expression given in the match
option matches an attribute, the hash is used as the new value of the attribute in the output. If the attribute is named “xml:base
”, the base URI of the element must also be amended accordingly.
If the document node is matched, the entire document is replaced by a text node with the hash. What appears on port result is a text document with the text node wrapped in a document node.
If the expression matches any other kind of node, the entire node (and not just its contents) is replaced by the hash.
Document properties
If the resulting document contains exactly one text node, the content-type
property is changed to text/plain
and the serialization
property is removed, while all other document properties are preserved. For other document types, all document properties are preserved.
The p:http-request
step allows authors to interact with resources over HTTP or related protocols. Implementations must support the http
and https
protocols. (Implementors are encouraged to support as many protocols as practical. In particular, pipeline authors may attempt to use p:http-request
to load documents with computed URIs using the file:
scheme.)
<p:declare-step
type
="
p:http-request
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
report
"
content-types
="
application/json
"
/>
<p:option
name
="
href
"
as
="
xs:anyURI
"
required
="
true
"
/>
<p:option
name
="
method
"
as
="
xs:string?
"
select
="
'GET'
"
/>
<p:option
name
="
serialization
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
headers
"
as
="
map(xs:string, xs:string)?
"
/>
<p:option
name
="
auth
"
as
="
map(xs:string, item()+)?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
<p:option
name
="
assert
"
as
="
xs:string
"
select
="
'.?status-code lt 400'
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | ✔ | any |
report | application/json |
Option name | Type | Default value | Required |
---|---|---|---|
href | xs:anyURI | ✔ | |
assert | xs:string | '.?status-code lt 400' | |
auth | map(xs:string, item()+)? | () | |
headers | map(xs:string, xs:string)? | () | |
method | xs:string? | 'GET' | |
parameters | map(xs:QName, item()*)? | () | |
serialization | map(xs:QName,item()*)? | () |
Errors
Error code | Description |
---|---|
err:XC0003 | It is a dynamic error if a “username” or a “password” key is present without specifying a value for the “auth-method” key, if the requested auth-method isn't supported, or the authentication challenge contains an authentication method that isn't supported. |
err:XC0030 | It is a dynamic error if the response body cannot be interpreted as requested (e.g. application/json to override application/xml content). |
err:XC0078 | It is a dynamic error if the value associated with the “fail-on-timeout” is associated with true() and a HTTP status code 408 is encountered. |
err:XC0122 | It is a dynamic error if the given method is not supported. |
err:XC0123 | It is a dynamic error if any key in the “auth” map is associated with a value that is not an instance of the required type. |
err:XC0124 | It is a dynamic error if any key in the “parameters” map is associated with a value that is not an instance of the required type. |
err:XC0125 | It is a dynamic error if the key “accept-multipart” as the value false() and a multipart response is detected. |
err:XC0126 | It is a dynamic error if the XPath expression in assert evaluates to false. |
err:XC0127 | It is a dynamic error if the headers map contains two keys that are the same when compared in a case-insensitive manner. |
err:XC0128 | It is a dynamic error if the URI’s scheme is unknown or not supported. |
err:XC0131 | It is a dynamic error if the processor cannot support the requested encoding. |
err:XC0132 | It is a dynamic error if the override content encoding cannot be supported. |
err:XC0133 | It is a dynamic error if more than one document appears on the source port and a content-type header is present and the content type specified is not a multipart content type. |
err:XC0203 | It is a dynamic error if the specified boundary is not valid (for example, if it begins with two hyphens “--”). |
err:XD0079 | It is a dynamic error if a supplied content-type is not a valid media type of the form “type/subtype+ext” or “type/subtype”. |
Implementation details
Implementation | Description |
---|---|
Defined | It is implementation-defined which HTTP methods are supported. |
Defined | The interpretation of values associated with the “auth-method” key other than “Basic” or “Digest” is implementation-defined. |
Defined | Other key value pairs in map “auth” are implementation-defined. |
Defined | It is implementation-defined which other key/value pairs in the parameters option are supported. |
Defined | The default behaviour in case of a redirect response is implementation-defined. |
Defined | It is implementation-defined how a multipart boundary is constructed. |
Declaration
<p:declare-step
type
="
p:http-request
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
result
"
primary
="
true
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
report
"
content-types
="
application/json
"
/>
<p:option
name
="
href
"
as
="
xs:anyURI
"
required
="
true
"
/>
<p:option
name
="
method
"
as
="
xs:string?
"
select
="
'GET'
"
/>
<p:option
name
="
serialization
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
headers
"
as
="
map(xs:string, xs:string)?
"
/>
<p:option
name
="
auth
"
as
="
map(xs:string, item()+)?
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName, item()*)?
"
/>
<p:option
name
="
assert
"
as
="
xs:string
"
select
="
'.?status-code lt 400'
"
/>
</p:declare-step>
The p:http-request
step performs the HTTP request specified by the method
option against the URI specified in the href
option. In simple cases, for example, a GET request on an unauthenticated URI, nothing else is necessary to form a complete request.The p:http-request
step performs the HTTP request specified by the method
option against the URI specified in the href
option. In simple cases, for example, a GET request on an unauthenticated URI, nothing else is necessary to form a complete request.
If the method, for example, POST, supports a body, the request body is constructed using the document(s) appearing on the source
port. For the convenience of pipeline authors, documents may appear on the source
port even when the request method (such as GET or HEAD) does not define the semantics of a payload. If the semantics are undefined, the documents are ignored when constructing the request unless the parameters
option specifies “send-body-anyway
” as true()
.If the method, for example, POST, supports a body, the request body is constructed using the document(s) appearing on the source
port. For the convenience of pipeline authors, documents may appear on the source
port even when the request method (such as GET or HEAD) does not define the semantics of a payload. If the semantics are undefined, the documents are ignored when constructing the request unless the parameters
option specifies “send-body-anyway
” as true()
.
The headers for the request come from the headers
option (see below). If exactly one document appears on the source
port, its document properties also contribute to the overall request headers.The headers for the request come from the headers
option (see below). If exactly one document appears on the source
port, its document properties also contribute to the overall request headers.
The response from the HTTP request appears on the result
and report
ports. Any documents contained in the response body will appear on the result
port. Each document in the response will be parsed according to its content-type (but see “override-content-type
” in the parameters
option). Details about the outcome of the request will appear as a map on the report
port. The map will always contain: The response from the HTTP request appears on the result
and report
ports. Any documents contained in the response body will appear on the result
port. Each document in the response will be parsed according to its content-type (but see “override-content-type
” in the parameters
option). Details about the outcome of the request will appear as a map on the report
port. The map will always contain:
status-code
(anxs:integer
)This is the HTTP status code returned for the request.
base-uri
(anxs:anyURI
)This is the URI of the last request made and is always available in the report even when the request does not return any documents. In the case of HTTP redirection, the base URI returned may be different from the original request URI.
headers
(amap(xs:string, xs:string)
)These are the HTTP headers returned for the request. The map may be empty. Header names are converted to lowercase.
The p:http-request
step has the following options:
href
The
href
option specifies the request’s IRI. Relative values are resolved against the base URI of the element on which the option is specified (the relevantp:with-option
or the step element in the case of a syntactic shortcut value).Thehref
option specifies the request’s IRI. Relative values are resolved against the base URI of the element on which the option is specified (the relevantp:with-option
or the step element in the case of a syntactic shortcut value).Fragment identifiers are removed before making the request. Query parameters are passed through unchanged. It is a dynamic error () if the URI’s scheme is unknown or not supported. It is the pipeline author’s responsibility to escape problematic UTF-8 characters in the
href
value, for example withescape-html-uri()
. Fragment identifiers are removed before making the request. Query parameters are passed through unchanged. It is a dynamic error () if the URI’s scheme is unknown or not supported. It is the pipeline author’s responsibility to escape problematic UTF-8 characters in thehref
value, for example withescape-html-uri()
.method
The
method
specifies the HTTP request method. The value is implicitly turned into an uppercase string if necessary. It is implementation -defined which HTTP methods are supported. An implementation should implement at least the methodsGET
,POST
,PUT
,DELETE
, andHEAD
(for HTTP and HTTPS). It is a dynamic error () if the given method is not supported.Themethod
specifies the HTTP request method. The value is implicitly turned into an uppercase string if necessary. It is implementation -defined which HTTP methods are supported. An implementation should implement at least the methodsGET
,POST
,PUT
,DELETE
, andHEAD
(for HTTP and HTTPS). It is a dynamic error () if the given method is not supported.serialization
The
serialization
option is used to control the serialization of documents for the request body. If a document has a “serialization
” document property, the effective value of the serialization options is the union of the two maps, where the entries in the “serialization
” document property take precedence.Theserialization
option is used to control the serialization of documents for the request body. If a document has a “serialization
” document property, the effective value of the serialization options is the union of the two maps, where the entries in the “serialization
” document property take precedence.headers
The key/value pairs in the
headers
map are used to construct the request headers. Each map key is used as a header name and the value associated with that key in the map is used as the header value.The key/value pairs in theheaders
map are used to construct the request headers. Each map key is used as a header name and the value associated with that key in the map is used as the header value.If a single document appears on the
source
port, then document properties on that document may be added as additional headers. For XML, HTML, and text documents with aserialization
document property having anencoding
key, acharset
is appended to the createdcontent-type
header of the HTTP request. Properties in thehttp://www.w3.org/ns/xproc-http
namespace will be added to the headers, using the local-name of the property QName as the header name. These properties are only copied if they are not specified in theheader
map. In other words, if the same header name appears in both places, the value from the map is used and the value from the document properties is ignored. (Header names are case-insensitive, so a case-insensitive comparison must be performed.) If multiple documents appear on thesource
port, none of their properties are used in the request headers.If a single document appears on thesource
port, then document properties on that document may be added as additional headers. For XML, HTML, and text documents with aserialization
document property having anencoding
key, acharset
is appended to the createdcontent-type
header of the HTTP request. Properties in thehttp://www.w3.org/ns/xproc-http
namespace will be added to the headers, using the local-name of the property QName as the header name. These properties are only copied if they are not specified in theheader
map. In other words, if the same header name appears in both places, the value from the map is used and the value from the document properties is ignored. (Header names are case-insensitive, so a case-insensitive comparison must be performed.) If multiple documents appear on thesource
port, none of their properties are used in the request headers.The behavior of the
p:http-request
depends on the headers specified. In particular:content-type
If a
content-type
header is provided, it will be used. For a single document request, this overrides the content type value of the document. If the content type specified begins with “multipart/
”, a multipart request will be sent to the server.It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”.It is a dynamic error () if a supplied content-type is not a valid media type of the form “type
/subtype
” or “type
/subtype
+ext
”.type
/subtype
transfer-encoding
If a
transfer-encoding
header is provided, the request must be sent with that encoding. It is a dynamic error () if the processor cannot support the requested encoding.If atransfer-encoding
header is provided, the request must be sent with that encoding. It is a dynamic error () if the processor cannot support the requested encoding.authorization
The
authorization
header is used to authenticate a request. If theauth
option is specified, any key or property that would have contributed a header named “authorization
” (irrespective of case) is ignored. The authorization header is determined exclusively by theauth
option when it is present.Theauthorization
header is used to authenticate a request. If theauth
option is specified, any key or property that would have contributed a header named “authorization
” (irrespective of case) is ignored. The authorization header is determined exclusively by theauth
option when it is present.
HTTP headers are case-insensitive but keys in maps are not; be careful when specifying the request headers. It is a dynamic error () if the
headers
map contains two keys that are the same when compared in a case-insensitive manner. (That is, whenfn:uppercase($key1) = fn:uppercase($key2)
.) HTTP headers are case-insensitive but keys in maps are not; be careful when specifying the request headers. It is a dynamic error () if theheaders
map contains two keys that are the same when compared in a case-insensitive manner. (That is, whenfn:uppercase($key1) = fn:uppercase($key2)
.)auth
Many web services are only available to authenticated users, that is, to users who have “logged in”. The
auth
option allows the pipeline author to specify information that may be required to generate an “Authorization
” header. The standard values support HTTP “Basic” and “Digest” authentication, but other authentication methods are allowed.Many web services are only available to authenticated users, that is, to users who have “logged in”. Theauth
option allows the pipeline author to specify information that may be required to generate an “Authorization
” header. The standard values support HTTP “Basic” and “Digest” authentication, but other authentication methods are allowed.The following standard keys are defined:
username
(xs:string
)The username.
password
(xs:string
)The password associated with the username.
auth-method
(xs:string
)The authentication method. Appropriate values for the “
auth-method
” key are “Basic
” or “Digest
” but other values are allowed. If the authentication method is “Basic
” or “Digest
”, authentication is handled as per [RFC 2617]. The interpretation of values associated with the “auth-method
” key other than “Basic
” or “Digest
” is implementation -defined.send-authorization
(xs:boolean
)The “
send-authorization
” key can be used to attempt to allow the request to avoid an authentication challenge. If the “send-authorization
” key is “true()
”, and the authentication method specified by the value associated with the “auth-method
” key supports generation of an “Authorization
” header without a challenge, then the header is generated and sent on the first request. If the “send-authorization
” key is absent or does not have the value “true
”, the first request is sent without an “Authorization
” header.
Other key value pairs in map “
auth
” are implementation -defined.It is a dynamic error () if any key in the “auth
” map is associated with a value that is not an instance of the required type.Other key value pairs in map “auth
” are implementation -defined.It is a dynamic error () if any key in the “auth
” map is associated with a value that is not an instance of the required type.If the initial response to the request is an authentication challenge, the values provided in the
auth
map and any relevant data from the challenge are used to generate an “Authorization
” header and the request is sent again. If that authorization fails, the request is not retried.It is a dynamic error () if a “
username
” or a “password
” key is present without specifying a value for the “auth-method
” key, if the requestedauth-method
isn't supported, or the authentication challenge contains an authentication method that isn't supported. All implementations must support “Basic” and “Digest” authentication per [].It is a dynamic error () if a “username
” or a “password
” key is present without specifying a value for the “auth-method
” key, if the requestedauth-method
isn't supported, or the authentication challenge contains an authentication method that isn't supported. All implementations must support “Basic” and “Digest” authentication per [].- parameters
The
parameter
option can be used to provide values for fine tuning the construction of the request and/or handling of the server response. A number of parameters are defined in this specification. It is implementation -defined which other key/value pairs in theparameters
option are supported.Theparameter
option can be used to provide values for fine tuning the construction of the request and/or handling of the server response. A number of parameters are defined in this specification. It is implementation -defined which other key/value pairs in theparameters
option are supported.- override-content-type (
xs:string
) Ordinarily, the value of the
content-type
header provided in the server response controls the interpretation of any body in the response. If the “override-content-type
” parameter is provided, then its value is used to interpret the body. The content-type header that appears on thereport
port is not changed. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if the response body cannot be interpreted as requested (e.g.type
/subtype
application/json
to overrideapplication/xml
content).Ordinarily, the value of thecontent-type
header provided in the server response controls the interpretation of any body in the response. If the “override-content-type
” parameter is provided, then its value is used to interpret the body. The content-type header that appears on thereport
port is not changed. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if the response body cannot be interpreted as requested (e.g.type
/subtype
application/json
to overrideapplication/xml
content).- http-version (
xs:string
) The
http-version
parameter indicates which version of HTTP must be used for the request.- accept-multipart (
xs:boolean
) If the
accept-multipart
parameter is present and explicitly has the valuefalse()
, a dynamic error will be raised, if a multipart response is received from the server. This feature is a convenience for pipeline authors as it will raise an error when the multipart request is received, rather than having the presence of a sequence raise an error further along in the pipeline, or simply producing anomalous results. It is a dynamic error () if the key “accept-multipart
” as the valuefalse()
and a multipart response is detected.If theaccept-multipart
parameter is present and explicitly has the valuefalse()
, a dynamic error will be raised, if a multipart response is received from the server. This feature is a convenience for pipeline authors as it will raise an error when the multipart request is received, rather than having the presence of a sequence raise an error further along in the pipeline, or simply producing anomalous results. It is a dynamic error () if the key “accept-multipart
” as the valuefalse()
and a multipart response is detected.- override-content-encoding (
xs:string
) If the “
override-content-encoding
” parameter is present, the response will be treated as if the response contained a “content-encoding
” header with the specified value. The content-encoding header that appears on thereport
port is not changed. It is a dynamic error () if the override content encoding cannot be supported. If the “override-content-encoding
” parameter is present, the response will be treated as if the response contained a “content-encoding
” header with the specified value. The content-encoding header that appears on thereport
port is not changed. It is a dynamic error () if the override content encoding cannot be supported.- permit-expired-ssl-certificate (
xs:boolean
) If “
permit-expired-ssl-certificate
” is true, then the processor should not reject responses where the server provides an expired SSL certificate.- permit-untrusted-ssl-certificate (
xs:boolean
) If “
permit-untrusted-ssl-certificate
” is true, then the processor should not reject response where the server provides an SSL certificate which is not trusted, for example, because the certificate authority (CA) is unknown.- follow-redirect (
xs:integer
) The “
follow-redirect
” parameter allows the pipeline author to specify the step’s behaviour in the case of a redirect response. A value of0
indicates that redirects are not to be followed,-1
indicates that redirects are to be followed indefinitely, and a specific number indicates the maximum number of redirects to follow. The default behaviour in case of a redirect response is implementation -defined.- timeout (
xs:integer
) If a “
timeout
” is specified, it must be a non-negative integer. It controls the time the XProc processor waits for the request to be answered. If a value is given, it is taken as the number of seconds to wait for the response to be delivered. If no response is received after that time, the request is terminated and a status-code408
is assumed.- fail-on-timeout (
xs:boolean
) If “
fail-on-timeout
” is true, a dynamic error is raised if a408
response is received (either as a consequence of setting a value for the “timeout
” parameter or as status code returned by a server). It is a dynamic error () if the value associated with the “fail-on-timeout
” is associated withtrue()
and a HTTP status code408
is encountered. If “fail-on-timeout
” is true, it prevents any dynamic error with codeC0126
resulting from theassert
option to be raised for request's timeout.If “fail-on-timeout
” is true, a dynamic error is raised if a408
response is received (either as a consequence of setting a value for the “timeout
” parameter or as status code returned by a server). It is a dynamic error () if the value associated with the “fail-on-timeout
” is associated withtrue()
and a HTTP status code408
is encountered. If “fail-on-timeout
” is true, it prevents any dynamic error with codeC0126
resulting from theassert
option to be raised for request's timeout.Note
Please note that the “
fail-on-timeout
” parameter is different from the “timeout
” option on thep:http-request
step (see Controlling long running steps in XProc 3.0: An XML Pipeline Language). If the step does not finish in the specified time,D0053
is raised. If the request does not finish in time, andfail-on-timeout
is true,C0078
is raised. The actual times after which a timeout is detected may also differ slightly.- status-only (
xs:boolean
) If the “
status-only
” parameter is true, this indicates that the pipeline author is only interested in the response code. An empty sequence is always returned on theresult
port in this case. The implementation may save resources by ignoring the response body. The map on thereport
will contain the status code and an empty map for “headers
”.If the “status-only
” parameter is true, this indicates that the pipeline author is only interested in the response code. An empty sequence is always returned on theresult
port in this case. The implementation may save resources by ignoring the response body. The map on thereport
will contain the status code and an empty map for “headers
”.- suppress-cookies (
xs:boolean
) If the “
suppress-cookies
” parameter is true, the implementation must not send any cookies with the request.- send-body-anyway (
xs:boolean
) If the “
send-body-anyway
” parameter is true, and one or more documents appear on thesource
port, a request body is constructed from the documents and sent with the request, even if the semantics of sending a body are not specified for the HTTP method in use. If the “send-body-anyway
” parameter is true, and one or more documents appear on thesource
port, a request body is constructed from the documents and sent with the request, even if the semantics of sending a body are not specified for the HTTP method in use.
It is a dynamic error () if any key in the “parameters” map is associated with a value that is not an instance of the required type.It is a dynamic error () if any key in the “parameters” map is associated with a value that is not an instance of the required type.
- override-content-type (
- assert (
xs:string
) The
assert
option can be used by pipeline authors to raise a dynamic error if the response does not fulfill the expectations of the receiver. The option's value (if present) is interpreted as an XPath expression which will be executed using the map that appears on thereport
port as its context item. If the effective boolean value of the expression isfalse()
, a dynamic error is raised. It is a dynamic error () if the XPath expression inassert
evaluates tofalse
. Implementations should provide an XML representation of the map used as the context item with the error document to enable pipelines to access the error's cause.Theassert
option can be used by pipeline authors to raise a dynamic error if the response does not fulfill the expectations of the receiver. The option's value (if present) is interpreted as an XPath expression which will be executed using the map that appears on thereport
port as its context item. If the effective boolean value of the expression isfalse()
, a dynamic error is raised. It is a dynamic error () if the XPath expression inassert
evaluates tofalse
. Implementations should provide an XML representation of the map used as the context item with the error document to enable pipelines to access the error's cause.
If more than one document appears on the source
port, or if the specified “content-type
” header begins “multipart/
”, a multipart request will be constructed, per []. The content type of the request is derived from the “content-type
” header:If more than one document appears on the source
port, or if the specified “content-type
” header begins “multipart/
”, a multipart request will be constructed, per []. The content type of the request is derived from the “content-type
” header:
If the “
content-type
” header specifies a multipart content type, that value will be used as the content type. If the header includes aboundary
parameter, that value will be used as the boundary. It is a dynamic error () if the specified boundary is not valid (for example, if it begins with two hyphens “--”). If the “content-type
” header specifies a multipart content type, that value will be used as the content type. If the header includes aboundary
parameter, that value will be used as the boundary. It is a dynamic error () if the specified boundary is not valid (for example, if it begins with two hyphens “--”).If the “
content-type
” header is not specified, “multipart/mixed
” will be used.It is a dynamic error () if more than one document appears on the
source
port and acontent-type
header is present and the content type specified is not a multipart content type. It is a dynamic error () if more than one document appears on thesource
port and acontent-type
header is present and the content type specified is not a multipart content type.
A multipart request must have a boundary marker, if one isn’t specified in the content type, the implementation must construct one. It is implementation-defined how a multipart boundary is constructed. Implementations are not required to guarantee that the constructed value does not appear accidentally in the multipart data. If it does, the request will be malformed; pipeline authors must provide a boundary if they wish to assure that this cannot happen.
Each document in the sequence is serialized. If the document has a “serialization
” document property, its values are used to determine how serialization is performed.
All of the document properties in the http://www.w3.org/ns/xproc-http
namespace will be added as headers for the part, using the local-name of the property QName as the header name. In particular, this is how the “id
”, “description
”, “disposition
” and other multipart headers can be provided.
When a multipart response is received, each part is interpreted according to it’s content type and a pipeline document is constructed. Any additional headers associated with the part are added to the document properties of the constructed document.
The multipart response is the resulting sequence of documents.
Document properties
No document properties are preserved.
The p:identity
step makes a verbatim copy of its input available on its output.
<p:declare-step
type
="
p:identity
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
any
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
any
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | ✔ | any |
Declaration
<p:declare-step
type
="
p:identity
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
any
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
any
"
/>
</p:declare-step>
If the implementation supports passing PSVI annotations between steps, the p:identity
step must preserve any annotations that appear in the input.
Document properties
All document properties are preserved.
The p:insert
step inserts the insertion
port's document into the source
port's document relative to the matching elements in the source
port's document.The p:insert
step inserts the insertion
port's document into the source
port's document relative to the matching elements in the source
port's document.
<p:declare-step
type
="
p:insert
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:input
port
="
insertion
"
sequence
="
true
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
position
"
values
="
('first-child','last-child','before','after')
"
select
="
'after'
"
/>
<!--
string -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html | |
insertion | ✔ | xml html text |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html text |
Option name | Type | Values | Default value |
---|---|---|---|
match | XSLTSelectionPattern | '/*' | |
position | xs:string | ('first-child','last-child','before','after') | 'after' |
Errors
Error code | Description |
---|---|
err:XC0023 | It is a dynamic error if that pattern matches an attribute or a namespace node. |
err:XC0024 | It is a dynamic error if the selection pattern matches a document node and the value of the position is “before” or “after”. |
err:XC0025 | It is a dynamic error if the selection pattern matches anything other than an element or a document node and the value of the position option is “first-child” or “last-child”. |
Declaration
<p:declare-step
type
="
p:insert
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:input
port
="
insertion
"
sequence
="
true
"
content-types
="
xml html text
"
/>
<p:output
port
="
result
"
content-types
="
xml html text
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
position
"
values
="
('first-child','last-child','before','after')
"
select
="
'after'
"
/>
<!--
string -->
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that pattern matches an attribute or a namespace node. Multiple matches are allowed, in which case multiple copies of the insertion
documents will occur. If no elements match, then the document is unchanged.The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that pattern matches an attribute or a namespace node. Multiple matches are allowed, in which case multiple copies of the insertion
documents will occur. If no elements match, then the document is unchanged.
The value of the position
option must be an NMTOKEN in the following list: The value of the position
option must be an NMTOKEN in the following list:
“
first-child
” - the insertion is made as the first child of the match;“
last-child
” - the insertion is made as the last child of the match;“
before
” - the insertion is made as the immediate preceding sibling of the match;“
after
” - the insertion is made as the immediate following sibling of the match.
It is a dynamic error () if the selection pattern matches anything other than an element or a document node and the value of the position
option is “first-child
” or “last-child
”. It is a dynamic error () if the selection pattern matches a document node and the value of the position
is “before
” or “after
”.It is a dynamic error () if the selection pattern matches anything other than an element or a document node and the value of the position
option is “first-child
” or “last-child
”. It is a dynamic error () if the selection pattern matches a document node and the value of the position
is “before
” or “after
”.
As the inserted elements are part of the output of the step they are not considered in determining matching elements. If an empty sequence appears on the insertion
port, the result will be the same as the source.As the inserted elements are part of the output of the step they are not considered in determining matching elements. If an empty sequence appears on the insertion
port, the result will be the same as the source.
Document properties
All document properties on the source
port are preserved. The document properties on the insertion
port are not preserved or present in the result document.All document properties on the source
port are preserved. The document properties on the insertion
port are not preserved or present in the result document.
The p:json-join
step joins the sequence of documents on port source
into a single JSON document (an array) appearing on port result
. If the sequence on port source
is empty, the empty sequence is returned on port result
.The p:json-join
step joins the sequence of documents on port source
into a single JSON document (an array) appearing on port result
. If the sequence on port source
is empty, the empty sequence is returned on port result
.
<p:declare-step
type
="
p:json-join
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
application/json
"
/>
<p:option
name
="
flatten-to-depth
"
as
="
xs:string?
"
select
="
'0'
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | application/json |
Option name | Type | Default value |
---|---|---|
flatten-to-depth | xs:string? | '0' |
Errors
Error code | Description |
---|---|
err:XC0111 | It is a dynamic error if a document of an unsupported document type appears on port source of p:json-join. |
err:XC0119 | It is a dynamic error if flatten is neither “unbounded”, nor a string that may be cast to a non-negative integer. |
Implementation details
Implementation | Description |
---|---|
Defined | It is implementation-defined if p:json-join is able to process document types not mentioned yet, i.e. types of binary documents. |
Declaration
<p:declare-step
type
="
p:json-join
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
application/json
"
/>
<p:option
name
="
flatten-to-depth
"
as
="
xs:string?
"
select
="
'0'
"
/>
</p:declare-step>
The step inspects the documents on port source
in turn to create the resulting array:The step inspects the documents on port source
in turn to create the resulting array:
If the document under inspection is a JSON document representing an array, the array is copied to the resulting array according to the setting of option
flatten-to-depth
.If the document under inspection is a JSON document representing an array, the array is copied to the resulting array according to the setting of optionflatten-to-depth
.For every other type of JSON document, for XML documents, HTML documents, or text documents, their XDM representation is appended to the resulting array.
It is implementation -defined if
p:json-join
is able to process document types not mentioned yet, i.e. types of binary documents. If a processor supports a given type of documents, an entry is created as described above. It is a dynamic error () if a document of an unsupported document type appears on portsource
ofp:json-join
. It is implementation -defined ifp:json-join
is able to process document types not mentioned yet, i.e. types of binary documents. If a processor supports a given type of documents, an entry is created as described above. It is a dynamic error () if a document of an unsupported document type appears on portsource
ofp:json-join
.
The option flatten-to-depth
controls whether and to which depth members of an array appearing on port source
are flattened. It is a dynamic error () if flatten
is neither “unbounded
”, nor a string that may be cast to a non-negative integer. An integer value of 0
, which is the default, means that no flattening takes place, so the array appearing on port source
will be contained as an array in the resulting array. An integer value of 1
means that an array on port source
is flattened, i.e. the members of that array will appear as individual members in the resulting array. Any value greater than 1
means that the flattening is applied recursively to arrays in arrays up to the given depth. A value of “unbounded
” means that all arrays in arrays will be flattened. As a consequence, the resulting array appearing on port result
will not have any arrays as members. The option flatten-to-depth
controls whether and to which depth members of an array appearing on port source
are flattened. It is a dynamic error () if flatten
is neither “unbounded
”, nor a string that may be cast to a non-negative integer. An integer value of 0
, which is the default, means that no flattening takes place, so the array appearing on port source
will be contained as an array in the resulting array. An integer value of 1
means that an array on port source
is flattened, i.e. the members of that array will appear as individual members in the resulting array. Any value greater than 1
means that the flattening is applied recursively to arrays in arrays up to the given depth. A value of “unbounded
” means that all arrays in arrays will be flattened. As a consequence, the resulting array appearing on port result
will not have any arrays as members.
Document properties
No document properties are preserved. The joined document has no base-uri
.
The p:json-merge
step merges the sequence of appearing on port source
into a single JSON object appearing on port result
. If the sequence on port source
is empty, the empty sequence is returned on port result
.The p:json-merge
step merges the sequence of appearing on port source
into a single JSON object appearing on port result
. If the sequence on port source
is empty, the empty sequence is returned on port result
.
<p:declare-step
type
="
p:json-merge
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
application/json
"
/>
<p:option
name
="
duplicates
"
values
="
('reject', 'use-first', 'use-last', 'use-any', 'combine')
"
select
="
'use-first'
"
/>
<!--
string -->
<p:option
name
="
key
"
as
="
xs:string
"
select
="
'concat("_",$p:index)'
"
/>
<!--
XPathExpression -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | application/json |
Option name | Type | Values | Default value |
---|---|---|---|
duplicates | xs:string | ('reject', 'use-first', 'use-last', 'use-any', 'combine') | 'use-first' |
key | XPathExpression | 'concat("_",$p:index)' |
Errors
Error code | Description |
---|---|
err:XC0106 | It is a dynamic error if duplicate keys are encountered and option duplicates has value “reject”. |
err:XC0107 | It is a dynamic error if a document of a not supported document type appears on port source of p:json-merge. |
err:XC0110 | It is a dynamic error if the evaluation of the XPath expression in option key for a given item returns either a sequence, an array, a map, or a function. |
Implementation details
Implementation | Description |
---|---|
Defined | It is implementation-defined if p:json-merge is able to process document types not mentioned yet, i.e. types of binary documents. |
Declaration
<p:declare-step
type
="
p:json-merge
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
application/json
"
/>
<p:option
name
="
duplicates
"
values
="
('reject', 'use-first', 'use-last', 'use-any', 'combine')
"
select
="
'use-first'
"
/>
<!--
string -->
<p:option
name
="
key
"
as
="
xs:string
"
select
="
'concat("_",$p:index)'
"
/>
<!--
XPathExpression -->
</p:declare-step>
The step inspects the documents on port source
in turn to create the resulting map:The step inspects the documents on port source
in turn to create the resulting map:
If the document under inspection is a JSON document representing a map, all key-value pairs are copied into the result map unless this map already contains an entry with the given key. In this case the value of option
duplicates
determines the policy for handling duplicate keys as specified for functionmap:merge
in []. It is a dynamic error () if duplicate keys are encountered and optionduplicates
has value “reject
”.If the document under inspection is a JSON document representing a map, all key-value pairs are copied into the result map unless this map already contains an entry with the given key. In this case the value of optionduplicates
determines the policy for handling duplicate keys as specified for functionmap:merge
in []. It is a dynamic error () if duplicate keys are encountered and optionduplicates
has value “reject
”.For every other type of JSON document, for XML documents, HTML documents, or text documents a new key-value pair is created and put into the resulting map. The key is created by evaluating the XPath expression in option
key
with the inspected document as context item. If the evaluation result is a single atomic value, it is taken as key. If the evaluation result is a node, its string value is taken as key. It is a dynamic error () if the evaluation of the XPath expression in optionkey
for a given item returns either a sequence, an array, a map, or a function. Duplicate keys are handled as described above. The XDM representation of the inspected document is taken as value of the key-value pair.For every other type of JSON document, for XML documents, HTML documents, or text documents a new key-value pair is created and put into the resulting map. The key is created by evaluating the XPath expression in optionkey
with the inspected document as context item. If the evaluation result is a single atomic value, it is taken as key. If the evaluation result is a node, its string value is taken as key. It is a dynamic error () if the evaluation of the XPath expression in optionkey
for a given item returns either a sequence, an array, a map, or a function. Duplicate keys are handled as described above. The XDM representation of the inspected document is taken as value of the key-value pair.It is implementation -defined if
p:json-merge
is able to process document types not mentioned yet, i.e. types of binary documents. If a processor supports a given type of documents, the key-value pair is created as described above. It is a dynamic error () if a document of a not supported document type appears on portsource
ofp:json-merge
. It is implementation -defined ifp:json-merge
is able to process document types not mentioned yet, i.e. types of binary documents. If a processor supports a given type of documents, the key-value pair is created as described above. It is a dynamic error () if a document of a not supported document type appears on portsource
ofp:json-merge
.
An implementation must bind the variable “p:index
” in the static context of each evaluation of the XPath expression to the position of the document in the sequence of documents on port source
, starting with “1
”. An implementation must bind the variable “p:index
” in the static context of each evaluation of the XPath expression to the position of the document in the sequence of documents on port source
, starting with “1
”.
Document properties
No document properties are preserved. The merged document has no base-uri
.
The p:label-elements
step generates a label for each matched element and stores that label in the specified attribute.
<p:declare-step
type
="
p:label-elements
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
attribute
"
as
="
xs:QName
"
select
="
'xml:id'
"
/>
<p:option
name
="
label
"
as
="
xs:string
"
select
="
'concat("_",$p:index)'
"
/>
<!--
XPathExpression -->
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
replace
"
as
="
xs:boolean
"
select
="
true()
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Default value |
---|---|---|
attribute | xs:QName | 'xml:id' |
label | XPathExpression | 'concat("_",$p:index)' |
match | XSLTSelectionPattern | '*' |
replace | xs:boolean | true() |
Errors
Error code | Description |
---|---|
err:XC0023 | It is a dynamic error if that expression matches anything other than element nodes. |
Declaration
<p:declare-step
type
="
p:label-elements
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
attribute
"
as
="
xs:QName
"
select
="
'xml:id'
"
/>
<p:option
name
="
label
"
as
="
xs:string
"
select
="
'concat("_",$p:index)'
"
/>
<!--
XPathExpression -->
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
replace
"
as
="
xs:boolean
"
select
="
true()
"
/>
</p:declare-step>
The value of the label
option is an XPath expression used to generate the value of the attribute label.The value of the label
option is an XPath expression used to generate the value of the attribute label.
The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that expression matches anything other than element nodes.The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that expression matches anything other than element nodes.
The value of the replace
must be a boolean value and is used to indicate whether existing attribute values are replaced.The value of the replace
must be a boolean value and is used to indicate whether existing attribute values are replaced.
This step operates by generating attribute labels for each element matched. For every matched element, the expression is evaluated with the context node set to the matched element. An attribute is added to the matched element using the attribute name is specified the attribute
option and the string value of result of evaluating the expression. If the attribute already exists on the matched element, the value is replaced with the string value only if the replace
option has the value of true
.This step operates by generating attribute labels for each element matched. For every matched element, the expression is evaluated with the context node set to the matched element. An attribute is added to the matched element using the attribute name is specified the attribute
option and the string value of result of evaluating the expression. If the attribute already exists on the matched element, the value is replaced with the string value only if the replace
option has the value of true
.
If this step is used to add or change the value of an attribute named “xml:base
”, the base URI of the element must also be amended accordingly.
An implementation must bind the variable “p:index
” in the static context of each evaluation of the XPath expression to the position of the element in the sequence of matched elements. In other words, the first element (in document order) matched gets the value “1
”, the second gets the value “2
”, the third, “3
”, etc.
The result of the p:label-elements step is the input document with the attribute labels associated with matched elements. All other non-matching content remains the same.
Document properties
All document properties are preserved.
The p:load
step has no inputs but produces as its result a document specified by an IRI.
<p:declare-step
type
="
p:load
"
>
<p:output
port
="
result
"
content-types
="
any
"
/>
<p:option
name
="
href
"
required
="
true
"
as
="
xs:anyURI
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
content-type
"
as
="
xs:string?
"
/>
<p:option
name
="
document-properties
"
as
="
map(xs:QName, item()*)?
"
/>
</p:declare-step>
Summary
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | any |
Option name | Type | Default value | Required |
---|---|---|---|
href | xs:anyURI | ✔ | |
content-type | xs:string? | () | |
document-properties | map(xs:QName, item()*)? | () | |
parameters | map(xs:QName,item()*)? | () |
Errors
Error code | Description |
---|---|
err:XD0011 | It is a dynamic error if the resource referenced by a p:load element does not exist or cannot be accessed. |
err:XD0023 | It is a dynamic error if a DTD validation is performed and either the document is not valid or no DTD is found. |
err:XD0043 | It is a dynamic error if the dtd-validate parameter is true and the processor does not support DTD validation. |
err:XD0049 | It is a dynamic error if the loaded content is not a well-formed XML document. |
err:XD0057 | It is a dynamic error if the loaded content does not conform to the JSON grammar, unless the parameter liberal is true and the processor chooses to accept the deviation. |
err:XD0058 | It is a dynamic error if the parameter duplicates is reject and the value of loaded content contains a JSON object with duplicate keys. |
err:XD0059 | It is a dynamic error if the parameter map contains an entry whose key is defined in the specification of fn:parse-json and whose value is not valid for that key, or if it contains an entry with the key fallback when the parameter escape with true() is also present. |
err:XD0060 | It is a dynamic error if the content-type specifies an encoding, which is not supported by the processor. |
err:XD0062 | It is a dynamic error if the content-type is specified and the document-properties has a “content-type” that is not the same. |
err:XD0064 | It is a dynamic error if the base URI is not both absolute and valid according to . |
err:XD0078 | It is a dynamic error if the loaded document cannot be represented as an HTML document in the XPath data model. |
err:XD0079 | It is a dynamic error if a supplied content-type is not a valid media type of the form “type/subtype+ext” or “type/subtype”. |
Implementation details
Implementation | Description |
---|---|
Defined | In the absence of an explicit type, the content type is implementation-defined |
Defined | Additional XML parameters are implementation-defined. |
Defined | Text parameters are implementation-defined. |
Defined | Additional JSON parameters are implementation-defined. |
Defined | The precise way in which HTML documents are parsed into the XPath data model is implementation-defined. |
Defined | HTML parameters are implementation-defined. |
Defined | How a processor interprets other media types is implementation-defined. |
Defined | Parameters for other media types are implementation-defined. |
Declaration
<p:declare-step
type
="
p:load
"
>
<p:output
port
="
result
"
content-types
="
any
"
/>
<p:option
name
="
href
"
required
="
true
"
as
="
xs:anyURI
"
/>
<p:option
name
="
parameters
"
as
="
map(xs:QName,item()*)?
"
/>
<p:option
name
="
content-type
"
as
="
xs:string?
"
/>
<p:option
name
="
document-properties
"
as
="
map(xs:QName, item()*)?
"
/>
</p:declare-step>
If the option is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or the step in case of a syntactic shortcut value). If the href
is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or p:load
in the case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].If the option is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or the step in case of a syntactic shortcut value). If the href
is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or p:load
in the case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].
The document identified by the href
URI is loaded and returned. If the URI protocol supports redirection, then redirects must be followed.The document identified by the href
URI is loaded and returned. If the URI protocol supports redirection, then redirects must be followed.
It is a dynamic error () if the resource referenced by a p:load
element does not exist or cannot be accessed.It is a dynamic error () if the resource referenced by a p:load
element does not exist or cannot be accessed.
The behavior of this step depends on the content type of the document loaded. The content type of a document is determined as follows:
If a
content-type
property is specified indocument-properties
orcontent-type
is present, then the document must be interpreted according to that content type. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if thetype
/subtype
content-type
is specified and the document-properties has a “content-type
” that is not the same. If acontent-type
property is specified indocument-properties
orcontent-type
is present, then the document must be interpreted according to that content type. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if thetype
/subtype
content-type
is specified and the document-properties has a “content-type
” that is not the same.If the document is retrieved with a URI protocol that specifies a content type (for example,
http:
), then the document must be interpreted according to that content type.In the absence of an explicit type, the content type is implementation-defined.
The parameters
map contains additional, optional parameters that may influence the way that content is loaded. The interpretation of this map varies according to the content type. Parameter names that are in no namespace are treated as strings using only the local-name where appropriate.The parameters
map contains additional, optional parameters that may influence the way that content is loaded. The interpretation of this map varies according to the content type. Parameter names that are in no namespace are treated as strings using only the local-name where appropriate.
Broadly speaking, there are five categories of data that might be loaded: XML, text, JSON, HTML, and “other” binary data.
For an XML media type, the content is loaded and parsed as XML.
It is a dynamic error () if the loaded content is not a well-formed XML document.It is a dynamic error () if the loaded content is not a well-formed XML document.
If the dtd-validate
parameter is true
, then DTD validation must be performed when parsing the document. It is a dynamic error () if a DTD validation is performed and either the document is not valid or no DTD is found. It is a dynamic error () if the dtd-validate
parameter is true
and the processor does not support DTD validation.If the dtd-validate
parameter is true
, then DTD validation must be performed when parsing the document. It is a dynamic error () if a DTD validation is performed and either the document is not valid or no DTD is found. It is a dynamic error () if the dtd-validate
parameter is true
and the processor does not support DTD validation.
Additional XML parameters are implementation-defined.
For a text media type, the content is loaded as a text document. (A text document is an XPath data model document consisting of a single text node.)
It is a dynamic error () if the content-type
specifies an encoding, which is not supported by the processor.It is a dynamic error () if the content-type
specifies an encoding, which is not supported by the processor.
Text parameters are implementation-defined.
For a JSON media type, the content is loaded and parsed as JSON.
The parameters specified for the fn:parse-json
function in [XPath and XQuery Functions and Operators 3.1] must be supported. Additional JSON parameters are implementation-defined.
It is a dynamic error () if the loaded content does not conform to the JSON grammar, unless the parameter liberal
is true
and the processor chooses to accept the deviation.It is a dynamic error () if the loaded content does not conform to the JSON grammar, unless the parameter liberal
is true
and the processor chooses to accept the deviation.
It is a dynamic error () if the parameter duplicates
is reject
and the value of loaded content contains a JSON object with duplicate keys.It is a dynamic error () if the parameter duplicates
is reject
and the value of loaded content contains a JSON object with duplicate keys.
It is a dynamic error () if the parameter map contains an entry whose key is defined in the specification of fn:parse-json
and whose value is not valid for that key, or if it contains an entry with the key fallback when the parameter escape
with true()
is also present.It is a dynamic error () if the parameter map contains an entry whose key is defined in the specification of fn:parse-json
and whose value is not valid for that key, or if it contains an entry with the key fallback when the parameter escape
with true()
is also present.
For an HTML media type, the content is loaded and parsed into an XPath data model document that contains a tree of elements, attributes, and other nodes.
The precise way in which HTML documents are parsed into the XPath data model is implementation-defined.
It is a dynamic error () if the loaded document cannot be represented as an HTML document in the XPath data model.It is a dynamic error () if the loaded document cannot be represented as an HTML document in the XPath data model.
HTML parameters are implementation-defined.
An XProc processor may load other, arbitrary data types. How a processor interprets other media types is implementation-defined.
Parameters for other media types are implementation-defined.
Document properties
The properties specified in document-properties
are applied. If the properties do not specify a base-uri
, the base-uri
property will reflect the base URI of the loaded document. The properties specified in document-properties
are applied. If the properties do not specify a base-uri
, the base-uri
property will reflect the base URI of the loaded document.
The p:make-absolute-uris
step makes an element or attribute's value in the source document an absolute IRI value in the result document.
<p:declare-step
type
="
p:make-absolute-uris
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
base-uri
"
as
="
xs:anyURI?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Default value | Required |
---|---|---|---|
match | XSLTSelectionPattern | ✔ | |
base-uri | xs:anyURI? | () |
Errors
Error code | Description |
---|---|
err:XC0023 | It is a dynamic error if the pattern matches anything other than element or attribute nodes. |
err:XD0064 | It is a dynamic error if the base URI is not both absolute and valid according to . |
Implementation details
Implementation | Description |
---|---|
Dependent | If the IRI reference specified by the base-uri option on p:make-absolute-uris is absent and the input document has no base URI, the results are implementation-dependent. |
Declaration
<p:declare-step
type
="
p:make-absolute-uris
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
base-uri
"
as
="
xs:anyURI?
"
/>
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if the pattern matches anything other than element or attribute nodes.The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if the pattern matches anything other than element or attribute nodes.
The value of the base-uri
option must be an anyURI
. It is interpreted as an IRI reference. If it is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or p:make-absolute-uris
in the case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].The value of the base-uri
option must be an anyURI
. It is interpreted as an IRI reference. If it is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or p:make-absolute-uris
in the case of a syntactic shortcut value). It is a dynamic error () if the base URI is not both absolute and valid according to [].
For every element or attribute in the input document which matches the specified pattern, its XPath string-value is resolved against the specified base URI and the resulting absolute IRI is used as the matched node's entire contents in the output.
The base URI used for resolution defaults to the matched attribute's element or the matched element's base URI unless the base-uri
option is specified. When the base-uri
option is specified, the option value is used as the base URI regardless of any contextual base URI value in the document. This option value is resolved against the base URI of the p:option
element used to set the option.The base URI used for resolution defaults to the matched attribute's element or the matched element's base URI unless the base-uri
option is specified. When the base-uri
option is specified, the option value is used as the base URI regardless of any contextual base URI value in the document. This option value is resolved against the base URI of the p:option
element used to set the option.
If the IRI reference specified by the base-uri
option on p:make-absolute-uris
is absent and the input document has no base URI, the results are implementation-dependent.
Document properties
All document properties are preserved.
The p:namespace-delete
step deletes all of the namespaces identified by the specified prefixes from the document appearing on port source
.The p:namespace-delete
step deletes all of the namespaces identified by the specified prefixes from the document appearing on port source
.
<p:declare-step
type
="
p:namespace-delete
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
prefixes
"
required
="
true
"
as
="
xs:string
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Required |
---|---|---|
prefixes | xs:string | ✔ |
Errors
Error code | Description |
---|---|
err:XC0108 | It is a dynamic error if any prefix is not in-scope at the point where the p:namespace-delete occurs. |
err:XC0109 | It is a dynamic error if a namespace is to be removed from an attribute and the element already has an attribute with the resulting name. |
Declaration
<p:declare-step
type
="
p:namespace-delete
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
prefixes
"
required
="
true
"
as
="
xs:string
"
/>
</p:declare-step>
The value of option prefixes
is taken as a space separated list of prefixes. It is a dynamic error () if any prefix is not in-scope at the point where the p:namespace-delete
occurs.The value of option prefixes
is taken as a space separated list of prefixes. It is a dynamic error () if any prefix is not in-scope at the point where the p:namespace-delete
occurs.
For any prefix the associated namespace is removed from the elements and attributes in the document appearing on port source
. The respective elements or attributes in the document appearing on port result
will be in no namespace.For any prefix the associated namespace is removed from the elements and attributes in the document appearing on port source
. The respective elements or attributes in the document appearing on port result
will be in no namespace.
It is a dynamic error () if a namespace is to be removed from an attribute and the element already has an attribute with the resulting name.It is a dynamic error () if a namespace is to be removed from an attribute and the element already has an attribute with the resulting name.
Document properties
All document properties are preserved.
The p:namespace-rename
step renames any namespace declaration or use of a namespace in a document to a new IRI value.
<p:declare-step
type
="
p:namespace-rename
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
from
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
to
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
apply-to
"
select
="
'all'
"
values
="
('all','elements','attributes')
"
/>
<!--
string -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Values | Default value |
---|---|---|---|
apply-to | xs:string | ('all','elements','attributes') | 'all' |
from | xs:anyURI? | () | |
to | xs:anyURI? | () |
Errors
Error code | Description |
---|---|
err:XC0014 | It is a dynamic error if the XML namespace (http://www.w3.org/XML/1998/namespace) or the XMLNS namespace (http://www.w3.org/2000/xmlns/) is the value of either the from option or the to option. |
err:XC0092 | It is a dynamic error if as a consequence of changing or removing the namespace of an attribute the attribute's name is not unique on the respective element. |
Declaration
<p:declare-step
type
="
p:namespace-rename
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
from
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
to
"
as
="
xs:anyURI?
"
/>
<p:option
name
="
apply-to
"
select
="
'all'
"
values
="
('all','elements','attributes')
"
/>
<!--
string -->
</p:declare-step>
The value of the from
option must be an anyURI
. It should be either empty or absolute, but will not be resolved in any case.The value of the from
option must be an anyURI
. It should be either empty or absolute, but will not be resolved in any case.
The value of the to
option must be an anyURI
. It should be empty or absolute, but will not be resolved in any case.The value of the to
option must be an anyURI
. It should be empty or absolute, but will not be resolved in any case.
The value of the apply-to
option must be one of “all
”, “elements
”, or “attributes
”. If the value is “elements
”, only elements will be renamed, if the value is “attributes
”, only attributes will be renamed, if the value is “all
”, both elements and attributes will be renamed.The value of the apply-to
option must be one of “all
”, “elements
”, or “attributes
”. If the value is “elements
”, only elements will be renamed, if the value is “attributes
”, only attributes will be renamed, if the value is “all
”, both elements and attributes will be renamed.
It is a dynamic error () if the XML namespace (http://www.w3.org/XML/1998/namespace
) or the XMLNS namespace (http://www.w3.org/2000/xmlns/
) is the value of either the from
option or the to
option.It is a dynamic error () if the XML namespace (http://www.w3.org/XML/1998/namespace
) or the XMLNS namespace (http://www.w3.org/2000/xmlns/
) is the value of either the from
option or the to
option.
If the value of the from
option is the same as the value of the to
option, the input is reproduced unchanged on the output. Otherwise, namespace bindings, namespace attributes and element and attribute names are changed as follows:If the value of the from
option is the same as the value of the to
option, the input is reproduced unchanged on the output. Otherwise, namespace bindings, namespace attributes and element and attribute names are changed as follows:
Namespace bindings: If the
from
option is present and its value is not the empty string, then every binding of a prefix (or the default namespace) in the input document whose value is the same as the value of thefrom
option isNamespace bindings: If thefrom
option is present and its value is not the empty string, then every binding of a prefix (or the default namespace) in the input document whose value is the same as the value of thefrom
option isreplaced in the output with a binding to the value of the
to
option, provided it is present and not the empty string;replaced in the output with a binding to the value of theto
option, provided it is present and not the empty string;otherwise (the
to
option is not specified or has an empty string as its value) absent from the output.otherwise (theto
option is not specified or has an empty string as its value) absent from the output.
If the
from
option is absent, or its value is the empty string, then no bindings are changed or removed.If thefrom
option is absent, or its value is the empty string, then no bindings are changed or removed.Elements and attributes: If the
from
option is present and its value is not the empty string, for every element and attribute, as appropriate, in the input whose namespace name is the same as the value of thefrom
option, in the output its namespace name isElements and attributes: If thefrom
option is present and its value is not the empty string, for every element and attribute, as appropriate, in the input whose namespace name is the same as the value of thefrom
option, in the output its namespace name isreplaced with the value of the
to
option, provided it is present and not the empty string;replaced with the value of theto
option, provided it is present and not the empty string;otherwise (the
to
option is not specified or has an empty string as its value) changed to have no value.otherwise (theto
option is not specified or has an empty string as its value) changed to have no value.
If the
from
option is absent, or its value is the empty string, then for every element and attribute, as appropriate, whose namespace name has no value, in the output its namespace name is set to the value of theto
option.If thefrom
option is absent, or its value is the empty string, then for every element and attribute, as appropriate, whose namespace name has no value, in the output its namespace name is set to the value of theto
option.It is a dynamic error () if as a consequence of changing or removing the namespace of an attribute the attribute's name is not unique on the respective element.It is a dynamic error () if as a consequence of changing or removing the namespace of an attribute the attribute's name is not unique on the respective element.
Namespace attributes: If the
from
option is present and its value is not the empty string, for every namespace attribute in the input whose value is the same as the value of thefrom
option, in the outputNamespace attributes: If thefrom
option is present and its value is not the empty string, for every namespace attribute in the input whose value is the same as the value of thefrom
option, in the outputthe namespace attribute's value is replaced with the value of the
to
option, provided it is present and not the empty string;the namespace attribute's value is replaced with the value of theto
option, provided it is present and not the empty string;otherwise (the
to
option is not specified or has an empty string as its value) the namespace attribute is absent.otherwise (theto
option is not specified or has an empty string as its value) the namespace attribute is absent.
Note
The apply-to
option is primarily intended to make it possible to avoid renaming attributes when the from
option specifies no namespace, since many attributes are in no namespace.The apply-to
option is primarily intended to make it possible to avoid renaming attributes when the from
option specifies no namespace, since many attributes are in no namespace.
Care should be taken when specifying no namespace with the to
option. Prefixed names in content, for example QNames and XPath expressions, may end up with no appropriate namespace binding.Care should be taken when specifying no namespace with the to
option. Prefixed names in content, for example QNames and XPath expressions, may end up with no appropriate namespace binding.
Document properties
All document properties are preserved.
The p:pack
step merges two document sequences in a pair-wise fashion.
<p:declare-step
type
="
p:pack
"
>
<p:input
port
="
source
"
content-types
="
text xml html
"
sequence
="
true
"
primary
="
true
"
/>
<p:input
port
="
alternate
"
sequence
="
true
"
content-types
="
text xml html
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
application/xml
"
/>
<p:option
name
="
wrapper
"
required
="
true
"
as
="
xs:QName
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | text xml html |
alternate | ✔ | text xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | ✔ | application/xml |
Option name | Type | Required |
---|---|---|
wrapper | xs:QName | ✔ |
Declaration
<p:declare-step
type
="
p:pack
"
>
<p:input
port
="
source
"
content-types
="
text xml html
"
sequence
="
true
"
primary
="
true
"
/>
<p:input
port
="
alternate
"
sequence
="
true
"
content-types
="
text xml html
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
application/xml
"
/>
<p:option
name
="
wrapper
"
required
="
true
"
as
="
xs:QName
"
/>
</p:declare-step>
The step takes each pair of documents, in order, one from the source
port and one from the alternate
port, wraps them with a new element node whose QName is the value specified in the wrapper
option, and writes that element to the result
port as a document.The step takes each pair of documents, in order, one from the source
port and one from the alternate
port, wraps them with a new element node whose QName is the value specified in the wrapper
option, and writes that element to the result
port as a document.
If the step reaches the end of one input sequence before the other, then it simply wraps each of the remaining documents in the longer sequence.
Note
In the common case, where the document element of a document in the result
sequence has two element children, any comments, processing instructions, or white space text nodes that occur between them may have come from either of the input documents; this step does not attempt to distinguish which one.In the common case, where the document element of a document in the result
sequence has two element children, any comments, processing instructions, or white space text nodes that occur between them may have come from either of the input documents; this step does not attempt to distinguish which one.
Document properties
No document properties are preserved. The result documents do not have a base-uri
property.
The p:rename
step renames elements, attributes, or processing-instruction targets in a document.
<p:declare-step
type
="
p:rename
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
new-name
"
required
="
true
"
as
="
xs:QName
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Default value | Required |
---|---|---|---|
new-name | xs:QName | ✔ | |
match | XSLTSelectionPattern | '/*' |
Errors
Error code | Description |
---|---|
err:XC0013 | It is a dynamic error if the pattern matches a processing instruction and the new name has a non-null namespace. |
err:XC0023 | It is a dynamic error if the pattern matches anything other than element, attribute, or processing instruction nodes, or if it matches more than one attribute on a single element. |
Declaration
<p:declare-step
type
="
p:rename
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
new-name
"
required
="
true
"
as
="
xs:QName
"
/>
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if the pattern matches anything other than element, attribute, or processing instruction nodes, or if it matches more than one attribute on a single element.The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if the pattern matches anything other than element, attribute, or processing instruction nodes, or if it matches more than one attribute on a single element.
Each element, attribute, or processing-instruction in the input matched by the selection pattern specified in the match
option is renamed in the output to the name specified by the new-name
option.Each element, attribute, or processing-instruction in the input matched by the selection pattern specified in the match
option is renamed in the output to the name specified by the new-name
option.
If the match
option matches an attribute and if the element on which it occurs already has an attribute whose expanded name is the same as the expanded name of the specified new-name
, then the results is as if the current attribute named “new-name
” was deleted before renaming the matched attribute.If the match
option matches an attribute and if the element on which it occurs already has an attribute whose expanded name is the same as the expanded name of the specified new-name
, then the results is as if the current attribute named “new-name
” was deleted before renaming the matched attribute.
With respect to attributes named “xml:base
”, the following semantics apply: renaming an from “xml:base
” to something else has no effect on the underlying base URI of the element; however, if an attribute is renamed from something else to “xml:base
”, the base URI of the element must also be amended accordingly.
If the pattern matches processing instructions, then it is the processing instruction target that is renamed. It is a dynamic error () if the pattern matches a processing instruction and the new name has a non-null namespace. If the pattern matches processing instructions, then it is the processing instruction target that is renamed. It is a dynamic error () if the pattern matches a processing instruction and the new name has a non-null namespace.
Document properties
All document properties are preserved.
The p:replace
step replaces matching nodes in its primary input with the top-level node(s) of the replacement
port's document.
<p:declare-step
type
="
p:replace
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:input
port
="
replacement
"
content-types
="
text xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html | |
replacement | text xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text xml html |
Option name | Type | Required |
---|---|---|
match | XSLTSelectionPattern | ✔ |
Errors
Error code | Description |
---|---|
err:XC0023 | It is a dynamic error if that pattern matches an attribute or a namespace nodes. |
Declaration
<p:declare-step
type
="
p:replace
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:input
port
="
replacement
"
content-types
="
text xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that pattern matches an attribute or a namespace nodes. Multiple matches are allowed, in which case multiple copies of the replacement
document will occur.The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that pattern matches an attribute or a namespace nodes. Multiple matches are allowed, in which case multiple copies of the replacement
document will occur.
Every node in the primary input matching the specified pattern is replaced in the output by the top-level node(s) of the replacement
document. Only non-nested matches are replaced. That is, once a node is replaced, its descendants cannot be matched.Every node in the primary input matching the specified pattern is replaced in the output by the top-level node(s) of the replacement
document. Only non-nested matches are replaced. That is, once a node is replaced, its descendants cannot be matched.
If the document node is matched and port replacement contains a text document, the entire document is replaced by the text node. What appears on port, the result is a text document with the text node wrapped in a document node.
Document properties
If the resulting document contains exactly one text node, the content-type
property is changed to text/plain
and the serialization
property is removed, while all other document properties are preserved. For other document types, all document properties are preserved.
The p:set-attributes
step sets attributes on matching elements.
<p:declare-step
type
="
p:set-attributes
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
attributes
"
required
="
true
"
as
="
map(xs:QName, xs:anyAtomicType)
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | xml html |
Option name | Type | Default value | Required |
---|---|---|---|
attributes | map(xs:QName, xs:anyAtomicType) | ✔ | |
match | XSLTSelectionPattern | '/*' |
Errors
Error code | Description |
---|---|
err:XC0023 | It is a dynamic error if that pattern matches anything other than element nodes. |
err:XC0059 | It is a dynamic error if the name of any attribute is “xmlns” or uses the prefix “xmlns” or any other prefix that resolves to the namespace name http://www.w3.org/2000/xmlns/. |
Declaration
<p:declare-step
type
="
p:set-attributes
"
>
<p:input
port
="
source
"
primary
="
true
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
xml html
"
/>
<p:option
name
="
match
"
as
="
xs:string
"
select
="
'/*'
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
attributes
"
required
="
true
"
as
="
map(xs:QName, xs:anyAtomicType)
"
/>
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that pattern matches anything other than element nodes.The value of the match
option must be an XSLTSelectionPattern. It is a dynamic error () if that pattern matches anything other than element nodes.
A new attribute is created for each entry in the map appearing on the attributes
option. The attribute name is taken from the entry's key while the attribute value is taken from the string value of the entry's value.A new attribute is created for each entry in the map appearing on the attributes
option. The attribute name is taken from the entry's key while the attribute value is taken from the string value of the entry's value.
If an attribute with the same name as one of the attributes to be created already exists, the value specified on the attributes
option is used. The result port of this step produces a copy of the source
port's document with the matching elements' attributes modified.If an attribute with the same name as one of the attributes to be created already exists, the value specified on the attributes
option is used. The result port of this step produces a copy of the source
port's document with the matching elements' attributes modified.
The matching elements are specified by the selection pattern in the match
option. All matching elements are processed. If no elements match, the step will not change any elements.The matching elements are specified by the selection pattern in the match
option. All matching elements are processed. If no elements match, the step will not change any elements.
IfThis step cannot be used to add namespace declarations. It is a dynamic error (the ) if the name of any attribute is “xmlns
” or uses the prefix “xmlns
” or any other prefix that resolves to the namespace name http://www.w3.org/2000/xmlns/
. However, if the attributes taken from the attributes
use namespaces, prefixes, or prefixes bound to different namespaces, the document produced on the result
output port will require namespace fixup.IfThis step cannot be used to add namespace declarations. It is a dynamic error (the ) if the name of any attribute is “xmlns
” or uses the prefix “xmlns
” or any other prefix that resolves to the namespace name http://www.w3.org/2000/xmlns/
. However, if the attributes taken from the attributes
use namespaces, prefixes, or prefixes bound to different namespaces, the document produced on the result
output port will require namespace fixup.
If an attribute named xml:base
is added or changed, the base URI of the element must also be amended accordingly.
Document properties
All document properties are preserved.
The p:set-properties
step sets document properties on the source document.
<p:declare-step
type
="
p:set-properties
"
>
<p:input
port
="
source
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
any
"
/>
<p:option
name
="
properties
"
required
="
true
"
as
="
map(xs:QName,item()*)
"
/>
<p:option
name
="
merge
"
select
="
true()
"
as
="
xs:boolean
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | any |
Option name | Type | Default value | Required |
---|---|---|---|
properties | map(xs:QName,item()*) | ✔ | |
merge | xs:boolean | true() |
Errors
Error code | Description |
---|---|
err:XC0069 | It is a dynamic error if the properties map contains a key equal to the string “content-type”. |
err:XD0064 | It is a dynamic error if the base URI is not both absolute and valid according to . |
err:XD0070 | It is a dynamic error if a value is assigned to the serialization document property that cannot be converted into map(xs:QName, item()*) according to the rules in section “QName handling” of . |
Declaration
<p:declare-step
type
="
p:set-properties
"
>
<p:input
port
="
source
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
any
"
/>
<p:option
name
="
properties
"
required
="
true
"
as
="
map(xs:QName,item()*)
"
/>
<p:option
name
="
merge
"
select
="
true()
"
as
="
xs:boolean
"
/>
</p:declare-step>
The document properties of the document on the source
port are augmented with the values specified in the properties
option. The document produced on the result
port has the same representation but the adjusted property values.The document properties of the document on the source
port are augmented with the values specified in the properties
option. The document produced on the result
port has the same representation but the adjusted property values.
If the merge
option is true, then the supplied properties are added to the existing properties, overwriting already existing values for a given key. If it is false, the document’s properties are replaced by the new set.If the merge
option is true, then the supplied properties are added to the existing properties, overwriting already existing values for a given key. If it is false, the document’s properties are replaced by the new set.
It is a dynamic error () if a value is assigned to the serialization
document property that cannot be converted into map(xs:QName, item()*)
according to the rules in section “QName handling” of [].It is a dynamic error () if a value is assigned to the serialization
document property that cannot be converted into map(xs:QName, item()*)
according to the rules in section “QName handling” of [].
It is a dynamic error () if the properties
map contains a key equal to the string “content-type
”. It is a dynamic error () if the properties
map contains a key equal to the string “content-type
”.
If the properties
map contains a key equal to the string “base-uri
” the associated value is taken as the new base URI of the resulting document. It is a dynamic error () if the base URI is not both absolute and valid according to [].If the properties
map contains a key equal to the string “base-uri
” the associated value is taken as the new base URI of the resulting document. It is a dynamic error () if the base URI is not both absolute and valid according to [].
Document properties
If merge
is true, document properties not overridden by settings in the properties
map are preserved, otherwise the resulting document has only the content-type
property and the properties specified in the properties
map. In particular, if merge
is false, the base-uri
property will not be preserved. This means that the resulting document will not have a base URI if the properties
map does not contain a base-uri
entry. If merge
is true, document properties not overridden by settings in the properties
map are preserved, otherwise the resulting document has only the content-type
property and the properties specified in the properties
map. In particular, if merge
is false, the base-uri
property will not be preserved. This means that the resulting document will not have a base URI if the properties
map does not contain a base-uri
entry.
The p:sink
step accepts a sequence of documents and discards them. It has no output.
<p:declare-step
type
="
p:sink
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Declaration
<p:declare-step
type
="
p:sink
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
</p:declare-step>
Document properties
Not applicable.
The p:sleep
step introduces a delay.
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | ✔ | any |
Option name | Type | Required |
---|---|---|
duration | xs:nonNegativeInteger | ✔ |
Declaration
<p:declare-step
type
="
p:sleep
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
any
"
/>
<p:output
port
="
result
"
sequence
="
true
"
content-types
="
any
"
/>
<p:option
name
="
duration
"
as
="
xs:nonNegativeInteger
"
required
="
true
"
/>
</p:declare-step>
The p:sleep
step copies each of the documents on the source
port to the result
port without changing them. Before copying the documents, it pauses for a period of time not less than duration
milliseconds.
Note
In multi-threaded implementations, there is no guarantee that this will pause the execution of more than one thread. However, any steps that depend on the output of this step will wait for this step to complete.
Document properties
All document properties are preserved.
The p:split-sequence
step accepts a sequence of documents and divides it into two sequences.
<p:declare-step
type
="
p:split-sequence
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
matched
"
sequence
="
true
"
primary
="
true
"
content-types
="
any
"
/>
<p:output
port
="
not-matched
"
sequence
="
true
"
content-types
="
any
"
/>
<p:option
name
="
initial-only
"
as
="
xs:boolean
"
select
="
false()
"
/>
<p:option
name
="
test
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XPathExpression -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
matched | ✔ | ✔ | any |
not-matched | ✔ | any |
Option name | Type | Default value | Required |
---|---|---|---|
test | XPathExpression | ✔ | |
initial-only | xs:boolean | false() |
Errors
Error code | Description |
---|---|
err:XC0150 | It is a dynamic error if evaluating the XPath expression in option test on a context document results in an error. |
Declaration
<p:declare-step
type
="
p:split-sequence
"
>
<p:input
port
="
source
"
content-types
="
any
"
sequence
="
true
"
/>
<p:output
port
="
matched
"
sequence
="
true
"
primary
="
true
"
content-types
="
any
"
/>
<p:output
port
="
not-matched
"
sequence
="
true
"
content-types
="
any
"
/>
<p:option
name
="
initial-only
"
as
="
xs:boolean
"
select
="
false()
"
/>
<p:option
name
="
test
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XPathExpression -->
</p:declare-step>
The value of the test
option must be an XPathExpression.The value of the test
option must be an XPathExpression.
The XPath expression in the test
option is applied to each document in the input sequence. If the effective boolean value of the expression is true, the document is copied to the matched
port; otherwise it is copied to the not-matched
port.The XPath expression in the test
option is applied to each document in the input sequence. If the effective boolean value of the expression is true, the document is copied to the matched
port; otherwise it is copied to the not-matched
port.
If the initial-only
option is true, then when the first document that does not satisfy the test expression is encountered, it and all the documents that follow it are written to the not-matched
port. In other words, it only writes the initial series of matched documents (which may be empty) to the matched
port. All other documents are written to the not-matched
port, irrespective of whether or not they match.If the initial-only
option is true, then when the first document that does not satisfy the test expression is encountered, it and all the documents that follow it are written to the not-matched
port. In other words, it only writes the initial series of matched documents (which may be empty) to the matched
port. All other documents are written to the not-matched
port, irrespective of whether or not they match.
The XPath context for the test
option changes over time. For each document that appears on the source
port, the expression is evaluated with that document as the context document. The context position (position()
) is the position of that document within the sequence and the context size (last()
) is the total number of documents in the sequence. It is a dynamic error () if evaluating the XPath expression in option test
on a context document results in an error.The XPath context for the test
option changes over time. For each document that appears on the source
port, the expression is evaluated with that document as the context document. The context position (position()
) is the position of that document within the sequence and the context size (last()
) is the total number of documents in the sequence. It is a dynamic error () if evaluating the XPath expression in option test
on a context document results in an error.
Note
In principle, this component cannot stream because it must buffer all of the input sequence in order to find the context size. In practice, if the test expression does not use the last()
function, the implementation can stream and ignore the context size.
If the implementation supports passing PSVI annotations between steps, the p:split-sequence
step must preserve any annotations that appear in the input.
Document properties
All document properties are preserved.
The p:store
step stores (a possibly serialized version of) its input to a URI. It outputs a reference to the location of the stored document on the result-uri
port. Aside from these side-effects, it behaves like a step, copying its input to the result
port.The p:store
step stores (a possibly serialized version of) its input to a URI. It outputs a reference to the location of the stored document on the result-uri
port. Aside from these side-effects, it behaves like a step, copying its input to the result
port.
<p:declare-step
type
="
p:store
"
>
<p:input
port
="
source
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
any
"
primary
="
true
"
/>
<p:output
port
="
result-uri
"
content-types
="
application/xml
"
/>
<p:option
name
="
href
"
required
="
true
"
as
="
xs:anyURI
"
/>
<p:option
name
="
serialization
"
as
="
map(xs:QName,item()*)?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | any |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | any | |
result-uri | application/xml |
Option name | Type | Default value | Required |
---|---|---|---|
href | xs:anyURI | ✔ | |
serialization | map(xs:QName,item()*)? | () |
Errors
Error code | Description |
---|---|
err:XC0050 | It is a dynamic error if the URI scheme is not supported or the step cannot store to the specified location. |
Declaration
<p:declare-step
type
="
p:store
"
>
<p:input
port
="
source
"
content-types
="
any
"
/>
<p:output
port
="
result
"
content-types
="
any
"
primary
="
true
"
/>
<p:output
port
="
result-uri
"
content-types
="
application/xml
"
/>
<p:option
name
="
href
"
required
="
true
"
as
="
xs:anyURI
"
/>
<p:option
name
="
serialization
"
as
="
map(xs:QName,item()*)?
"
/>
</p:declare-step>
The value of the href
option must be an anyURI
. If it is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or p:store
in the case of a syntactic shortcut value).The value of the href
option must be an anyURI
. If it is relative, it is made absolute against the base URI of the element on which it is specified (p:with-option
or p:store
in the case of a syntactic shortcut value).
The step attempts to store the document to the specified URI. If the URI scheme “file:
” is supported, the processor should try to create all non existing folders in the URI’s path. It is a dynamic error () if the URI scheme is not supported or the step cannot store to the specified location.The step attempts to store the document to the specified URI. If the URI scheme “file:
” is supported, the processor should try to create all non existing folders in the URI’s path. It is a dynamic error () if the URI scheme is not supported or the step cannot store to the specified location.
The output of this step on the result-uri
port is a document containing a single element whose content is the absolute URI of the document stored by the step.The output of this step on the result-uri
port is a document containing a single element whose content is the absolute URI of the document stored by the step.
The serialization
option is provided to control the serialization of content when it is stored. If the document to be stored has a “serialization” property, the serialization is controlled by the merger of the two maps where the entries in the “serialization” property take precedence. Serialization is described in [].The serialization
option is provided to control the serialization of content when it is stored. If the document to be stored has a “serialization” property, the serialization is controlled by the merger of the two maps where the entries in the “serialization” property take precedence. Serialization is described in [].
Document properties
All document properties are preserved.
The p:string-replace
step matches nodes in the document provided on the source
port and replaces them with the string result of evaluating an XPath expression.The p:string-replace
step matches nodes in the document provided on the source
port and replaces them with the string result of evaluating an XPath expression.
<p:declare-step
type
="
p:string-replace
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
replace
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XPathExpression -->
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | xml html |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text xml html |
Option name | Type | Required |
---|---|---|
match | XSLTSelectionPattern | ✔ |
replace | XPathExpression | ✔ |
Declaration
<p:declare-step
type
="
p:string-replace
"
>
<p:input
port
="
source
"
content-types
="
xml html
"
/>
<p:output
port
="
result
"
content-types
="
text xml html
"
/>
<p:option
name
="
match
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XSLTSelectionPattern -->
<p:option
name
="
replace
"
required
="
true
"
as
="
xs:string
"
/>
<!--
XPathExpression -->
</p:declare-step>
The value of the match
option must be an XSLTSelectionPattern.The value of the match
option must be an XSLTSelectionPattern.
The value of the replace
option must be an XPathExpression.The value of the replace
option must be an XPathExpression.
The matched nodes are specified with the selection pattern in the match
option. For each matching node, the XPath expression provided by the replace
option is evaluated with the matching node as the XPath context node. The string value of the result is used in the output. Nodes that do not match are copied without change.The matched nodes are specified with the selection pattern in the match
option. For each matching node, the XPath expression provided by the replace
option is evaluated with the matching node as the XPath context node. The string value of the result is used in the output. Nodes that do not match are copied without change.
If the expression given in the match
option matches an attribute, the string value of the replace
expression is used as the new value of the attribute in the output. If the attribute is named “xml:base
”, the base URI of the element must also be amended accordingly.If the expression given in the match
option matches an attribute, the string value of the replace
expression is used as the new value of the attribute in the output. If the attribute is named “xml:base
”, the base URI of the element must also be amended accordingly.
If the document node is matched, the entire document is replaced by the string value of the replace expression. What appears on port result is a text document with the text node wrapped in a document node.
If the expression matches any other kind of node, the entire node (and not just its contents) is replaced by the string value of the replace
expression.If the expression matches any other kind of node, the entire node (and not just its contents) is replaced by the string value of the replace
expression.
Document properties
If the resulting document contains exactly one text node, the content-type
property is changed to text/plain
and the serialization
property is removed, while all other document properties are preserved. For other document types, all document properties are preserved.
The p:text-count
step counts the number of lines in a text document and returns a single XML document containing that number.
<p:declare-step
type
="
p:text-count
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
application/xml
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | text |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | application/xml |
Declaration
<p:declare-step
type
="
p:text-count
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
application/xml
"
/>
</p:declare-step>
The p:text-count
step counts the number of lines in the text document appearing on its source
port. It returns on its result
port an XML document containing a single element whose contents is the string representing this count.The p:text-count
step counts the number of lines in the text document appearing on its source
port. It returns on its result
port an XML document containing a single element whose contents is the string representing this count.
Lines are identified as described in XML, 2.11 End-of-Line Handling. For the purpose of identifying lines, if the very last character in the text document is a newline ( ), that newline is ignored. (It is not a separator between that line and a following line that contains no characters.)
Document properties
No document properties are preserved. The count document does not have a base-uri
property.
The p:text-head
step returns lines from the beginning of a text document.
<p:declare-step
type
="
p:text-head
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:option
name
="
count
"
required
="
true
"
as
="
xs:integer
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | text |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text |
Option name | Type | Required |
---|---|---|
count | xs:integer | ✔ |
Declaration
<p:declare-step
type
="
p:text-head
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:option
name
="
count
"
required
="
true
"
as
="
xs:integer
"
/>
</p:declare-step>
The p:text-head
step returns on its result
port lines from the text document that appears on its source
port:The p:text-head
step returns on its result
port lines from the text document that appears on its source
port:
If the
count
option is positive, thep:text-head
step returns the firstcount
linesIf thecount
option is positive, thep:text-head
step returns the firstcount
linesIf the
count
option is zero, thep:text-head
step returns all linesIf thecount
option is zero, thep:text-head
step returns all linesIf the
count
option is negative, thep:text-head
step returns all lines except the firstcount
linesIf thecount
option is negative, thep:text-head
step returns all lines except the firstcount
lines
Lines are identified as described in XML, 2.11 End-of-Line Handling. All lines returned by p:text-head
are terminated with a single newline ( ).
Document properties
All document properties are preserved.
The p:text-join
step concatenates text documents.
<p:declare-step
type
="
p:text-join
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
text
"
/>
<p:output
port
="
result
"
content-types
="
text
"
/>
<p:option
name
="
separator
"
as
="
xs:string?
"
/>
<p:option
name
="
prefix
"
as
="
xs:string?
"
/>
<p:option
name
="
suffix
"
as
="
xs:string?
"
/>
<p:option
name
="
override-content-type
"
as
="
xs:string?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | ✔ | text |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text |
Option name | Type | Default value |
---|---|---|
override-content-type | xs:string? | () |
prefix | xs:string? | () |
separator | xs:string? | () |
suffix | xs:string? | () |
Errors
Error code | Description |
---|---|
err:XC0001 | It is a dynamic error if the value of option override-content-type is not a text media type. |
err:XD0079 | It is a dynamic error if a supplied content-type is not a valid media type of the form “type/subtype+ext” or “type/subtype”. |
Declaration
<p:declare-step
type
="
p:text-join
"
>
<p:input
port
="
source
"
sequence
="
true
"
content-types
="
text
"
/>
<p:output
port
="
result
"
content-types
="
text
"
/>
<p:option
name
="
separator
"
as
="
xs:string?
"
/>
<p:option
name
="
prefix
"
as
="
xs:string?
"
/>
<p:option
name
="
suffix
"
as
="
xs:string?
"
/>
<p:option
name
="
override-content-type
"
as
="
xs:string?
"
/>
</p:declare-step>
The p:text-join
step concatenates the text documents appearing on its source
port into a single document on its result
port. The documents will be concatenated in order of appearance. The p:text-join
step concatenates the text documents appearing on its source
port into a single document on its result
port. The documents will be concatenated in order of appearance.
When the
separator
option is specified, its value will be inserted in between adjacent documents.When theseparator
option is specified, its value will be inserted in between adjacent documents.When the
prefix
option is specified, the document appearing on theresult
port will always start with its value (also when there are no documents on thesource
port).When theprefix
option is specified, the document appearing on theresult
port will always start with its value (also when there are no documents on thesource
port).When the
suffix
option is specified, the document appearing on theresult
port will always end with its value (also when there are no documents on thesource
port).When thesuffix
option is specified, the document appearing on theresult
port will always end with its value (also when there are no documents on thesource
port).
When the override-content-type
option is specified, the document appearing on the port result
will have this media type as part of its document properties. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if the value of option type
/subtype
override-content-type
is not a text media type. When the override-content-type
option is specified, the document appearing on the port result
will have this media type as part of its document properties. It is a dynamic error () if a supplied content-type is not a valid media type of the form “
” or “type
/subtype
+ext
”. It is a dynamic error () if the value of option type
/subtype
override-content-type
is not a text media type.
Concatenating text documents does not require identifying individual lines in each document, consequently no special end-of-line handling is performed.
Document properties
No document properties are preserved. The joined document has no base-uri
property.
The p:text-replace
step replaces all occurrences of substrings in a text document that match a supplied regular expression with a given replacement string.
<p:declare-step
type
="
p:text-replace
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:option
name
="
pattern
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
replacement
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
flags
"
as
="
xs:string?
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | text |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text |
Option name | Type | Default value | Required |
---|---|---|---|
pattern | xs:string | ✔ | |
replacement | xs:string | ✔ | |
flags | xs:string? | () |
Errors
Error code | Description |
---|---|
err:XC0147 | It is a dynamic error if the specified value is not a valid XPath regular expression. |
Declaration
<p:declare-step
type
="
p:text-replace
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:option
name
="
pattern
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
replacement
"
required
="
true
"
as
="
xs:string
"
/>
<p:option
name
="
flags
"
as
="
xs:string?
"
/>
</p:declare-step>
The p:text-replace
step replaces all occurrences of substrings in the text document appearing on its source
port that match a supplied regular expression with a given replacement string. The result is returned (as another text document) on its result
port. The p:text-replace
step replaces all occurrences of substrings in the text document appearing on its source
port that match a supplied regular expression with a given replacement string. The result is returned (as another text document) on its result
port.
This step is a convenience wrapper around the XPath fn:replace
function to ease text replacements in the document flow of a pipeline.
The pattern
, replacement
and flags
options are specified the same as the parameters with the same names of the function. The pattern
option must be a regular expression as specified in [], section 7.61 “Regular Expression Syntax
”. It is a dynamic error () if the specified value is not a valid XPath regular expression.The pattern
, replacement
and flags
options are specified the same as the parameters with the same names of the function. The pattern
option must be a regular expression as specified in [], section 7.61 “Regular Expression Syntax
”. It is a dynamic error () if the specified value is not a valid XPath regular expression.
Replacing strings in text documents does not require identifying individual lines in each document, consequently no special end-of-line handling is performed.
Document properties
All document properties are preserved.
The p:text-sort
step sorts lines in a text document.
<p:declare-step
type
="
p:text-sort
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:option
name
="
sort-key
"
as
="
xs:string
"
select
="
'.'
"
/>
<!--
XPathExpression -->
<p:option
name
="
order
"
as
="
xs:string
"
select
="
'ascending'
"
values
="
('ascending', 'descending')
"
/>
<!--
string -->
<p:option
name
="
case-order
"
as
="
xs:string?
"
values
="
('upper-first', 'lower-first')
"
/>
<!--
string -->
<p:option
name
="
lang
"
as
="
xs:language?
"
/>
<p:option
name
="
collation
"
as
="
xs:string
"
select
="
'https://www.w3.org/2005/xpath-functions/collation/codepoint'
"
/>
<p:option
name
="
stable
"
as
="
xs:boolean
"
select
="
true()
"
/>
</p:declare-step>
Summary
Input port | Primary | Sequence | Content types |
---|---|---|---|
source | ✔ | text |
Output port | Primary | Sequence | Content types |
---|---|---|---|
result | ✔ | text |
Option name | Type | Values | Default value |
---|---|---|---|
case-order | xs:string? | ('upper-first', 'lower-first') | () |
collation | xs:string | 'http://www.w3.org/2005/xpath-functions/collation/codepoint' | |
lang | xs:language? | () | |
order | xs:string | ('ascending', 'descending') | 'ascending' |
sort-key | XPathExpression | '.' | |
stable | xs:boolean | true() |
Errors
Error code | Description |
---|---|
err:XC0098 | It is a dynamic error if a dynamic XPath error occurred while applying sort-key to a line. |
err:XC0099 | It is a dynamic error if the result of applying sort-key to a given line results in a sequence with more than one item. |
Implementation details
Implementation | Description |
---|---|
Defined | Support for other collations is implementation-defined. |
Declaration
<p:declare-step
type
="
p:text-sort
"
>
<p:input
port
="
source
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:output
port
="
result
"
primary
="
true
"
sequence
="
false
"
content-types
="
text
"
/>
<p:option
name
="
sort-key
"
as
="
xs:string
"
select
="
'.'
"
/>
<!--
XPathExpression -->
<p:option
name
="
order
"
as
="
xs:string
"
select
="
'ascending'
"
values
="
('ascending', 'descending')
"
/>
<!--
string -->
<p:option
name
="
case-order
"
as
="
xs:string?
"
values
="
('upper-first', 'lower-first')