|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: INNER | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object | +--org.apache.xerces.dom.DOMNormalizer
This class adds implementation for normalizeDocument method. It acts as if the document was going through a save and load cycle, putting the document in a "normal" form. The actual result depends on the features being set and governing what operations actually take place. See setNormalizationFeature for details. Noticeably this method normalizes Text nodes, makes the document "namespace wellformed", according to the algorithm described below in pseudo code, by adding missing namespace declaration attributes and adding or changing namespace prefixes, updates the replacement tree of EntityReference nodes, normalizes attribute values, etc. Mutation events, when supported, are generated to reflect the changes occuring on the document. See Namespace normalization for details on how namespace declaration attributes and prefixes are normalized. NOTE: There is an initial support for DOM revalidation with XML Schema as a grammar. The tree might not be validated correctly if entityReferences, CDATA sections are present in the tree. The PSVI information is not exposed, normalized data (including element default content is not available).
Inner Class Summary | |
protected class |
DOMNormalizer.XMLAttributesProxy
|
Field Summary | |
static java.lang.RuntimeException |
abort
If the user stops the process, this exception will be thrown. |
protected static boolean |
DEBUG
Debug namespace fix up algorithm |
protected static boolean |
DEBUG_EVENTS
Debug document handler events |
protected static boolean |
DEBUG_ND
Debug normalize document |
protected java.util.Vector |
fAttributeList
list of attributes |
protected DOMNormalizer.XMLAttributesProxy |
fAttrProxy
|
protected DOMConfigurationImpl |
fConfiguration
|
protected org.w3c.dom.Node |
fCurrentNode
for setting the PSVI |
protected CoreDocumentImpl |
fDocument
|
protected org.w3c.dom.DOMErrorHandler |
fErrorHandler
error handler. |
protected org.apache.xerces.xni.NamespaceContext |
fLocalNSBinder
Stores all namespace bindings on the current element |
protected DOMLocatorImpl |
fLocator
DOM Locator - for namespace fixup algorithm |
protected org.apache.xerces.xni.NamespaceContext |
fNamespaceContext
The namespace context of this document: stores namespaces in scope |
protected boolean |
fNamespaceValidation
|
protected boolean |
fPSVI
|
protected org.apache.xerces.xni.QName |
fQName
|
protected SymbolTable |
fSymbolTable
symbol table |
protected RevalidationHandler |
fValidationHandler
Validation handler represents validator instance. |
protected static java.lang.String |
PREFIX
prefix added by namespace fixup algorithm should follow a pattern "NS" + index |
Constructor Summary | |
DOMNormalizer()
|
Method Summary | |
protected void |
addNamespaceDecl(java.lang.String prefix,
java.lang.String uri,
ElementImpl element)
Adds a namespace attribute or replaces the value of existing namespace attribute with the given prefix and value for URI. |
void |
characters(org.apache.xerces.xni.XMLString text,
org.apache.xerces.xni.Augmentations augs)
Character content. |
void |
comment(org.apache.xerces.xni.XMLString text,
org.apache.xerces.xni.Augmentations augs)
A comment. |
void |
doctypeDecl(java.lang.String rootElement,
java.lang.String publicId,
java.lang.String systemId,
org.apache.xerces.xni.Augmentations augs)
Notifies of the presence of the DOCTYPE line in the document. |
void |
emptyElement(org.apache.xerces.xni.QName element,
org.apache.xerces.xni.XMLAttributes attributes,
org.apache.xerces.xni.Augmentations augs)
An empty element. |
void |
endCDATA(org.apache.xerces.xni.Augmentations augs)
The end of a CDATA section. |
void |
endDocument(org.apache.xerces.xni.Augmentations augs)
The end of the document. |
void |
endElement(org.apache.xerces.xni.QName element,
org.apache.xerces.xni.Augmentations augs)
The end of an element. |
void |
endGeneralEntity(java.lang.String name,
org.apache.xerces.xni.Augmentations augs)
This method notifies the end of a general entity. |
protected void |
expandEntityRef(org.w3c.dom.Node parent,
org.w3c.dom.Node reference)
|
org.apache.xerces.xni.parser.XMLDocumentSource |
getDocumentSource()
Returns the document source. |
void |
ignorableWhitespace(org.apache.xerces.xni.XMLString text,
org.apache.xerces.xni.Augmentations augs)
Ignorable whitespace. |
static void |
isAttrValueWF(org.w3c.dom.DOMErrorHandler errorHandler,
DOMErrorImpl error,
DOMLocatorImpl locator,
org.w3c.dom.NamedNodeMap attributes,
org.w3c.dom.Attr a,
java.lang.String value,
boolean xml11Version)
NON-DOM: check if attribute value is well-formed |
static void |
isCDataWF(org.w3c.dom.DOMErrorHandler errorHandler,
DOMErrorImpl error,
DOMLocatorImpl locator,
java.lang.String datavalue,
boolean isXML11Version)
Check if CDATA section is well-formed |
static void |
isCommentWF(org.w3c.dom.DOMErrorHandler errorHandler,
DOMErrorImpl error,
DOMLocatorImpl locator,
java.lang.String datavalue,
boolean isXML11Version)
NON-DOM: check if value of the comment is well-formed |
static void |
isXMLCharWF(org.w3c.dom.DOMErrorHandler errorHandler,
DOMErrorImpl error,
DOMLocatorImpl locator,
java.lang.String datavalue,
boolean isXML11Version)
NON-DOM: check for valid XML characters as per the XML version |
protected void |
namespaceFixUp(ElementImpl element,
AttributeMap attributes)
|
protected void |
normalizeDocument(CoreDocumentImpl document,
DOMConfigurationImpl config)
Normalizes document. |
protected org.w3c.dom.Node |
normalizeNode(org.w3c.dom.Node node)
This method acts as if the document was going through a save and load cycle, putting the document in a "normal" form. |
void |
processingInstruction(java.lang.String target,
org.apache.xerces.xni.XMLString data,
org.apache.xerces.xni.Augmentations augs)
A processing instruction. |
static void |
reportDOMError(org.w3c.dom.DOMErrorHandler errorHandler,
DOMErrorImpl error,
DOMLocatorImpl locator,
java.lang.String message,
short severity,
java.lang.String type)
Reports a DOM error to the user handler. |
void |
setDocumentSource(org.apache.xerces.xni.parser.XMLDocumentSource source)
Sets the document source. |
void |
startCDATA(org.apache.xerces.xni.Augmentations augs)
The start of a CDATA section. |
void |
startDocument(org.apache.xerces.xni.XMLLocator locator,
java.lang.String encoding,
org.apache.xerces.xni.NamespaceContext namespaceContext,
org.apache.xerces.xni.Augmentations augs)
The start of the document. |
void |
startElement(org.apache.xerces.xni.QName element,
org.apache.xerces.xni.XMLAttributes attributes,
org.apache.xerces.xni.Augmentations augs)
The start of an element. |
void |
startGeneralEntity(java.lang.String name,
org.apache.xerces.xni.XMLResourceIdentifier identifier,
java.lang.String encoding,
org.apache.xerces.xni.Augmentations augs)
This method notifies the start of a general entity. |
void |
textDecl(java.lang.String version,
java.lang.String encoding,
org.apache.xerces.xni.Augmentations augs)
Notifies of the presence of a TextDecl line in an entity. |
protected void |
updateQName(org.w3c.dom.Node node,
org.apache.xerces.xni.QName qname)
|
void |
xmlDecl(java.lang.String version,
java.lang.String encoding,
java.lang.String standalone,
org.apache.xerces.xni.Augmentations augs)
Notifies of the presence of an XMLDecl line in the document. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
protected static final boolean DEBUG_ND
protected static final boolean DEBUG
protected static final boolean DEBUG_EVENTS
protected static final java.lang.String PREFIX
protected DOMConfigurationImpl fConfiguration
protected CoreDocumentImpl fDocument
protected final DOMNormalizer.XMLAttributesProxy fAttrProxy
protected final org.apache.xerces.xni.QName fQName
protected RevalidationHandler fValidationHandler
protected SymbolTable fSymbolTable
protected org.w3c.dom.DOMErrorHandler fErrorHandler
protected boolean fNamespaceValidation
protected boolean fPSVI
protected final org.apache.xerces.xni.NamespaceContext fNamespaceContext
protected final org.apache.xerces.xni.NamespaceContext fLocalNSBinder
protected final java.util.Vector fAttributeList
protected final DOMLocatorImpl fLocator
protected org.w3c.dom.Node fCurrentNode
public static final java.lang.RuntimeException abort
Constructor Detail |
public DOMNormalizer()
Method Detail |
protected void normalizeDocument(CoreDocumentImpl document, DOMConfigurationImpl config)
protected org.w3c.dom.Node normalizeNode(org.w3c.dom.Node node)
node
- Modified node or null. If node is returned, we need
to normalize again starting on the node returned.protected final void expandEntityRef(org.w3c.dom.Node parent, org.w3c.dom.Node reference)
protected final void namespaceFixUp(ElementImpl element, AttributeMap attributes)
protected final void addNamespaceDecl(java.lang.String prefix, java.lang.String uri, ElementImpl element)
prefix
- uri
- java.io.IOException
- public static final void isCDataWF(org.w3c.dom.DOMErrorHandler errorHandler, DOMErrorImpl error, DOMLocatorImpl locator, java.lang.String datavalue, boolean isXML11Version)
datavalue
- isXML11Version
- = true if XML 1.1public static final void isXMLCharWF(org.w3c.dom.DOMErrorHandler errorHandler, DOMErrorImpl error, DOMLocatorImpl locator, java.lang.String datavalue, boolean isXML11Version)
datavalue
- isXML11Version
- = true if XML 1.1public static final void isCommentWF(org.w3c.dom.DOMErrorHandler errorHandler, DOMErrorImpl error, DOMLocatorImpl locator, java.lang.String datavalue, boolean isXML11Version)
datavalue
- isXML11Version
- = true if XML 1.1public static final void isAttrValueWF(org.w3c.dom.DOMErrorHandler errorHandler, DOMErrorImpl error, DOMLocatorImpl locator, org.w3c.dom.NamedNodeMap attributes, org.w3c.dom.Attr a, java.lang.String value, boolean xml11Version)
attributes
- a
- value
- public static final void reportDOMError(org.w3c.dom.DOMErrorHandler errorHandler, DOMErrorImpl error, DOMLocatorImpl locator, java.lang.String message, short severity, java.lang.String type)
protected final void updateQName(org.w3c.dom.Node node, org.apache.xerces.xni.QName qname)
public void startDocument(org.apache.xerces.xni.XMLLocator locator, java.lang.String encoding, org.apache.xerces.xni.NamespaceContext namespaceContext, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
startDocument
in interface org.apache.xerces.xni.XMLDocumentHandler
locator
- The document locator, or null if the document
location cannot be reported during the parsing
of this document. However, it is strongly
recommended that a locator be supplied that can
at least report the system identifier of the
document.encoding
- The auto-detected IANA encoding name of the entity
stream. This value will be null in those situations
where the entity encoding is not auto-detected (e.g.
internal entities or a document entity that is
parsed from a java.io.Reader).namespaceContext
- The namespace context in effect at the
start of this document.
This object represents the current context.
Implementors of this class are responsible
for copying the namespace bindings from the
the current context (and its parent contexts)
if that information is important.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void xmlDecl(java.lang.String version, java.lang.String encoding, java.lang.String standalone, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
xmlDecl
in interface org.apache.xerces.xni.XMLDocumentHandler
version
- The XML version.encoding
- The IANA encoding name of the document, or null if
not specified.standalone
- The standalone value, or null if not specified.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void doctypeDecl(java.lang.String rootElement, java.lang.String publicId, java.lang.String systemId, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
doctypeDecl
in interface org.apache.xerces.xni.XMLDocumentHandler
rootElement
- The name of the root element.publicId
- The public identifier if an external DTD or null
if the external DTD is specified using SYSTEM.systemId
- The system identifier if an external DTD, null
otherwise.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void comment(org.apache.xerces.xni.XMLString text, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
comment
in interface org.apache.xerces.xni.XMLDocumentHandler
text
- The text in the comment.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by application to signal an error.public void processingInstruction(java.lang.String target, org.apache.xerces.xni.XMLString data, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
Typically, a processing instruction's data will contain a series of pseudo-attributes. These pseudo-attributes follow the form of element attributes but are not parsed or presented to the application as anything other than text. The application is responsible for parsing the data.
processingInstruction
in interface org.apache.xerces.xni.XMLDocumentHandler
target
- The target.data
- The data or null if none specified.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void startElement(org.apache.xerces.xni.QName element, org.apache.xerces.xni.XMLAttributes attributes, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
startElement
in interface org.apache.xerces.xni.XMLDocumentHandler
element
- The name of the element.attributes
- The element attributes.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void emptyElement(org.apache.xerces.xni.QName element, org.apache.xerces.xni.XMLAttributes attributes, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
emptyElement
in interface org.apache.xerces.xni.XMLDocumentHandler
element
- The name of the element.attributes
- The element attributes.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void startGeneralEntity(java.lang.String name, org.apache.xerces.xni.XMLResourceIdentifier identifier, java.lang.String encoding, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
Note: This method is not called for entity references appearing as part of attribute values.
startGeneralEntity
in interface org.apache.xerces.xni.XMLDocumentHandler
name
- The name of the general entity.identifier
- The resource identifier.encoding
- The auto-detected IANA encoding name of the entity
stream. This value will be null in those situations
where the entity encoding is not auto-detected (e.g.
internal entities or a document entity that is
parsed from a java.io.Reader).augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void textDecl(java.lang.String version, java.lang.String encoding, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
Note: This method will never be called for the document entity; it is only called for external general entities referenced in document content.
Note: This method is not called for entity references appearing as part of attribute values.
textDecl
in interface org.apache.xerces.xni.XMLDocumentHandler
version
- The XML version, or null if not specified.encoding
- The IANA encoding name of the entity.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void endGeneralEntity(java.lang.String name, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
Note: This method is not called for entity references appearing as part of attribute values.
endGeneralEntity
in interface org.apache.xerces.xni.XMLDocumentHandler
name
- The name of the entity.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void characters(org.apache.xerces.xni.XMLString text, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
characters
in interface org.apache.xerces.xni.XMLDocumentHandler
text
- The content.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void ignorableWhitespace(org.apache.xerces.xni.XMLString text, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
ignorableWhitespace
in interface org.apache.xerces.xni.XMLDocumentHandler
text
- The ignorable whitespace.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void endElement(org.apache.xerces.xni.QName element, org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
endElement
in interface org.apache.xerces.xni.XMLDocumentHandler
element
- The name of the element.augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void startCDATA(org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
startCDATA
in interface org.apache.xerces.xni.XMLDocumentHandler
augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void endCDATA(org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
endCDATA
in interface org.apache.xerces.xni.XMLDocumentHandler
augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void endDocument(org.apache.xerces.xni.Augmentations augs) throws org.apache.xerces.xni.XNIException
endDocument
in interface org.apache.xerces.xni.XMLDocumentHandler
augs
- Additional information that may include infoset augmentationsorg.apache.xerces.xni.XNIException
- Thrown by handler to signal an error.public void setDocumentSource(org.apache.xerces.xni.parser.XMLDocumentSource source)
setDocumentSource
in interface org.apache.xerces.xni.XMLDocumentHandler
public org.apache.xerces.xni.parser.XMLDocumentSource getDocumentSource()
getDocumentSource
in interface org.apache.xerces.xni.XMLDocumentHandler
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: INNER | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |