I am using JTidy (HTML Tidy) to get a DOM out of some HTML files, and then
I collect all links (all elements with node name "a"). Now I want to
process this DOM with XSLT.
When I use the following code, I get the following exception:
XSLTProcessor processor = XSLTProcessorFactory.getProcessor();
processor.process(new XSLTInputSource(doc),
    new XSLTInputSource(new FileInputStream(xslPath)),
    new XSLTResultTarget(new FileOutputStream(outputFile)));
XSL Error: Cannot use a DTMLiaison for a input DOM node... pass a
org.apache.xalan.xpath.xdom.XercesLiaison instead!
XSL Error: SAX Exception
at org.apache.xalan.xslt.XSLTEngineImpl.error(XSLTEngineImpl.java:1799)
at org.apache.xalan.xslt.XSLTEngineImpl.error(XSLTEngineImpl.java:1691)
at DOMToHtmlSerializer.serialize(DOMToHtmlSerializer.java:39)
at HtmlLinkValidator.validate(HtmlLinkValidator.java:56)
at Main.<init>(Main.java:44)
at Main.main(Main.java:55)
OK, I thought, if it wants it that way, I will pass a XercesLiaison:
XercesLiaison xl = new XercesLiaison();
XSLTProcessor processor = XSLTProcessorFactory.getProcessor(xl);
processor.process(new XSLTInputSource(doc),
    new XSLTInputSource(new FileInputStream(xslPath)),
    new XSLTResultTarget(new FileOutputStream(outputFile)));
Then I get the following exception:
XSL Error: SAX Exception
org.apache.xalan.xslt.XSLProcessorException: XercesLiaison can not
handle nodes of type class org.w3c.tidy.DOMDocumentImpl
at org.apache.xalan.xslt.XSLTEngineImpl.error(XSLTEngineImpl.java:1753)
at org.apache.xalan.xslt.XSLTEngineImpl.error(XSLTEngineImpl.java:1717)
at DOMToHtmlSerializer.serialize(DOMToHtmlSerializer.java:39)
at HtmlLinkValidator.validate(HtmlLinkValidator.java:56)
at Main.<init>(Main.java:44)
at Main.main(Main.java:55)
Why is JTidy using its own DOMDocumentImpl (org.w3c.tidy.DOMDocumentImpl)
and not the DOMDocumentImpl from W3C (org.w3c.dom.DOMDocumentImpl)?
That would have saved me a lot of time.
Now what can I do?
Solution 1: Write the Tidy DOM to disk, then reparse it with any
XML parser, and then process it.
Solution 2: Write a wrapper which converts the Tidy DOM to a pure DOM,
and then process it.
Solution 3: Search for another tool that does this.
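For Solution 2, here is a minimal sketch of what such a converter might look like. It walks any DOM tree using only basic org.w3c.dom methods (getNodeType, getNodeName, getNodeValue, getAttributes, getFirstChild, getNextSibling) and rebuilds it in a document created by the default JAXP parser. The class name DomCopier and the method copyToStandardDom are made up for illustration, and using JAXP here is my own assumption, not something JTidy or Xalan prescribes. I have not verified that XercesLiaison accepts the result; that presumably depends on the JAXP parser actually being Xerces.

```java
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NamedNodeMap;
import org.w3c.dom.Node;

import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;

// Hypothetical helper: copies a foreign DOM (e.g. the one JTidy returns)
// into a document owned by the default JAXP DOM implementation.
final class DomCopier {

    static Document copyToStandardDom(Document source) throws ParserConfigurationException {
        Document target = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .newDocument();
        // Copy every top-level node; assumes the source has a single root element.
        for (Node child = source.getFirstChild(); child != null; child = child.getNextSibling()) {
            copyNode(child, target, target);
        }
        return target;
    }

    private static void copyNode(Node src, Document factory, Node parent) {
        switch (src.getNodeType()) {
            case Node.ELEMENT_NODE: {
                Element copy = factory.createElement(src.getNodeName());
                NamedNodeMap attrs = src.getAttributes();
                for (int i = 0; attrs != null && i < attrs.getLength(); i++) {
                    Node attr = attrs.item(i);
                    String value = attr.getNodeValue();
                    copy.setAttribute(attr.getNodeName(), value == null ? "" : value);
                }
                parent.appendChild(copy);
                for (Node c = src.getFirstChild(); c != null; c = c.getNextSibling()) {
                    copyNode(c, factory, copy);
                }
                break;
            }
            case Node.TEXT_NODE:
            case Node.CDATA_SECTION_NODE:
                // A Document node cannot hold text directly, so skip top-level text.
                if (parent.getNodeType() != Node.DOCUMENT_NODE) {
                    String text = src.getNodeValue();
                    parent.appendChild(factory.createTextNode(text == null ? "" : text));
                }
                break;
            case Node.COMMENT_NODE:
                parent.appendChild(factory.createComment(src.getNodeValue()));
                break;
            default:
                // Doctype, processing instructions etc. are dropped in this sketch.
                break;
        }
    }
}
```

With this, the call would become processor.process(new XSLTInputSource(DomCopier.copyToStandardDom(doc)), ...), everything else unchanged. CDATA sections are flattened to plain text nodes, which should not matter for collecting links.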
Hmm, can any of you, especially the developers of this tool / library,
tell me what to do?