[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: well-formedness error




At 08:49 PM 6/16/2004, you wrote:
* James Robertson wrote:
>Yeah, it's a choice :) We can be lenient and have users, or be strict and
>not have any.  I suppose the latter course would be easier on me as a
>developer, but then again - as a user of my own software, I also want to
>read content that might not strictly conform to all the specs.

This is not about strict conformance to all the specifications, but
rather well-formedness. If a document is not well-formed, it cannot
be processed by general purpose XML tools. Do you think that Atom is
special in this regard or do you think that this is something that
applies to all XML formats? As far as I can tell, common XSLT, XHTML,
XForms, SVG, XML Query, Voice XML, SMIL, SOAP, P3P, MathML, ...
implementations are not really designed to recover from well-formed-
ness errors; they might not choke on all the errors, but generally
a missing quote mark for an attribute value or a missing end tag
would cause these implementations to reject the content, so it does
not seem that unrealistic to me to be "strict" in this regard and
have customers.


In my work with BottomFeeder, I ignore some errors - such as illegal characters and mismatched encoding - and bail on others (open tags). I use the XML Parser that is part of VisualWorks, although I have subclassed it so that I can ignore a raft of recoverable errors. I don't do any regex parsing at all....



<Talk Small and Carry a Big Class Library>
James Robertson, Product Manager, Cincom Smalltalk
http://www.cincomsmalltalk.com/blog/blogView
jarober@xxxxxxxxxxxxxxx