[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: Resources for AtomPub parser validation
Dave Pawson wrote:
> Daniel Jalkut wrote:
> > In particular, I continue to have mind-bending problems
> > interpreting the ins and outs of how particular content
> > needs to be treated in the Atom format that is used by
> > AtomPub. Right now, for instance, I'm trying to learn
> > definitively whether an escaped "&" inside a
> > content type xhtml div should be left alone or converted
> > to an ampersand in the parsed content output.
For content and text constucts with type='xhtml' there is no extra
escaping/unescaping to deal with; you can parse it using a regular XML
parser and you shouldn't do any preprocessing. For type='html', you need to
unescape the whole body exactly once before passing it through your HTML
parser.
- Brian