Package org.jsoup.parser
Contains the HTML parser, tag specifications, and HTML tokeniser.
-
Class Summary Class Description CharacterReader CharacterReader consumes tokens off a string.HtmlTreeBuilder HTML Tree Builder; creates a DOM from Tokens.ParseError A Parse Error records an error in the input HTML that occurs in either the tokenisation or the tree building phase.ParseErrorList A container for ParseErrors.Parser Parses HTML into aDocument
.ParseSettings Controls parser settings, to optionally preserve tag and/or attribute name case.Tag HTML Tag capabilities.TokenQueue A character queue with parsing helpers.XmlTreeBuilder Use theXmlTreeBuilder
when you want to parse XML without any of the HTML DOM rules being applied to the document.