Class HTMLWorker
java.lang.Object
com.itextpdf.text.html.simpleparser.HTMLWorker
- All Implemented Interfaces:
DocListener, ElementListener, SimpleXMLDocHandler, EventListener
Deprecated.
since 5.5.2; please switch to XML Worker instead (this is a separate project)
Old iText class that allows you to convert HTML to PDF.
We've completely rewritten HTML to PDF conversion and we made it a separate project named XML Worker.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate final ChainedPropertiesDeprecated.The current hierarchy chain of tags.protected ParagraphDeprecated.Keeps the content of the current paragraphprotected DocListenerDeprecated.DocListener that will listen to the Elements produced by parsing the HTML.private final ElementFactoryDeprecated.Factory that is able to create iText Element objects.static final StringDeprecated.Key used to store the font provider in the providers map.static final StringDeprecated.Key used to store the image baseurl provider in the providers map.static final StringDeprecated.Key used to store the image processor in the providers map.static final StringDeprecated.Key used to store the image provider in the providers map.static final StringDeprecated.Key used to store the image store in the providers map.private booleanDeprecated.Boolean to keep track of PRE tagsstatic final StringDeprecated.Key used to store the link provider in the providers map.private static LoggerDeprecated.Deprecated.The resulting list of elements.private booleanDeprecated.Boolean to keep track of LI tagsprivate booleanDeprecated.Boolean to keep track of TD and TH tagsprivate booleanDeprecated.Boolean to keep track of TR tags.Deprecated.Map containing providers such as a FontProvider or ImageProvider.protected booleanDeprecated.Indicates if text needs to be skipped.Deprecated.Stack with the Elements that already have been processed.private StyleSheetDeprecated.The object defining all the styles.private final Stack<boolean[]> Deprecated.Stack to keep track of table tags.protected Map<String, HTMLTagProcessor> Deprecated.The map with all the supported tags. -
Constructor Summary
ConstructorsConstructorDescriptionHTMLWorker(DocListener document) Deprecated.Creates a new instance of HTMLWorkerHTMLWorker(DocListener document, Map<String, HTMLTagProcessor> tags, StyleSheet style) Deprecated.Creates a new instance of HTMLWorker -
Method Summary
Modifier and TypeMethodDescriptionbooleanDeprecated.Signals that anElementwas added to theDocument.voidDeprecated.Flushes the current paragraph, indicating that we're starting a new block.voidclose()Deprecated.Signals that theDocumentwas closed and that no otherElementswill be added.createCell(String tag) Deprecated.Creates a Cell.createChunk(String content) Deprecated.Creates a Chunk using the factory.createImage(Map<String, String> attrs) Deprecated.Creates an Image object.createLineSeparator(Map<String, String> attrs) Deprecated.Creates a LineSeparator object.createList(String tag) Deprecated.Creates a List object.Deprecated.Creates a ListItem object.Deprecated.Creates a Paragraph using the factory.voidDeprecated.Called after the document is parsed.voidendElement(String tag) Deprecated.Called when an end tag is found.voidDeprecated.Stacks the current paragraph, indicating that we're starting a new span.Deprecated.use getProviders() insteadbooleanDeprecated.booleanDeprecated.booleanDeprecated.booleanDeprecated.booleanDeprecated.voidnewLine()Deprecated.Adds a new line to the currentParagraph.booleannewPage()Deprecated.Signals that an new page has to be started.voidopen()Deprecated.Signals that theDocumenthas been opened and thatElementscan be added.voidDeprecated.Parses content read from a java.io.Reader object.parseToList(Reader reader, StyleSheet style) Deprecated.Parses an HTML source to a List of Element objectsparseToList(Reader reader, StyleSheet style, HashMap<String, Object> providers) Deprecated.Parses an HTML source to a List of Element objectsparseToList(Reader reader, StyleSheet style, Map<String, HTMLTagProcessor> tags, HashMap<String, Object> providers) Deprecated.Parses an HTML source to a List of Element objectsvoidDeprecated.Pops the values of pendingTR and pendingTD from a state stack.voidprocessImage(Image img, Map<String, String> attrs) Deprecated.Processes an Image.voidDeprecated.Adds a link to the current paragraph.voidDeprecated.Fetches the List from the Stack and adds it to the TextElementArray on top of the Stack, or to the Document if the Stack is empty.voidDeprecated.Looks for the List object on the Stack, and adds the ListItem to the List.voidDeprecated.Gets the TableWrapper from the Stack and adds a new row.voidDeprecated.Processes the Table.voidDeprecated.Pushes the values of pendingTR and pendingTD to a state stack.voidpushToStack(Element element) Deprecated.Pushes an element to the Stack.voidDeprecated.Sets the page number to 0.voidsetInsidePRE(boolean insidePRE) Deprecated.voidsetInterfaceProps(HashMap<String, Object> providers) Deprecated.use setProviders() insteadbooleansetMarginMirroring(boolean marginMirroring) Deprecated.Parameter that allows you to do left/right margin mirroring (odd/even pages)booleansetMarginMirroringTopBottom(boolean marginMirroring) Deprecated.Parameter that allows you to do top/bottom margin mirroring (odd/even pages)booleansetMargins(float marginLeft, float marginRight, float marginTop, float marginBottom) Deprecated.Sets the margins.voidsetPageCount(int pageN) Deprecated.Sets the page number.booleansetPageSize(Rectangle pageSize) Deprecated.Sets the pagesize.voidsetPendingLI(boolean pendingLI) Deprecated.voidsetPendingTD(boolean pendingTD) Deprecated.voidsetPendingTR(boolean pendingTR) Deprecated.voidsetProviders(Map<String, Object> providers) Deprecated.Setter for the providers.voidsetSkipText(boolean skipText) Deprecated.voidsetStyleSheet(StyleSheet style) Deprecated.Setter for the StyleSheetvoidsetSupportedTags(Map<String, HTMLTagProcessor> tags) Deprecated.Sets the map with supported tags.voidDeprecated.Called when the document starts to be parsed.voidstartElement(String tag, Map<String, String> attrs) Deprecated.Called when a start tag is found.voidDeprecated.Called when a text element is found.voidupdateChain(String tag) Deprecated.Updates the chain by removing a tag.voidupdateChain(String tag, Map<String, String> attrs) Deprecated.Updates the chain with a new tag and new attributes.
-
Field Details
-
LOGGER
Deprecated. -
document
Deprecated.DocListener that will listen to the Elements produced by parsing the HTML. This can be a com.lowagie.text.Document adding the elements to a Document directly, or an HTMLWorker instance strong the objects in a List -
tags
Deprecated.The map with all the supported tags.- Since:
- 5.0.6
-
style
-
stack
-
currentParagraph
Deprecated.Keeps the content of the current paragraph- Since:
- iText 5.0.6 (private => protected)
-
chain
-
IMG_PROVIDER
Deprecated.Key used to store the image provider in the providers map.- Since:
- 5.0.6
- See Also:
-
IMG_PROCESSOR
Deprecated.Key used to store the image processor in the providers map.- Since:
- 5.0.6
- See Also:
-
IMG_STORE
Deprecated.Key used to store the image store in the providers map.- Since:
- 5.0.6
- See Also:
-
IMG_BASEURL
Deprecated.Key used to store the image baseurl provider in the providers map.- Since:
- 5.0.6
- See Also:
-
FONT_PROVIDER
Deprecated.Key used to store the font provider in the providers map.- Since:
- 5.0.6
- See Also:
-
LINK_PROVIDER
Deprecated.Key used to store the link provider in the providers map.- Since:
- 5.0.6
- See Also:
-
providers
-
factory
Deprecated.Factory that is able to create iText Element objects.- Since:
- 5.0.6
-
tableState
-
pendingTR
private boolean pendingTRDeprecated.Boolean to keep track of TR tags. -
pendingTD
private boolean pendingTDDeprecated.Boolean to keep track of TD and TH tags -
pendingLI
private boolean pendingLIDeprecated.Boolean to keep track of LI tags -
insidePRE
private boolean insidePREDeprecated.Boolean to keep track of PRE tags- Since:
- 5.0.6 renamed from isPRE
-
skipText
protected boolean skipTextDeprecated.Indicates if text needs to be skipped.- Since:
- iText 5.0.6 (private => protected)
-
objectList
-
-
Constructor Details
-
HTMLWorker
Deprecated.Creates a new instance of HTMLWorker- Parameters:
document- A class that implementsDocListener
-
HTMLWorker
Deprecated.Creates a new instance of HTMLWorker- Parameters:
document- A class that implementsDocListenertags- A map containing the supported tagsstyle- A StyleSheet- Since:
- 5.0.6
-
-
Method Details
-
setSupportedTags
Deprecated.Sets the map with supported tags.- Parameters:
tags-- Since:
- 5.0.6
-
setStyleSheet
Deprecated.Setter for the StyleSheet- Parameters:
style- the StyleSheet
-
parse
Deprecated.Parses content read from a java.io.Reader object.- Parameters:
reader- the content- Throws:
IOException
-
startDocument
public void startDocument()Deprecated.Description copied from interface:SimpleXMLDocHandlerCalled when the document starts to be parsed.- Specified by:
startDocumentin interfaceSimpleXMLDocHandler- See Also:
-
startElement
Deprecated.Description copied from interface:SimpleXMLDocHandlerCalled when a start tag is found.- Specified by:
startElementin interfaceSimpleXMLDocHandler- Parameters:
tag- the tag nameattrs- the tag's attributes- See Also:
-
text
Deprecated.Description copied from interface:SimpleXMLDocHandlerCalled when a text element is found.- Specified by:
textin interfaceSimpleXMLDocHandler- Parameters:
content- the text element, probably a fragment.- See Also:
-
endElement
Deprecated.Description copied from interface:SimpleXMLDocHandlerCalled when an end tag is found.- Specified by:
endElementin interfaceSimpleXMLDocHandler- Parameters:
tag- the tag name- See Also:
-
endDocument
public void endDocument()Deprecated.Description copied from interface:SimpleXMLDocHandlerCalled after the document is parsed.- Specified by:
endDocumentin interfaceSimpleXMLDocHandler- See Also:
-
newLine
public void newLine()Deprecated.Adds a new line to the currentParagraph.- Since:
- 5.0.6
-
carriageReturn
Deprecated.Flushes the current paragraph, indicating that we're starting a new block. If the stack is empty, the paragraph is added to the document. Otherwise the Paragraph is added to the stack.- Throws:
DocumentException- Since:
- 5.0.6
-
flushContent
public void flushContent()Deprecated.Stacks the current paragraph, indicating that we're starting a new span.- Since:
- 5.0.6
-
pushToStack
Deprecated.Pushes an element to the Stack.- Parameters:
element-- Since:
- 5.0.6
-
updateChain
-
updateChain
Deprecated.Updates the chain by removing a tag.- Parameters:
tag- the new tag- Since:
- 5.0.6
-
setProviders
-
createChunk
-
createParagraph
Deprecated.Creates a Paragraph using the factory.- Returns:
- a Paragraph without any content
- Since:
- 5.0.6
-
createList
-
createListItem
Deprecated.Creates a ListItem object.- Returns:
- a ListItem object
- Since:
- 5.0.6
-
createLineSeparator
Deprecated.Creates a LineSeparator object.- Parameters:
attrs- properties of the LineSeparator- Returns:
- a LineSeparator object
- Since:
- 5.0.6
-
createImage
Deprecated.Creates an Image object.- Parameters:
attrs- properties of the Image- Returns:
- an Image object (or null if the Image couldn't be found)
- Throws:
DocumentExceptionIOException- Since:
- 5.0.6
-
createCell
Deprecated.Creates a Cell.- Parameters:
tag- the tag- Returns:
- a CellWrapper object
- Since:
- 5.0.6
-
processLink
public void processLink()Deprecated.Adds a link to the current paragraph.- Since:
- 5.0.6
-
processList
Deprecated.Fetches the List from the Stack and adds it to the TextElementArray on top of the Stack, or to the Document if the Stack is empty.- Throws:
DocumentException- Since:
- 5.0.6
-
processListItem
Deprecated.Looks for the List object on the Stack, and adds the ListItem to the List.- Throws:
DocumentException- Since:
- 5.0.6
-
processImage
-
processTable
-
processRow
public void processRow()Deprecated.Gets the TableWrapper from the Stack and adds a new row.- Since:
- 5.0.6
-
pushTableState
public void pushTableState()Deprecated.Pushes the values of pendingTR and pendingTD to a state stack.- Since:
- 5.0.6
-
popTableState
public void popTableState()Deprecated.Pops the values of pendingTR and pendingTD from a state stack.- Since:
- 5.0.6
-
isPendingTR
public boolean isPendingTR()Deprecated.- Returns:
- the pendingTR
- Since:
- 5.0.6
-
setPendingTR
public void setPendingTR(boolean pendingTR) Deprecated.- Parameters:
pendingTR- the pendingTR to set- Since:
- 5.0.6
-
isPendingTD
public boolean isPendingTD()Deprecated.- Returns:
- the pendingTD
- Since:
- 5.0.6
-
setPendingTD
public void setPendingTD(boolean pendingTD) Deprecated.- Parameters:
pendingTD- the pendingTD to set- Since:
- 5.0.6
-
isPendingLI
public boolean isPendingLI()Deprecated.- Returns:
- the pendingLI
- Since:
- 5.0.6
-
setPendingLI
public void setPendingLI(boolean pendingLI) Deprecated.- Parameters:
pendingLI- the pendingLI to set- Since:
- 5.0.6
-
isInsidePRE
public boolean isInsidePRE()Deprecated.- Returns:
- the insidePRE
- Since:
- 5.0.6
-
setInsidePRE
public void setInsidePRE(boolean insidePRE) Deprecated.- Parameters:
insidePRE- the insidePRE to set- Since:
- 5.0.6
-
isSkipText
public boolean isSkipText()Deprecated.- Returns:
- the skipText
- Since:
- 5.0.6
-
setSkipText
public void setSkipText(boolean skipText) Deprecated.- Parameters:
skipText- the skipText to set- Since:
- 5.0.6
-
parseToList
Deprecated.Parses an HTML source to a List of Element objects- Parameters:
reader- the HTML sourcestyle- a StyleSheet object- Returns:
- a List of Element objects
- Throws:
IOException
-
parseToList
public static List<Element> parseToList(Reader reader, StyleSheet style, HashMap<String, Object> providers) throws IOExceptionDeprecated.Parses an HTML source to a List of Element objects- Parameters:
reader- the HTML sourcestyle- a StyleSheet objectproviders- map containing classes with extra info- Returns:
- a List of Element objects
- Throws:
IOException
-
parseToList
public static List<Element> parseToList(Reader reader, StyleSheet style, Map<String, HTMLTagProcessor> tags, HashMap<String, Object> providers) throws IOExceptionDeprecated.Parses an HTML source to a List of Element objects- Parameters:
reader- the HTML sourcestyle- a StyleSheet objecttags- a map containing supported tags and their processorsproviders- map containing classes with extra info- Returns:
- a List of Element objects
- Throws:
IOException- Since:
- 5.0.6
-
add
Deprecated.Description copied from interface:ElementListenerSignals that anElementwas added to theDocument.- Specified by:
addin interfaceElementListener- Parameters:
element- a high level object- Returns:
trueif the element was added,falseif not.- Throws:
DocumentException- when a document isn't open yet, or has been closed- See Also:
-
close
public void close()Deprecated.Description copied from interface:DocListenerSignals that theDocumentwas closed and that no otherElementswill be added.The outputstream of every writer implementing
DocListenerwill be closed.- Specified by:
closein interfaceDocListener- See Also:
-
newPage
public boolean newPage()Deprecated.Description copied from interface:DocListenerSignals that an new page has to be started.- Specified by:
newPagein interfaceDocListener- Returns:
trueif the page was added,falseif not.- See Also:
-
open
public void open()Deprecated.Description copied from interface:DocListenerSignals that theDocumenthas been opened and thatElementscan be added.- Specified by:
openin interfaceDocListener- See Also:
-
resetPageCount
public void resetPageCount()Deprecated.Description copied from interface:DocListenerSets the page number to 0.- Specified by:
resetPageCountin interfaceDocListener- See Also:
-
setMarginMirroring
public boolean setMarginMirroring(boolean marginMirroring) Deprecated.Description copied from interface:DocListenerParameter that allows you to do left/right margin mirroring (odd/even pages)- Specified by:
setMarginMirroringin interfaceDocListener- Parameters:
marginMirroring-- Returns:
- true if successful
- See Also:
-
setMarginMirroringTopBottom
public boolean setMarginMirroringTopBottom(boolean marginMirroring) Deprecated.Description copied from interface:DocListenerParameter that allows you to do top/bottom margin mirroring (odd/even pages)- Specified by:
setMarginMirroringTopBottomin interfaceDocListener- Parameters:
marginMirroring-- Returns:
- true if successful
- Since:
- 2.1.6
- See Also:
-
setMargins
public boolean setMargins(float marginLeft, float marginRight, float marginTop, float marginBottom) Deprecated.Description copied from interface:DocListenerSets the margins.- Specified by:
setMarginsin interfaceDocListener- Parameters:
marginLeft- the margin on the leftmarginRight- the margin on the rightmarginTop- the margin on the topmarginBottom- the margin on the bottom- Returns:
- a
boolean - See Also:
-
setPageCount
public void setPageCount(int pageN) Deprecated.Description copied from interface:DocListenerSets the page number.- Specified by:
setPageCountin interfaceDocListener- Parameters:
pageN- the new page number- See Also:
-
setPageSize
Deprecated.Description copied from interface:DocListenerSets the pagesize.- Specified by:
setPageSizein interfaceDocListener- Parameters:
pageSize- the new pagesize- Returns:
- a
boolean - See Also:
-
setInterfaceProps
Deprecated.use setProviders() insteadSets the providers. -
getInterfaceProps
Deprecated.use getProviders() insteadGets the providers
-