Not able to identify escaped/unescaped html entity in the text nodes #2206
Closed as not planned
Description
Not able to identify whether the input document has &
or &
in the text node, since Jsoup escapes the character in text node. Same goes to other entities like <
/<
.
This does not provide any control to the Jsoup users where they can take any action based on input. For example; If we want to remove <
character in text node but preserve when given as entity <
Note: Please let me know if there is already a way to differentiate this.
Providing an option where I could input Jsoup to not modify the text node will be super helpful. This provides more flexibility and control to the customers.
Metadata
Assignees
Labels
No labels