any23-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "ASF GitHub Bot (JIRA)" <>
Subject [jira] [Commented] (ANY23-324) Replace net.sourceforge.nekohtml with jsoup
Date Wed, 24 Jan 2018 06:13:00 GMT


ASF GitHub Bot commented on ANY23-324:

Github user asfgit closed the pull request at:

> Replace net.sourceforge.nekohtml with jsoup 
> --------------------------------------------
>                 Key: ANY23-324
>                 URL:
>             Project: Apache Any23
>          Issue Type: Improvement
>          Components: core
>            Reporter: Lewis John McGibbney
>            Priority: Major
>             Fix For: 2.2
> A long standing issue relates to the performance of the existing default [|].
There are a number of issues which now relate to limitations in the way nekohtml parses HTML5
for example [ANY23-317|], [ANY23-273|],
[ANY23-267|]... there are several others.
> I propose to @Deprecate the implementation for the next release (possibly
making it configurable via I also propose to replace it
with AFAIK, Apache Tika also did this several years ago.

This message was sent by Atlassian JIRA

View raw message