Tags · vuamitom/goose · GitHub

Tags: vuamitom/goose

Tags

2.1.6

version 2.1.6

2.1.4

Version 2.1.4

2.1.2

version 2.1.2

2.0.2

upping to version 2.0.2

2.0.1

MINOR: RE-enabling Additional Data Extraction. Upping to version 2.0.1

1.4.1-FINALJAVA

Final release of the Java version

1.4.1

Resolving goofy Maven issue. It required a new version to fully update.

1.4.0

Major: DefaultOutputFormatter#getFormattedText now unescapes HTML including all HTML Entities

Minor: I have begun to convert the usage of DefaultOutputFormatter so that you only use a single method: getFormattedText(Element topNode)

Bug fixes:
  * clean by class name was too restrictive and removed actual content elements, modified the list of names to only remove classes
    that end in "meta" instead of just containing the word "meta"

  * Modified DefaultDocumentCleaner#cleanBadTags to only select from within the body element to avoid removing it.

  * Added a helper method for removing nodes to handle cases where the node's parentNode is null (already removed). This was previously
    throwing an IllegalArgumentException from within jSoup and thus failing the extraction.
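
The first bug fix above narrows class-name cleaning from "contains meta" to "ends in meta". A minimal sketch of that difference follows. The class `MetaClassFilter`, its method names, and both regular expressions are hypothetical illustrations, not taken from the goose source, and the sketch checks a single class-name token rather than a full class attribute.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only: not the actual goose cleaner code.
final class MetaClassFilter {
    // Assumed old behaviour: any class name containing "meta" is removed.
    private static final Pattern CONTAINS_META =
            Pattern.compile("meta", Pattern.CASE_INSENSITIVE);

    // Assumed new behaviour: only class names ending in "meta" are removed.
    private static final Pattern ENDS_WITH_META =
            Pattern.compile("meta$", Pattern.CASE_INSENSITIVE);

    static boolean matchesOld(String className) {
        return CONTAINS_META.matcher(className).find();
    }

    static boolean matchesNew(String className) {
        return ENDS_WITH_META.matcher(className).find();
    }

    public static void main(String[] args) {
        String[] names = {"post-meta", "metadata-body", "article-content"};
        for (String name : names) {
            System.out.println(name
                    + " old=" + matchesOld(name)
                    + " new=" + matchesNew(name));
        }
    }
}
```

Under these assumptions a name such as `metadata-body` would be removed by the broader pattern but kept by the narrower one, which is the kind of content element the fix describes preserving.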

1.3.14

Version 1.3.14

1.3.13

upping to version 1.3.13 that contains a minor fix to tag extraction
