About 7 results
Open links in new tab
  1. Impact of log4j CVE-2021-44228 on heritrix3? #522 - GitHub

    Dec 10, 2021 · This is an issue to track the impact of a recent log4j remote exploit (CVE-2021-44228) in the context of heritrix3. My brief read of the situation is that log4j versions 2.0.x …

  2. Re-instate biblio.com affiliate link (s) · Issue #960 - GitHub

    May 11, 2018 · Some of them seemed to simply exploit OL links to harvest user data/drop cookies, etc. Not sure if that was the case with biblio. There ought to be a less invasive sort of …

  3. Summarize web archive capture index (CDX) files. - GitHub

    Summarize web archive capture index (CDX) files. Contribute to internetarchive/cdx-summary development by creating an account on GitHub.

  4. Improve OCR quality · Issue #348 · internetarchive/openlibrary

    Oct 11, 2016 · ebook edition quality (generally terrible, but not usually due to the underlying technology) and devising strategies to improve it, based on some of the work done at the …

  5. Different results when querying API by ISBN vs LCCN #282

    It's an aside, but I've actually done a bunch of work both measuring IA OCR ebook edition quality (generally terrible, but not usually due to the underlying technology) and devising strategies to …

  6. Improve markup accessibility: Aria, Schema.org, fb open graph

    May 28, 2018 · Corporate actors are duty bound to exploit any information they get in the interest of their shareholders, we couldn't expect them not to advertise based on what they can learn …

  7. ImportBot importing titlepage instead of cover #2147

    May 24, 2019 · Of course we should not be deceptive: other sites indicate this with "Other editions" or "Similar items" Good to know. Perhaps a more informative name, to more …